Based on study of current video transcoding solutions, we proposed a distributed transcoding system. Video resources are stored in HDFS(Hadoop Distributed File System) and transcoded by MapReduce program using FFMPEG. In this paper, video segmentation strategy on distributed storage and how they affect accessing time are discussed. We also defined metadata of video formats and transcoding parameters. The distributed transcoding framework is proposed on basis of MapReduce programming model. Segmented source videos are transcoding in map tasks and merged into target video in reduce task. Experimental results show that transcoding time is dependent on segmentation size and trascoding cluster size. Compared with single PC, the proposed distributed video transcoding system implemented on 8 PCs can decrease about 80% of the transcoding time.
1 Ahmad I, Wei XH, Sun Y, Zhang YQ. Video Transcoding: AnOverview of Various Techniques and Research Issues. IEEETrans. on Multimedia, 2005,7(5).
2 杨戈,廖建新,朱晓民.流媒体分发系统关键技术综述.电子学报,2009,37(1):137-145.
3 Barlas G. A Taxonomy and DLT-based analysis of ClusterbasedVideo Trans/Encoding. 14th Euromicro InternationalConference on Parallel, Distributed, and Network-BasedProcessing, PDP 2006,388-395.
4 Cardellini V, Colajanni M, Lancellotti R, Yu PS. A distributedarchitecture of edge proxy servers for cooperative transcoding.The 3rd IEEE Workshop on Internet Applications,2003,66-70.
5 Guo JN, Bhuyan L. Load Sharing in a Transcoding Cluster.Distributed Computing, 2003,835.
6 Sambe Y, Watanabe S, Yu D, Nakamura T. Distributed videotranscoding and its application to grid delivery. Proc.ITC-CSCC2003, 2003, 921-924.
7 Grio African American Breaking News and Opinion,http://www.thegrio.com/.
8 The Apache Hadoop project. http://Hadoop. apache. org/.
9 Ghemawat S, Gobioff H, Leung ST. The Google File System.19th Symposium on Operating Systems Principles. LakeGeorge, New York, 2003,29,43.
10 Hadoop Makes Sense of Lots of Data. http://www.enterprisestorageforum.com/article.php/3890191/Hadoop-Makes-Sense-of-Lots-of-Data.htm
11 Borthakur D. The Hadoop Distributed File System:Architecture and Design. http://hadoop.apache.org/core/docs/current/hdfs_design.html, 2007
12 Dean J, Ghemawat S. MapReduce: Simplied Data Processingon Large Clusters. Proc. of OSDI’04: 6th Symposiumon Operating System Design and Implemention, SanFrancisco, CA, Dec. 2004.
13 Bellard F, FFMPEG multimedia system. http://FFMPEG.sourceforge.net/index.php
14 Mathes A. Folksonomies-cooperative classification andcommunication through shared metadata. Computer MediatedCommunication, LIS590CMC (Doctoral Seminar).Graduate School of Library and Information Science,University of Illinois Urbana-Champaign, December 2004.
15 Matsuyama K, Kraus M, Kitagawa K, Saito N. A Path-BasedRDF Query Language for CC/PP and UAProf. Proc. of the2nd IEEE Annual Conference on Pervasive Computing andCommunications Workshops (PERCOMW’04), 2004.