Methods of Dealing With Massive Small Files in Hadoop

WeChat

Mobile website

Home > Archive>Volume 24, Issue 11, 2015 >157-161

Methods of Dealing With Massive Small Files in Hadoop
DOI:
                        
CSTR:
                        [cstr]
                    
Author:
                        
Affiliation:
Clc Number:
Fund Project:

Article

Figures

Metrics

Reference

Cited by

Materials

Comments

Abstract:

HDFS provides the underlying storage for Hadoop, however, the HDFS deals with massive small files inefficiently and decreases system performance seriously. To solve this problem, we designed a file merging, indexing and retrieval solution. Then through a series of experiments compared to the original HDFS and HAR solution, it can be shown that our scheme can effectively reduce the memory usage of Namenode and improve the I/O performance of HDFS.

Reference

Cited by

Get Citation

李旭,李长云,张清清,胡淑新,周玲芳. Hadoop中处理海量小文件的方法.计算机系统应用,2015,24(11):157-161

Copy

Article Metrics

Abstract:
PDF:
HTML:
Cited by:

History

Received:March 08,2015
Revised:May 18,2015
Adopted:
Online: December 03,2015
Published:

Article QR Code

You are the firstVisitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-3
Address：4# South Fourth Street, Zhongguancun,Haidian, Beijing,Postal Code：100190
Phone：010-62661041 Fax： Email：csa (a) iscas.ac.cn
Technical Support：Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063