Research on Large Scale Data Loading Based on HBase

doi:10.15888/j.cnki.csa.005194

Volume 25, Issue 6, 2016, pp. 231-237

Abstract:

The distributed database HBase has clear advantages over traditional relational databases in large-scale data loading, but it still leaves considerable room for optimization. We build an HBase environment on the Hadoop distributed platform and optimize a custom data loading algorithm. First, this paper analyzes HBase's underlying data storage, and experiments show that HBase's existing data loading methods fall short in efficiency and flexibility. It then proposes a custom parallel data loading algorithm and optimizes the cluster. The experimental results show that the optimized custom parallel data loading method makes full use of cluster performance and offers good loading efficiency and data manipulation capability.

Citation:

He Zhenghong, Zhou Ya, Wen Diyao, Wu Qingxia. Research on Large Scale Data Loading Based on HBase. Computer Systems & Applications, 2016, 25(6): 231-237

History

Received: October 19, 2015
Revised: November 25, 2015
Online: June 14, 2016

Copyright: Institute of Software, Chinese Academy of Sciences