基于热度的Hadoop快速副本复制算法

AIPUB归智期刊联盟

微信公众号

网站二维码

2025年4月26日 2:26 星期六

首页 > 过刊浏览>2015年第24卷第9期 >146-151

PDF HTML阅读 XML下载导出引用引用提醒

基于热度的Hadoop快速副本复制算法
DOI:
                        
                    
CSTR:
                        
                    
作者:
                        张倩张倩
中国科学技术大学 自动化系, 合肥 230027
在期刊界中查找
在百度中查找
在本站中查找
郑烇郑烇
中国科学技术大学 自动化系, 合肥 230027
在期刊界中查找
在百度中查找
在本站中查找
王嵩王嵩
中国科学技术大学 自动化系, 合肥 230027
在期刊界中查找
在百度中查找
在本站中查找

                    
作者单位:
作者简介:
通讯作者:
中图分类号:
基金项目:国家自然科学基金(61174062)

Rapid Replica Copy Algorithm Based on Popularity in Hadoop

Author:

ZHANG Qian
ZHANG Qian
Department of Automation, University of Science and Technology of China, Hefei 230027, China
在期刊界中查找
在百度中查找
在本站中查找
ZHENG Quan
ZHENG Quan
Department of Automation, University of Science and Technology of China, Hefei 230027, China
在期刊界中查找
在百度中查找
在本站中查找
WANG Song
WANG Song
Department of Automation, University of Science and Technology of China, Hefei 230027, China
在期刊界中查找
在百度中查找
在本站中查找

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

在云存储中心, 由于节点失效带来的文件数据块副本丢失不仅会影响系统的可靠性, 还会影响文件的并发访问效率. 针对Hadoop中默认的副本复制方法存在的问题, 即副本复制过程某些节点数据传输过于集中, 负载不均衡, 磁盘I/O吞吐率低, 提出一种基于热度的快速副本复制算法. 该算法优先复制热度高的数据块, 合理选择数据块复制的源节点和目的节点. 仿真结果表明, 该算法平衡了系统的工作负载, 提高了磁盘I/O吞吐率, 显著降低用户请求平均响应时间.

关键词:云存储;节点失效;Hadoop;副本复制;热点

Abstract:

In cloud storage centers, replica of file may be lost because of the failure of nodes, which will affect the reliability of system, as well as the efficiency of file concurrent access. There are some deficiencies in the default replica copy algorithm in Hadoop, such as a concentration of data transfer process on a few DataNodes, load imbalance, low disk I/O throughput. To address this issue, this paper proposes a rapid replica copy algorithm based on popularity in Hadoop. It handles the popular block firstly, and chooses source and destination DataNodes properly. The simulation results show that the proposed algorithm improves the disk I/O throughput, load balance, and reduces average service response time significantly.

Key words:cloud storage;node failure;Hadoop;replica copy;popularity

引用本文

张倩,郑烇,王嵩.基于热度的Hadoop快速副本复制算法.计算机系统应用,2015,24(9):146-151

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2014-12-30
最后修改日期:2015-02-02
录用日期:
在线发布日期: 2015-09-14
出版日期:

微信公众号

网站二维码

引用本文

分享

文章指标

历史

文章二维码

微信公众号

网站二维码

引用本文

分享

微信扫一扫：分享

文章指标

历史

文章二维码