本文已被:浏览 1647次 下载 3703次
Received:March 11, 2014 Revised:April 08, 2014
Received:March 11, 2014 Revised:April 08, 2014
中文摘要: HDFS文件系统通过多副本备份的方式解决数据损坏或丢失的问题, 但是随着存储系统内容增多, 在数据量级很大的时候, 这种容灾方案消耗的额外存储空间是实际存储内容的数倍, 不利于系统资源长期积累. 文章提出使用纠删码编/解码文件代替HDFS的副本备份容灾策略, 在保证数据安全性的前提下大大提高了存储空间利用率, 降低存储额外消耗.
Abstract:Through the multiple-backup strategy HDFS can restore data easily when data is damaged or missed. However, the data stored in system increases all the time. When the data scale has become very big, the strategy will need several times of storage space to store the backup data. This article proposes to use erasure codes to replace the multiple-backup strategy, which can greatly improve the storage efficiency and reduce extra storage expend.
keywords: HDFS erasure code multiple-backup
文章编号: 中图分类号: 文献标志码:
基金项目:
引用文本:
卞艺杰,马瑞敏,李亚冰,吴慧.基于纠删码的HDFS存储方案.计算机系统应用,2014,23(11):208-213
BIAN Yi-Jie,MA Rui-Min,LI Ya-Bing,WU Hui.HDFS Storage Solutions Based on Erasure Codes.COMPUTER SYSTEMS APPLICATIONS,2014,23(11):208-213
卞艺杰,马瑞敏,李亚冰,吴慧.基于纠删码的HDFS存储方案.计算机系统应用,2014,23(11):208-213
BIAN Yi-Jie,MA Rui-Min,LI Ya-Bing,WU Hui.HDFS Storage Solutions Based on Erasure Codes.COMPUTER SYSTEMS APPLICATIONS,2014,23(11):208-213