Fault Self-Recovery Algorithm for Management Node in Cloud Storage System

doi:10.15888/j.cnki.csa.005568

AIPUB归智期刊联盟

WeChat

Mobile website

2025-4-26- 16

Home > Archive>Volume 26, Issue 2, 2017 >112-117. DOI:10.15888/j.cnki.csa.005568

PDF HTML XML Export Cite reminder

Fault Self-Recovery Algorithm for Management Node in Cloud Storage System
DOI:
                        10.15888/j.cnki.csa.005568
                    
CSTR:
                        [cstr]
                    
Author:
                        MA Wei-JunMA Wei-Jun
Institute of Field Engineering, PLA University of Science and Technology, Nanjing 210014, China
Find this author on All Journals
Find this author on BaiDu
Search for this author on this site
WANG QiangWANG Qiang
Institute of Field Engineering, PLA University of Science and Technology, Nanjing 210014, China
Find this author on All Journals
Find this author on BaiDu
Search for this author on this site
HE Xiao-HuiHE Xiao-Hui
Institute of Field Engineering, PLA University of Science and Technology, Nanjing 210014, China
Find this author on All Journals
Find this author on BaiDu
Search for this author on this site
ZHANG ShuZHANG Shu
Western Theater Air Meteorological Center, Chengdu 610000, China
Find this author on All Journals
Find this author on BaiDu
Search for this author on this site
ZHANG QingZHANG Qing
Eastern Theater Air Meteorological Center, Nanjing 210018, China
Find this author on All Journals
Find this author on BaiDu
Search for this author on this site

                    
Affiliation:
Clc Number:
Fund Project:

Article

Figures

Metrics

Reference

Cited by

Materials

Comments

Abstract:

In order to solve the storage service unavailable problem on account of the management node fault in huge cloud storage system, an analysis model for fault effect of management node is built and a dynamic self-recovery algorithm for management node based on message called FRA-M is presented. FRA-M implements the cooperation, transparent take-over and self-recovery of management nodes by metadata update control based on load balance. Experiment shows FRA-M can provide management nodes auto switching when fault occurs and achieve good load balance by favorable resource allocation. The performance of FRA-M is also maintained in a relatively stable interval by reasonable control of TCP timeout, fault detection cycle and fault detection timeout. The storage service availability, data usability and data reliability are guaranteed by FRA-M during the breakdown of management nodes.

Key words:cloud storage system;management node;self-recovery;metadata;load balancing;dynamic switching

Get Citation

马玮骏,王强,何晓晖,张舒,张庆.云存储系统管理节点故障自恢复算法.计算机系统应用,2017,26(2):112-117

Copy

Article Metrics

Abstract:
PDF:
HTML:
Cited by:

History

Received:May 10,2016
Revised:June 30,2016
Adopted:
Online: February 15,2017
Published:

Article QR Code

You are the firstVisitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-3
Address：4# South Fourth Street, Zhongguancun,Haidian, Beijing,Postal Code：100190
Phone：010-62661041 Fax： Email：csa (a) iscas.ac.cn
Technical Support：Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063