MapReduce Job Scheduling in Hybrid Storage Modes

doi:10.15888/j.cnki.csa.008998


Volume 32, Issue 3, 2023, pp. 70-85


                    
Authors:
YANG Zhen-Yu, NIU Tian-Yang, LYU Min
School of Computer Science and Technology, University of Science and Technology of China, Hefei 230022, China

                    

Abstract:

In heterogeneous Hadoop clusters, the mixed use of erasure-coded and replicated storage, together with differences in the real-time computing capability of server nodes, lowers the efficiency of MapReduce job processing. To address this problem, this study implements a scheduling strategy that dynamically adjusts MapReduce job assignment under multi-job concurrency according to where the data is stored and the real-time load of each node. The strategy modifies the data storage location policy in the current Hadoop framework to dynamically control the number of concurrent tasks on each node, which yields a more balanced allocation of resources among jobs that run concurrently. Experimental results show that, compared with Hadoop's two default job scheduling strategies, the proposed scheduling mode shortens job completion time by about 17% and effectively avoids the starvation that some jobs would otherwise face.
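The abstract does not give the algorithm itself, so the following is only an illustrative sketch of the general idea it describes: cap each node's task concurrency by its current load, prefer nodes that hold a task's data, and hand out tasks one job at a time so that no job starves. Every name here (`Node`, `slot_cap`, `pick_node`, `assign`), the linear load-to-cap rule, and the round-robin order are assumptions made for illustration; they are not the authors' implementation and not part of any Hadoop API.

```python
from dataclasses import dataclass


@dataclass
class Node:
    name: str
    max_slots: int    # assumed upper bound on concurrent tasks for this node
    load: float       # assumed real-time load, normalized to [0, 1]
    running: int = 0  # tasks currently assigned to this node

    def slot_cap(self) -> int:
        # Assumed rule: shrink the cap linearly with load, keep at least one slot.
        return max(1, int(self.max_slots * (1.0 - self.load)))

    def free_slots(self) -> int:
        return max(0, self.slot_cap() - self.running)


def pick_node(nodes, local_names):
    """Return a node with a free slot, preferring nodes that store the task's data."""
    candidates = [n for n in nodes if n.free_slots() > 0]
    if not candidates:
        return None
    # Data-local nodes sort first; ties are broken by the most free slots.
    return min(candidates, key=lambda n: (n.name not in local_names, -n.free_slots()))


def assign(jobs, nodes):
    """Assign one task per job per round until no node has a free slot.

    `jobs` maps a job id to a list of tasks; each task is the set of node
    names that hold its input data (several for replicas, possibly one for
    an erasure-coded block in this simplified model).
    """
    pending = {job: list(tasks) for job, tasks in jobs.items()}
    plan = []
    progressed = True
    while progressed:
        progressed = False
        for job, tasks in pending.items():
            if not tasks:
                continue
            node = pick_node(nodes, tasks[0])
            if node is None:
                continue
            tasks.pop(0)
            node.running += 1
            plan.append((job, node.name))
            progressed = True
    return plan
```

In this simplified model, a heavily loaded node advertises fewer slots, so work drifts toward lightly loaded nodes even when that costs data locality, and the one-task-per-job rounds keep a large job from taking every free slot.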

Key words: MapReduce; job scheduling; erasure code; heterogeneous cluster; hybrid storage; cloud computing; load balance; big data

Citation:

YANG Zhen-Yu, NIU Tian-Yang, LYU Min. MapReduce Job Scheduling in Hybrid Storage Modes. Computer Systems & Applications, 2023, 32(3): 70-85


History

Received: August 9, 2022
Revised: September 15, 2022
Online: December 9, 2022
