Abstract: Iterative computation is an important class of big data analysis applications. When an iterative algorithm is implemented on the distributed computation framework MapReduce, the program is divided into multiple jobs that run in an order determined by the dependencies between them. This leads to many interactions between the program and the distributed file system (DFS), which increase the program's execution time. Caching the data involved in these interactions reduces the interaction time and hence improves the overall performance of the application. Considering that a large amount of memory on cluster nodes is unused most of the time, this paper proposes MemLoop, a programming framework that uses memory caching for iterative applications. MemLoop exploits the free memory on cluster nodes to cache data by implementing memory cache management in three modules: a job-submission API, a task scheduling algorithm, and cache management. The cached data falls into two categories: inter-iteration resident data and intra-iteration dependent data. We compare MemLoop with previous related frameworks; the results show that MemLoop improves the performance of iterative programs.
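The central idea, keeping inter-iteration resident data (input that does not change across iterations) in node memory instead of re-reading it from the DFS on every iteration, can be illustrated with a minimal sketch. All names here (`FakeDFS`, `MemCache`, `iterative_job*`) are hypothetical stand-ins for illustration, not MemLoop's actual API:

```python
class FakeDFS:
    """Stands in for a distributed file system; counts expensive reads."""
    def __init__(self, data):
        self.data = data
        self.reads = 0
    def read(self, path):
        self.reads += 1          # each read models a costly program-DFS interaction
        return self.data[path]

class MemCache:
    """Node-local in-memory cache for data reused across iterations."""
    def __init__(self, dfs):
        self.dfs = dfs
        self.store = {}
    def get(self, path):
        if path not in self.store:   # go to the DFS only on a cache miss
            self.store[path] = self.dfs.read(path)
        return self.store[path]

def iterative_job(dfs, iterations):
    """Naive version: re-reads the invariant input every iteration."""
    rank = 1.0
    for _ in range(iterations):
        links = dfs.read("graph")    # inter-iteration resident data
        rank = 0.85 * rank + 0.15 * len(links)
    return rank

def iterative_job_cached(cache, iterations):
    """Cached version: the invariant input is fetched from the DFS once."""
    rank = 1.0
    for _ in range(iterations):
        links = cache.get("graph")   # served from memory after iteration 1
        rank = 0.85 * rank + 0.15 * len(links)
    return rank

dfs1 = FakeDFS({"graph": [(0, 1), (1, 2), (2, 0)]})
iterative_job(dfs1, 10)
print(dfs1.reads)                    # 10 DFS reads without caching

dfs2 = FakeDFS({"graph": [(0, 1), (1, 2), (2, 0)]})
iterative_job_cached(MemCache(dfs2), 10)
print(dfs2.reads)                    # 1 DFS read with the memory cache
```

The ten-fold reduction in DFS reads is what the memory cache buys; intra-iteration dependent data (output of one job consumed by the next job in the same iteration) would be cached analogously between dependent jobs.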