基于Spark的大数据混合计算模型

微信公众号

网站二维码

首页 > 过刊浏览>2015年第24卷第4期 >214-218

基于Spark的大数据混合计算模型
DOI:
                        
                    
作者:
                        
                        
                    
作者单位:
作者简介:
通讯作者:
中图分类号:
基金项目:

Big Data Hybrid Computing Mode Based on Spark

Author:

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

现实世界大数据应用复杂多样, 可能会同时包含不同特征的数据和计算, 在这种情况下单一的计算模式多半难以满足整个应用的需求, 因此需要考虑不同计算模式的混搭使用. 混合计算模式之集大成者当属UCBerkeley AMPLab的Spark系统, 其涵盖了几乎所有典型的大数据计算模式, 包括迭代计算、批处理计算、内存计算、流式计算(Spark Streaming)、数据查询分析计算(Shark)、以及图计算(GraphX). Spark提供了一个强大的内存计算引擎, 实现了优异的计算性能, 同时还保持与Hadoop平台的兼容性. 因此, 随着系统的不断稳定和成熟, Spark有望成为与Hadoop共存的新一代大数据处理系统和平台. 本文详细研究和分析了Spark生态系统, 建立了基于Spark平台的混合计算模型架构, 并说明通过spark生态系统可以有效地满足大数据混合计算模式的应用.

Abstract:

The use of big data in the real world was complicated. It may contain different characteristic of data and computing. In this case, the single computing mode was mostly difficult to met the application requirements. Therefore we need to consider different computing mode of mix use. The ultimate evolution of hybrid computing mode is spark system which invented by UCBerkeley AMPLab. It covers almost all the typical big data computing mode, including iterative computing, batch computing, memory computing, flow computing (Spark Streaming), data query analysis (Shark), and map computing (GraphX). Spark provides a powerful memory computing engine and implents computing with excellent performance, while maintaining compatibility with the Hadoop platform. Therefore, with the continuous stable and mature, Spark is expected to be colocalized with Hadoop and became a new generation of big data processing systems and platforms. The paper has studied and analyed the Spark ecosystem, and set up the hybrid computing model architecture based on Spark platform, which also has illustrated the spark ecosystem can meet the application of hybrid computing model.

参考文献

相似文献

引证文献

引用本文

胡俊,胡贤德,程家兴.基于Spark的大数据混合计算模型.计算机系统应用,2015,24(4):214-218

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2014-07-19
最后修改日期:2014-08-25
录用日期:
在线发布日期: 2015-04-24
出版日期:

微信公众号

网站二维码

引用本文

分享

文章指标

历史

文章二维码