Research and Realization on Parallel Spatial Join Query Algorithm Based on Open Source RDBMS Cluster

doi:10.15888/j.cnki.csa.005462

AIPUB归智期刊联盟

WeChat

Mobile website

2025-4-3- 19

Home > Archive>Volume 25, Issue 10, 2016 >233-239. DOI:10.15888/j.cnki.csa.005462

PDF HTML XML Export Cite reminder

Research and Realization on Parallel Spatial Join Query Algorithm Based on Open Source RDBMS Cluster
DOI:
                        10.15888/j.cnki.csa.005462
                    
CSTR:
                        [cstr]
                    
Author:
                        FAN Xie-YUFAN Xie-YU
College of Resource and Environmental Science, Fujian Agriculture and Forestry University, Fuzhou 350002, China
Find this author on All Journals
Find this author on BaiDu
Search for this author on this site
REN Ying-ChaoREN Ying-Chao
Institute of Remote Sensing and Digital Earth, Chinese Academy of Sciences, Beijing 100011, China
Find this author on All Journals
Find this author on BaiDu
Search for this author on this site

                    
Affiliation:
Clc Number:
Fund Project:

Article

Figures

Metrics

Reference [19]

Related [20]

Cited by

Materials

Comments

Abstract:

Existing studies on parallel spatial join query mostly focus on algorithm process. Few of them pay enough attentions on implementation and application research. After analyzing existing algorithm process, parallel spatial join algorithms are divided into four phases which are parallel candidate tasks generating, assignment, executing and results collection. Each of them is designed and implemented on a parallel RDBMS cluster which is built on a open-source project named ‘PL/Proxy’. In addition, to implement parallel computation, mixed computation migration method is proposed. Spatial extension function is implemented to support the spatial data sets operation on the cluster. Result of experiments using real data sets shows that, the implemented algorithm gains near linear speedup when data declustering scheme is optimal. Moreover, speedup is also gained while significant data skew caused by data declustering exists. The data declustering scheme is replaceable for improving the algorithm performance. A practicable solution for parallel spatial data sets management is provided in this paper.

Key words:spatial data partition;parallel spatial join query;computation migration;parallel RDBMS

Reference

1 Özsu MT, Valduriez P. Distributed and parallel database systems. ACM Computing Surveys (CSUR), 1996, 28(1): 125-128.

2 Andrew P, Erik P, Alexander R. A comparison of approaches to large-scale data analysis. SIGMOD. Rhode Island, USA. ACM. 2009.

3 Shekhar S, Chawla S.谢昆青译.空间数据库.北京:机械工业出版社,2004.

4 Zhong YQ, Han JZ, Zhang TY, Li ZH, Fang JY, Chen GH. Towards parallel spatial query processing for big spatial data. 2012 IEEE 26th International Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW). 2012

5 张书彬,韩冀中,刘志勇,王凯.基于 MapReduce 实现空间查询的研究.高技术通讯,2010,(7):719-726.

6 Cary A, Yesha Y, Adjouadi M, Rishe N. Leveraging cloud computing in Geodatabase Management. IEEE International Conference on Granular Computing. San Jose, CA. IEEE. 2010. 73-78.

7 Nathan TK. Alternative Approaches to Parallel GIS Processing [M.S. Thesis]. Arizona, USA: Arizona State University, 2007.

8 pgFoundry. PgFoundry:PL/Proxy:Project Info. http://pgfoundry.org/projects/plproxy/.2012.

9 Brinkhoff T, Kriegel HP, Seeger B. Efficient processing of spatial joins using R-trees. ACM, 1993, 22.

10 Mutenda L, Kitsuregawa M. Parallel R-tree spatial join for a shared-nothing architecture. Proc. 1999 International Symposium on Database Applications in Non-Traditional Environments, 1999. (DANTE '99). IEEE, 1999. 423-430.

11 Patel JM, Dewitt DJ. Clone join and shadow join: two parallel spatial join algorithms. Proc. of the 8th ACM International Symposium on Advances in Geographic Information Systems. ACM. 2015. 54-61.

12 赵春宇.高性能并行 GIS 中矢量空间数据存取与处理关键技术研究[博士学位论文].武汉:武汉大学,2006.

13 Zhou X, Abel DJ, Truffet D. Data partitioning for parallel spatial join processing. Geoinformatica, 1997, 1262(2): 178-196.

14 陈达伦,陈荣国,谢炯.基于MPP 架构的并行空间数据库原型系统的设计与实现.地球信息科学学报,2016,18(2): 151-159.

15 Ray S, Simion B, Brown AD, Johnson R. Skew-resistant parallel in-memory spatial join. SSDBM'14 ACM. New York, USA. 2014.

16 Shohdy S, Yu S, Agrawal G. Load balancing and accelerating parallel spatial join operations using bitmap indexing. IEEE International Conference on High Performance Computing. 2016. 396-405.

17 刘宇,孙莉,田永青.并行空间连接查询处理.上海交通大学学报,2002,(4):512-515.

18 Shekhar S, Ravada S, Kumar V, et al. Declustering and load-balancing methods for parallelizing geographic information systems. IEEE Trans. on Knowledge & Data Engineering, 1998, 10(4): 632-655.

19 范协裕,任应超,邓富亮,等.基于代理的并行空间查询语言. 计算机工程,2013,11(11):61-64.

Get Citation

范协裕,任应超.开源关系数据库集群的并行空间连接算法实现.计算机系统应用,2016,25(10):233-239

Copy

Article Metrics

Abstract:1486
PDF: 2239
HTML: 0
Cited by: 0

History

Received:February 19,2016
Revised:April 19,2016
Adopted:
Online: October 22,2016
Published:

Article QR Code

You are the first990450Visitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-3
Address：4# South Fourth Street, Zhongguancun,Haidian, Beijing,Postal Code：100190
Phone：010-62661041 Fax： Email：csa (a) iscas.ac.cn
Technical Support：Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063