Research and Realization on Parallel Spatial Join Query Algorithm Based on Open Source RDBMS Cluster
CSTR:
Author:
  • Article
  • | |
  • Metrics
  • |
  • Reference [19]
  • |
  • Related [20]
  • | | |
  • Comments
    Abstract:

    Existing studies on parallel spatial join query mostly focus on algorithm process. Few of them pay enough attentions on implementation and application research. After analyzing existing algorithm process, parallel spatial join algorithms are divided into four phases which are parallel candidate tasks generating, assignment, executing and results collection. Each of them is designed and implemented on a parallel RDBMS cluster which is built on a open-source project named ‘PL/Proxy’. In addition, to implement parallel computation, mixed computation migration method is proposed. Spatial extension function is implemented to support the spatial data sets operation on the cluster. Result of experiments using real data sets shows that, the implemented algorithm gains near linear speedup when data declustering scheme is optimal. Moreover, speedup is also gained while significant data skew caused by data declustering exists. The data declustering scheme is replaceable for improving the algorithm performance. A practicable solution for parallel spatial data sets management is provided in this paper.

    Reference
    1 Özsu MT, Valduriez P. Distributed and parallel database systems. ACM Computing Surveys (CSUR), 1996, 28(1): 125-128.
    2 Andrew P, Erik P, Alexander R. A comparison of approaches to large-scale data analysis. SIGMOD. Rhode Island, USA. ACM. 2009.
    3 Shekhar S, Chawla S.谢昆青译.空间数据库.北京:机械工业出版社,2004.
    4 Zhong YQ, Han JZ, Zhang TY, Li ZH, Fang JY, Chen GH. Towards parallel spatial query processing for big spatial data. 2012 IEEE 26th International Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW). 2012
    5 张书彬,韩冀中,刘志勇,王凯.基于 MapReduce 实现空间查询的研究.高技术通讯,2010,(7):719-726.
    6 Cary A, Yesha Y, Adjouadi M, Rishe N. Leveraging cloud computing in Geodatabase Management. IEEE International Conference on Granular Computing. San Jose, CA. IEEE. 2010. 73-78.
    7 Nathan TK. Alternative Approaches to Parallel GIS Processing [M.S. Thesis]. Arizona, USA: Arizona State University, 2007.
    8 pgFoundry. PgFoundry:PL/Proxy:Project Info. http://pgfoundry.org/projects/plproxy/.2012.
    9 Brinkhoff T, Kriegel HP, Seeger B. Efficient processing of spatial joins using R-trees. ACM, 1993, 22.
    10 Mutenda L, Kitsuregawa M. Parallel R-tree spatial join for a shared-nothing architecture. Proc. 1999 International Symposium on Database Applications in Non-Traditional Environments, 1999. (DANTE '99). IEEE, 1999. 423-430.
    11 Patel JM, Dewitt DJ. Clone join and shadow join: two parallel spatial join algorithms. Proc. of the 8th ACM International Symposium on Advances in Geographic Information Systems. ACM. 2015. 54-61.
    12 赵春宇.高性能并行 GIS 中矢量空间数据存取与处理关键技术研究[博士学位论文].武汉:武汉大学,2006.
    13 Zhou X, Abel DJ, Truffet D. Data partitioning for parallel spatial join processing. Geoinformatica, 1997, 1262(2): 178-196.
    14 陈达伦,陈荣国,谢炯.基于MPP 架构的并行空间数据库原型系统的设计与实现.地球信息科学学报,2016,18(2): 151-159.
    15 Ray S, Simion B, Brown AD, Johnson R. Skew-resistant parallel in-memory spatial join. SSDBM'14 ACM. New York, USA. 2014.
    16 Shohdy S, Yu S, Agrawal G. Load balancing and accelerating parallel spatial join operations using bitmap indexing. IEEE International Conference on High Performance Computing. 2016. 396-405.
    17 刘宇,孙莉,田永青.并行空间连接查询处理.上海交通大学学报,2002,(4):512-515.
    18 Shekhar S, Ravada S, Kumar V, et al. Declustering and load-balancing methods for parallelizing geographic information systems. IEEE Trans. on Knowledge & Data Engineering, 1998, 10(4): 632-655.
    19 范协裕,任应超,邓富亮,等.基于代理的并行空间查询语言. 计算机工程,2013,11(11):61-64.
    Cited by
    Comments
    Comments
    分享到微博
    Submit
Get Citation

范协裕,任应超.开源关系数据库集群的并行空间连接算法实现.计算机系统应用,2016,25(10):233-239

Copy
Share
Article Metrics
  • Abstract:1486
  • PDF: 2239
  • HTML: 0
  • Cited by: 0
History
  • Received:February 19,2016
  • Revised:April 19,2016
  • Online: October 22,2016
Article QR Code
You are the first990450Visitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-3
Address:4# South Fourth Street, Zhongguancun,Haidian, Beijing,Postal Code:100190
Phone:010-62661041 Fax: Email:csa (a) iscas.ac.cn
Technical Support:Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063