期刊文献+

MapReduce框架下两个集合间的k最近对查找算法 被引量:1

Algorithm for k-Closest Pair Query Based on Two Sets on MapReduce Framework
下载PDF
导出
摘要 针对大数据集下k最近对查询,提出在MapReduce框架下基于R*-tree索引的查询处理技术.先提出在M apReduce框架下快速构建R*-tree索引的方法.在构建索引过程中,采用抽样方法快速确定空间划分函数,保证了将数据对象均匀地划分到各个分区.在已构建的R*-tree索引上,完成k最近对的查询处理.在查询执行过程中,引入基于M BR剪枝规则来过滤不相关对象,从而在很大程度上减少了计算量,提高了查询效率.实验结果表明,该算法具有良好的计算效率和可扩展性,能较好地满足大数据集下k最近对查询请求. For k-CPQ( Close Pair Query) between large scale datasets,this paper presents an R^*-tree index based approach for processing the queries on MapReduce platform. Before processing the queries,a method for fast building R^*-tree index on MapReduce platform has proposed. During the building of R^*-tree,the space partition functions are determined with a sampling approach which ensures data objects can be evenly divided into different partitions. To improve the query performance,an MBR-based pruning rule is employed to filter the irrevelant data objects before executing the queries. Experimental results demonstrate that the proposed approach of k-CPQ has good efficiency and scalability over large scale datasets.
出处 《小型微型计算机系统》 CSCD 北大核心 2016年第3期483-487,共5页 Journal of Chinese Computer Systems
基金 国家自然科学基金项目(61003031)资助 上海重点科技攻关项目(14511107902)资助 上海市工程中心建设项目(GCZX14014)资助 上海市一流学科建设项目(XTKX2012)资助 沪江基金研究基地专项(C14001)资助
关键词 MAPREDUCE k最近点对 R^*-tree 空间查询 MapReduce k-CPQ R^*-tree spatial query
  • 相关文献

参考文献13

  • 1Gaede V, GUnther O. Multidimensional access methods [ J ]. ACM Computing Surveys ( CSUR), 1998,30 (2) : 170-231. 被引量:1
  • 2Corral A, Almendros Jim6nez J M. A performance comparison of distance-based query algorithms using R-trees in spatial databases [ J ]. Information Sciences,2007,177 ( 11 ) :2207-2237. 被引量:1
  • 3Corral A, Manolopoulos Y, Theodoridis Y, et al. Algorithms for processing < i > K </i > -closest-pair queries in spatial databases [ J]. Data & Knowledge Engineering ,2004,49( 1 ) :67-104. 被引量:1
  • 4Corrai A, Manolopoulos Y, Theodoridis Y, et al. Cost models for distance joins queries using R-trees [ J ]. Data & Knowledge Engi- neering ,2006,57 ( 1 ) : 1-36. 被引量:1
  • 5刘义,景宁,陈荦,熊伟.MapReduce框架下基于R-树的k-近邻连接算法[J].软件学报,2013,24(8):1836-1851. 被引量:60
  • 6Shan J, Zhang D, Salzberg B. On spatial-range closest-pair query [ M ]. Advances in Spatial and Temporal Databases, Springer Berlin Heidelberg, 2003:252 -269. 被引量:1
  • 7Hjaltason G R,Sanr,t H. Instal distance join algorithms for spatial databases[ C]. ACM SIGMOD Record,ACM,1998,27(2):237-248. 被引量:1
  • 8Pincheira M,Guti6rrez G,Gajardo L. Closest pair query on spatial data sets without index [ C ]. Chilean Computer Science Society (SCCC), 2010 XXIX International Conference of the. IEEE,2010:178-182. 被引量:1
  • 9Guti6rrez G, Siez P. The k closest pairs in spatial databases [ J ]. GeoInformatica,2013,17 ( 4 ) : 543-565. 被引量:1
  • 10LIU Yi JING Ning CHEN Luo CHEN Huizhong.Parallel Bulk-Loading of Spatial Data with MapReduce:An R-tree Case[J].Wuhan University Journal of Natural Sciences,2011,16(6):513-519. 被引量:4

二级参考文献26

  • 1Apostolos P, Yannis M. Parallel bulk loading of spatial data [J]. Parallel Computing, 2003, 29: 1419-1444. 被引量:1
  • 2Roussopoulos N, Leiiker D. Direct spatial search on pictorial databases using packed R-trees [C]//Proceedings of 1985 ACM SIGMOD Conference. Austin: ACM Press, 1985:17- 31. 被引量:1
  • 3Kamel I, Faloutsos C. On packing R-trees [C]//Proceedings of the 2nd CIKM Conference. Washington: ACM press, 1993: 490-499. 被引量:1
  • 4Leutenegger S T, Lopez M A, Edgington J. STR: A simple and efficient algorithm for R-tree packing [C]//Proceedings of the 13th IEEE ICDE Conference. Birmingham: IEEE Computer Society Press, 1997: 497-506. 被引量:1
  • 5Leutenegger S T, Nicol D M. Efficient bulk-loading of grid files [J]. IEEE Transactions on Knowledge and Data Engineering, 1997, 9: 410-420. 被引量:1
  • 6Ciaccia P, Patella M. Bulk-loading the M-tree [C]//Proceedings of the 9th Australian Database Conference. Perth: Springer-Verlag, 1998: 15-26. 被引量:1
  • 7van den Bercken J, Seeger B. A generic approach to bulk loading multidimensional index structures [C]//Proceedings of the 23rd VLDB Conference. Athens: Springer-Verlag, 1997 406-415. 被引量:1
  • 8Arge L, Hinrichs K H. Efficient bulk operations on dynamic R-trees [J]. Algorithmiea, 2002, 1: 104-128. 被引量:1
  • 9Berchtold S, Bohm C. Improving the query performance of high-dimensional index structures by bulk load operations [C]//Proceedings of the 6th EDBT. Valencia: Springer-Verlag, 1998: 216-230. 被引量:1
  • 10van den Bercken J, Seeger B. An evaluation of generic bulk-loading techniques [C]//Proceedings of the 27th VLDB Conference. Rome: Springer-Verlag, 2001: 461-470. 被引量:1

共引文献61

同被引文献3

引证文献1

二级引证文献9

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部