期刊文献+

基于MapReduce和双层倒排网格索引的kNN算法 被引量:1

Research of kNN algorithm based on MapReduce and double levels of inverted grid index
下载PDF
导出
摘要 随着卫星定位技术和移动互联网技术的飞速发展,地理空间数据来源变得更加多源异构.面对海量地理空间数据,如何快速有效地找到目标周围的兴趣点变得异常重要.依据空间k近邻(kNN)查询算法,提高效率的关键在数据索引和数据块存储结构设计,通过引入云计算的MapReduce编程模型,设计了一种面向MapReduce的地理空间数据双层倒排网格索引,利用CircularTrip算法实现了目标点近邻查询计算,最终获得距离目标点最邻近的数据点集.实验结果表明,该索引方法较单层倒排网格索引下的kNN查询效率有明显提高,且数据量越大效率提升越明显,此法适合大规模并行计算. With the development of satellite positioning and mobile internet technology,the geospatial data become more multi-source and heterogeneous,which consequently makes the obtainment of the interesting points around a target remarkably important.Considering that the key to the efficiency of the kNN algorithm lies in the design of data index and the storage structure of data block,we propose a MapReduce-oriented double inverted grid index for geospatial data.The targets neighbor query calculation is implemented based on the CircularTrip algorithm,and finally the nearest point sets are achieved according to the requirements.The results of the following experiments show that the indexing method not only provides a significant improvement in kNN query efficiency,but also has a good performance under a great amount of data,which consequently fits large-scale parallel computing better.
出处 《浙江大学学报(理学版)》 CAS CSCD 2014年第6期703-708,共6页 Journal of Zhejiang University(Science Edition)
基金 国家自然科学基金资助项目(41471313 41101356) 国家海洋公益性行业科研专项经费资助项目(2015418003 201305012) 浙江省科技攻关计划项目(2013C33051) 中央高校基本科研业务费专项(2013QNA3023) 国家科技基础性工作专项(2012FY112300)
关键词 双层倒排网格索引 k最邻近结点算法 云计算 MAPREDUCE CircularTrip double levels of inverted grid index kNN cloud computing MapReduce CircularTrip
  • 相关文献

参考文献7

  • 1周傲英,杨彬,金澈清,马强.基于位置的服务:架构与进展[J].计算机学报,2011,34(7):1155-1171. 被引量:171
  • 2刘鹏.云计算[M].北京:电子工业出版社,2013.
  • 3DEAN J, GHEMAWAT S. MapReduce: simplified data processing on large clusters[J]. Communications of the ACM,2008,51(1) :107- 113.
  • 4CHEEMA M A, YUAN Y, LIN X. Circulartrip: an effective algorithm for continuous kNN queries[C]// Advances in Databases: Concepts, Systems and Applica- tions. Berlin : Springer, 2007 : 863-869.
  • 5CARY A, SUN Z, HRISTIDIS V, et al. Experiences on processing spatial data with mapreduce[C]//Scien- tific and Statistical Database Management. Berlin= Springer,2009:302 -319.
  • 6AKDOGAN A, DEMIRYUREK U, BANAEI-KAS- HANI F, et al. Voronoi-based geospatial query pro cessing with mapreduce[C]//Cloud Computing Tech- nology and Science (CloudCom), 2010 IEEE Second In- ternational Conference on. Angeles : IEEE, 2010 : 9-16.
  • 7JI Changqing, DONG Tingting, LI Yu, et al. Inverted grid-based knn query processing with mapreduce[C]// ChinaGrid Annual Conference (ChinaGrid), 2012 Sev- enth. Beiiin:: IEEE,2012:25 32.

二级参考文献103

  • 1潘晓,肖珍,孟小峰.位置隐私研究综述[J].计算机科学与探索,2007,1(3):268-281. 被引量:65
  • 2Yang B, Lu H, Jensen C S. Scalable continuous range monitoring of moving objects in symbolic indoor space//Proeeedings of the 18th ACM Conference on Information and Knowledge Management. Hong Kong, China, 2009:671-680.
  • 3Wolfson O, Sistla P A, Chamberlain S, Yesha Y. Updating and querying databases that track mobile units. Distributed and Parallel Databases, 1999, 7(3): 257-387.
  • 4Pfoser D, Jensen C S. Capturing the uncertainty of movingobjects representations//Proceedings of the 6th International Symposium on Advances in Spatial Databases. Hong Kong, China, 1999:111-132.
  • 5Cheng R: Kalashnikov D V, Prabhakar S. Querying imprecise data in moving object environments. IEEE Transactions on Knowledge and Data Engineering, 2004, 16(9): 1112- 1127.
  • 6Zhang M, Chen S, Jensen C S, Ooi B C, Zhang Z. Effectively indexing uncertain moving objects for predictive queries// Proceedings of the VLDB Endowment. Lyon, 2009, 2 (1): 1198-1209.
  • 7Cheng R, Chen L, Chen J, Xie X. Evaluating probability threshold k-nearest-neighbor queries over uncertain data// Proceedings of the 12th International Con/erence on Extending Database Technology. Saint Petersburg, 2009 :672-683.
  • 8Tao Y, Cheng R, Xiao X, Ngai W K, Kao B, Prabhakar S. Indexing multi-dimensional uncertain data with arbitrary probability density funetions//Proceedings of the 31st International Conference on Very Large Data Bases. Trondheim, 2005 : 922-933.
  • 9Kalashnikov D V, Ma Y, Mehrotra S, Hariharan R. Index for fast retrieval of uncertain spatial point data//Proceedings of the 14th ACM International Symposium on Geographic Information Systems. Arlington, 2006:195-202.
  • 10Chen J, Cheng R. Efficient evaluation of imprecise location- dependent queries//Proceedings of the 23rd International Conference on Data Engineering. Istanbul, 2007:586-595.

共引文献171

同被引文献9

引证文献1

二级引证文献10

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部