期刊文献+

一种局部相关不确定数据库快照集合上的概率频繁最近邻算法 被引量:12

An Algorithm on Probabilistic Frequent Nearest Neighbor Query over Snapshots of Uncertain Database with Locally Correlation
下载PDF
导出
摘要 局部相关空间不确定数据越来越受到许多实际应用的关注.提出了一种新颖的定义在不确定数据库的多个快照上的概率频繁近邻查询,目的是在多个快照数据上找到以一定概率频繁成为查询点最近邻的那些对象.应用现有的基于传统数据和基于不确定数据上的近邻查询算法直接处理这种查询会产生昂贵的开销.为了很好地解决这一问题,提出了一般的处理框架,其中包括相应的基于切尔诺夫界的过滤方法,以及对于概率质量函数的动态规划算法.给出了分别作用于两个阶段的两个过滤方法.在第1阶段,利用切尔诺夫界的上界推广形式可以过滤大量的候选目标,之后在第2阶段,利用切尔诺夫界的标准形式来进一步过滤候选目标.还讨论了用于处理扩展查询的动态规划算法以及相应的过滤条件.最后,在人工的和真实的数据上都进行了充分的实验,并验证了给出算法的有效性,为进一步的研究工作奠定了基础. Research on locally correlated spatial uncertain data recently gets more and more attentions in many practical applications. This paper proposes a novel probabilistic frequent nearest neighbor query over snapshots of uncertain database with locally correlation, which retrievals those objects which can be seen as the nearest neighbor of query point with a certain probability frequently. The existing methods of processing nearest neighbor query over traditional data or uncertain data cannot be used to process this kind of query because of the expensivetime consuming. In order to solve this problem, we propose a generic framework, including several filters based on chernoff bound and the dynamic programme used to compute the probability mass function. We provide two filtering methods for two phases. The extended form of chernoff upper bound is used to filter lots of candidates objects at the first phase; while at the second phase, we use the standard form of chernoff bound. Then we also discuss the filters and dynamic programme used to process the extended LC-PFNN query. At last, we conduct experiments both on real and synthetic data set, and the results demonstrate that our method is effective enough, which lays a solid foundation for further research.
出处 《计算机研究与发展》 EI CSCD 北大核心 2011年第10期1812-1822,共11页 Journal of Computer Research and Development
基金 国家自然科学基金项目(60703012) 国家自然科学基金重点项目(60933001)
关键词 不确定数据库 快照 局部相关性 概率频繁 最近邻查询 uncertain database snapshots locally correlation probabilistic frequent nearestneighbor query
  • 相关文献

参考文献26

  • 1Jeffery S R, Franklin M J, Garofalakis M. An adaptive RFID middleware for supporting metaphysical data independence [J]. The International Journal on Very Large Data Bases, 2008, 17(2): 265-289.
  • 2Cheng R, Kalashnikov D V, Prabhakar S. Evaluating prubabilistic queries over imprecise data [C] //Proc of ACM SIGMOD'2003. New York: ACM, 2003: 551-562.
  • 3Faradjian A, Gehrke J, Bonnet P. Gadt: A probability space ADT for representing and querying the physical world [C] // Proc of the 18th Int Conf on Data Engineering. Los Alamitos, CA: IEEE Computer Society, 2002: 201-211.
  • 4Mokbel M F, Chow C Y, Aref W G. The new Casper: Query processing for location services without compromising privacy [C]//Proc of the 32nd Int Conf on Very Large Data Bases. NewYork: ACM, 2006:763-774.
  • 5Dong X L, Berti Equille L, Srivastava D. Integrating conflicting data: The role of source dependence[J]. PVLDB, 2009, 2(1): 550-561.
  • 6Bohm C, Pryakhin A, Schubert M. The Gauss-tree: Efficient object identification in databases of probabilistic feature vectors [C] //Proe of the 22nd Int Conf on Data Engineering. Los Alamitos, CA: IEEE Computer Society, 2006 : 9.
  • 7Bleiholder J, Naumann F. Data fusion [J]. ACM Computing: Surveys, 2008, 41(1): 1-41.
  • 8Lian Xiang, Chen Lei. A generic framework for handling uncertain data with local correlations [J]. Proc of the VLDB Endowment, 2010, 4(1): 12-21.
  • 9Kanagal B, Deshpande A. Indexing correlated probabilistic databases [C] //Proc of ACM SIGMOD 2009. New York: ACM, 2009:455-468.
  • 10Jordan M I. Graphical models [J]. In Statistical Science: Special Issue on Bayesian Statistics, 2004, 19(1): 140-155.

二级参考文献6

  • 1Chan E P F, Zhang N. Finding shortest paths in large network systems [C] //Proc of the 9th ACM Int Syrup on Advances in Geographic Information Systems. New York: ACM, 2001:160-166.
  • 2Hutchinson D, Maheshwari A, Zeh N. An external memory data structure for shortest path queries[J]. Discrete Applied Mathematics, 2003, 126 (1) : 55-82.
  • 3Sharifzadeh M, Kolahdouzan M, Shahabi C. The optimal sequenced route query[J]. The VLDB Journal, 2008, 17 (4) : 765-787.
  • 4Guttman A. R-trees: A dynamic Index structure for spatial searching [C] //Proc of the ACM SIGMOD Int Conf on Management of Data. New York: ACM, 1984:47-57.
  • 5Li F, Chen D, Hadjieleftheriou M, et al. On trip planning queries in spatial databases [C]//Proe of the 9th Int Symp on Spatial and Temporal Databases. Berlin.. Springer, 2005: 273-290.
  • 6Ma X, Shekhar S, Xiong H, et al. Exploiting a page-level upper bound for multi-type nearest neighbor queries [C] // Proc of the 14th ACM Int Syrup on Advances in Geographic Information Systems. New York: ACM, 2006:179-186.

共引文献7

同被引文献86

  • 1张明波,陆锋,申排伟,程昌秀.R树家族的演变和发展[J].计算机学报,2005,28(3):289-300. 被引量:94
  • 2唐辉,齐乐华,李贺军.基于SQL Server的C/C复合材料实验数据库系统平台的设计开发[J].新型炭材料,2007,22(2):148-152. 被引量:6
  • 3李松,张丽平,孙冬璞.空间关系查询与分析[M].哈尔滨:哈尔滨工业大学出版社,2011.
  • 4Song Zhexuan,Roussopoulos N.K Nearest Neighbor Search for Moving Query Point[C]//Proc.of the 7th International Symposium on Advances in Spatial and Temporal Databases.Berlin,Germany:Springer-Verlag,2001:79-96.
  • 5Mouratidis K,Yiu M L,Papadias D.Continuous Nearest Neighbor Monitoring in Road Networks[C]//Proc.of VLDB.Seoul,Korea:[s.n.],2006.
  • 6Hu Ling,Jing Yinan,Ku W S,et al.Enforcing k Nearest Neighbor Query Integrity on Road Networks[C]//Proc.of the 20th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems.Redondo Beach,USA:ACM Press:346-349.
  • 7Elmongui H G,Mokbel M F,Aref W G.Continuous Aggregate Nearest Neighbor Queries[J].Geo Informatica,2013,17 (1):63-95.
  • 8Cheng R,Chen J,Mokbel M F.Probabilistic Verifiers:Evalutaing Constrained Nearest-neighbor Queries over Uncertain Data[C]//Proc.of International Conference on Data Engineering.Chicago,USA:[s.n.]:2008:973-982.
  • 9Nutanong S,Tanin E,Zhang Rui.Incremental Evaluation of Visible Nearest Neighbor Queries[J].IEEE Transactions on Knowledge Engineering,2010,22 (7):665-681.
  • 10Gao Yunjun,Zheng Baihua,Chen Gencai,et al.Visible Reverse k-Nearest Neighbor Query Processing in Spatial Databases[J].IEEE Transactions on Knowledge and Data Engineering,2009,21 (5):1314-1327.

引证文献12

二级引证文献31

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部