期刊文献+

不确定数据库上的top-k关键字查询 被引量:3

Top-k Keyword Query on Uncertain Database
下载PDF
导出
摘要 关系数据库上的关键字检索和不确定数据处理过去一直是两个独立的研究方向。研究了运用关键字方法检索不确定数据的问题,定义了不确定关键字查询的基本模型和语义,提出了一种在属性级粒度的不确定数据库上进行top-k关键字检索的算法。该算法根据用户指定的k值,计算并返回分数最高的前k个结果,其查询结果的评价函数综合考虑了结果与关键字的相关度和结果在可能世界语义下的概率大小。对算法进行了优化,显著降低了计算复杂度。最后通过实验,证明了算法的高效性和实用性。 The problems of keyword search on relational databases and uncertain data management have been considered extensively, however addressed in isolation in the past. This paper introduces a novel method that combines IR-style keyword query with uncertain relational databases, and defines an uncertain model and its query semantics. The paper also shows a top-k algorithm to perform keyword search query on the attribute level, and return k query results which have maximum rank scores. Rank score of a query result is well-defined, depending on its correlation with query keywords and its possibility under the possible world. An optimized algorithm is introduced to reduce the complexity of the top-k query. The experimental results demonstrate the practicality and efficiency of these methods.
出处 《计算机科学与探索》 CSCD 2011年第9期781-790,共10页 Journal of Frontiers of Computer Science and Technology
基金 国家高技术研究发展计划(863)No.2008AA121705 上海市重点基础研究项目No.08JC1402500 上海市科技创新专项No.Xiao-34-1~~
关键词 关键字检索 不确定 TOP-K 可能世界 keyword search uncertainty top-k possible world
  • 相关文献

参考文献16

  • 1Madden S R, Franklin M J, Hellerstein J M, et al. The design of an acquisitional query processor for sensor networks[C]//Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data, 2003. New York, NY, USA: ACM, 2003: 491-502.
  • 2Silberstein A, Braynard R, Ellis C, et al. A sampling- based approach to optimizing top-k queries in sensor networks[C]//Proceedings of the 22nd International Conference on Data Engineering (ICDE), Atlanta, GA, USA, 2006. Washington, DC, USA: IEEE Computer Society, 2006: 68.
  • 3Agrawal S, Chaudhuri S, Das G. DBXplorer: a system for keyword-based search over relational databases[C]//Proceedings of the 18th International Conference on Data Engineering (ICDE), San Jose, CA, USA, 2002. Washington, DC, USA: IEEE Computer Society, 2002: 1-5.
  • 4Kacholia V, Pandit S, Chakrabarti S, et al. Bidirectional expansion for keyword search on graph databases[C]// Proceedings of the 31st International Conference on Very Large Data Bases, Trondheim, Norway, 2005:505-516.
  • 5Kimelfeld B, Sagiv Y. Finding and approximating top-k answers in keyword proximity search[C]//Proceedings of the 25th ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, Chicago, Illinois, USA, 2006. New York, NY, USA: ACM, 2006: 173-182.
  • 6Hristidis V, Gravano L, Papakonstantinou Y. Efficient IR-style keyword search over relational databases[C]// Proceedings of the 29th International Conference on Very Large Data Bases, Berlin, Germany, 2003: 950-861.
  • 7Sarma A D, Benjelloun O, Halevy A, et al. Working models for uncertain data[C]//Proceedings of the 22nd International Conference on Data Engineering (ICDE), Atlanta, Georgia, USA, 2006. Washington, DC, USA: IEEE Computer Society, 2006: 7-17.
  • 8Agrawal P, Benjelloun O, Sarma A D, et al. Trio: a system for data, uncertainty, and lineage[C]//Proceedings of the 32nd International Conference on Very Large Data Bases, Seoul, Korea, 2006: 1151-1154.
  • 9Chang R, Kalashnikov D V, Prabhakar S. Evaluating probabilistic queries over imprecise data[C]//Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data, San Diego, California, USA, 2003. New York, NY, USA: ACM, 2003: 551-562.
  • 10Hua Ming, Pei Jian, Zhang Wenjie, et al. Ranking queries on uncertain data: a probabilistic threshold approach[C]// Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data, Vancouver, Canada, USA, 2008. New York, NY, USA: ACM, 2008: 673-686.

同被引文献44

  • 1刘殷雷,刘玉葆,陈程.不确定性数据流上频繁项集挖掘的有效算法[J].计算机研究与发展,2011,48(S3):1-7. 被引量:14
  • 2谢洁锐,胡月明,刘才兴,刘兰.无线传感器网络的时间同步技术[J].计算机工程与设计,2007,28(1):76-77. 被引量:9
  • 3Jin Che-Qing, Yi Ke,Chen Lei,Yu Xu,LinXue Min. Slieling Window Top-K Queries on Uncertain Stream. Proceedings of the VLDB Endowment, 2008, 1(1):301-312.
  • 4G.Cormade,M.Garofalakis. Sketching Probabilistic Data Stre- am. Proceeding of the 2007 ACM SIGMOD International Conference on Management of Data. Beijing, 2007:281-292.
  • 5T.S.Jayram,S.kale,E.Vee. Efficient Aggregation Algorithms for Probabilistic Data. Proceeding of the 18th Annual ACM- SIAM Symposium on Discrete Algorithms New-Orleans,2007: 346-355.
  • 6D.Pfoser, C.S.Jensen. Capturing the Uncertainty of Moving- Object Representation. In SSD,1999:111-132.
  • 7G.Trajcevski, O.Wolfson. Managing Uncertain Trajectories of Moving Objects with DOMINO. In ICEIS,2002:217-224.
  • 8G.Trajcerski. Probabilistic Range Queries in Moving Objects Databases with Uncertainty. In MobiDE,2003:39-45.
  • 9Y.Tao,R.Cheng,X.Xiao. Indexing Multi-Dimensional Uncer- tain Data with Arbitrary Probability Density Function. In VLDB 2005:922-933.
  • 10Y.K.Huang, S.J.Liao,C.Lee. Efficient Continuous K-Nearest Neighbor Query Processing over Moving Objects with Un- certain Speed and Direction. In SSDBM, 2008:549-557.

引证文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部