期刊文献+

不确定数据库中基于x-tuple的高效Top-k查询处理算法 被引量:4

Efficient Processing of X-Tuple Based Top-k Queries in Uncertain Database
下载PDF
导出
摘要 Top-k查询由于其广泛的应用而倍受欢迎.不确定数据库中通常考虑的两条生成规则是:独立和互斥,一个x-tuple是由一些互斥的元组组成的,构成一个x-tuple的各个元组称为该x-tuple的可选元组.U-kRanks查询考虑x-tuple中每个可选元组排在前k的概率,并返回最可能排在前k的k个元组.已有的Top-k语义都没有将x-tuple作为一个整体,因此,定义了一种新的Top-k查询语义,不确定x-kRanks查询(U-x-kRanks),该Top-k语义返回最可能排在前k的k个x-tuple而非元组.新语义考虑x-tuple中的每个可选元组位于前k的概率,并将之汇集,得到整个x-tuple位于前k的概率.提出了一种基于动态规划的有效算法处理U-x-kRanks查询,在最小的搜索空间内完成查询处理过程.不同数据集合上的综合实验显示,所提出的算法是高效的. Like top-k in traditional databases,top-k queries in uncertain databases are quite popular and useful due to its wide application usage.However,compared with top-k in traditional databases,queries over uncertain database are more complicated because of the existence of exponential possible worlds.Often,two kinds of generation rules are considered in the uncertain database:independent and mutually exclusive.An x-tuple is the union of the tuples mutually exclusive.U-kRanks queries consider each alternative in x-tuple as single one and return the tuple which has the highest probability appearing at top k or a given rank.However,no matter which alternative(tuple) of an x-tuple appears in a possible world,it is undoubtedly believed that this x-tuple appears in the same possible world accordingly.Thus,instead of ranking each individual tuple,the authors define a novel top-k query semantic in uncertain database,uncertain x-kRanks queries(U-x-kRanks),which return k x-tuples according to the score and the confidence of alternatives in x-tuples,respectively.In order to reduce the search space,they present an efficient algorithm to process U-x-kRanks queries,which can minimize the scan depth by terminating the scan process as soon as the answers are found.Comprehensive experiments on different data sets demonstrate the effectiveness of the proposed solutions.
出处 《计算机研究与发展》 EI CSCD 北大核心 2010年第8期1415-1423,共9页 Journal of Computer Research and Development
基金 国家自然科学基金项目(60803105 60763001) 国家社会科学基金项目(07BTQ025) 江西省教育厅科技重点基金项目(GJJ08508) 江西省教育厅科学技术研究重点基金项目(赣教技字[2007]435号)~~
关键词 x-tuple TOP-K 不确定x-kRanks查询 不确定数据库 动态规划算法 x-tuple top-k uncertain x-kRanks queries uncertain database dynamic programming algorithm
  • 相关文献

参考文献17

  • 1Halevy A,Rajaraman A,Ordille J.Data integration:The teenage year[C] //Proc of the 32nd Int Conf on Very Large Data Bases.New York:ACM,2006:9-16.
  • 2Chaudhuri S,Ganjam K,Ganti V,et al.Robust and efficient fuzzy match for online data cleaning[C] //Proc of the 2003 ACM SIGMOD Int Conf on Management of Data.New York:ACM,2003:313-324.
  • 3Gupta R,Sarawagi S.Creating probabilistic databases from information extraction models[C] //Proc of the 32nd Int Conf on Very Large Data Bases.New York:ACM,2006:965-976.
  • 4崔逊学,方红雨,朱徐来.传感器网络定位问题的概率特征[J].计算机研究与发展,2007,44(4):630-635. 被引量:14
  • 5Jeffery S R,Garofalakis M,Franklin M J.Adaptive cleaning for RFID data streams[C] //Proc of the 32nd Int Conf on Very Large Data Bases.New York:ACM,2006:163-174.
  • 6Liu L.From data privacy to location privacy:Models and algorithms[C] //Proc of the 33rd Int Conf on Very Large Data Bases.New York:ACM,2007:1429-1430.
  • 7Dalvi N,Suciu D.Management of probabilistic data foundations and challenges[C] //Proc of the ACM SIGMOD Int Conf on Management of Data.New York:ACM,2007:1-12.
  • 8周傲英,金澈清,王国仁,李建中.不确定性数据管理技术研究综述[J].计算机学报,2009,32(1):1-16. 被引量:185
  • 9Ilyas I F,Beskales G,Soliman M A.Survey of top-k query processing techniques in relational database systems[J].ACM Computing Surveys,2008,40(4):1-58.
  • 10Soliman M A,Ilyas I F,Chang K C -C.Probabilistic top-k and ranking-aggregate queries[J].ACM Trans on Database System,2008,33(3):13:1-13:54.

二级参考文献106

  • 1金澈清,钱卫宁,周傲英.流数据分析与管理综述[J].软件学报,2004,15(8):1172-1181. 被引量:161
  • 2崔莉,鞠海玲,苗勇,李天璞,刘巍,赵泽.无线传感器网络研究进展[J].计算机研究与发展,2005,42(1):163-174. 被引量:730
  • 3王福豹,史龙,任丰原.无线传感器网络中的自身定位系统和算法[J].软件学报,2005,16(5):857-868. 被引量:671
  • 4谷峪,于戈,张天成.RFID复杂事件处理技术[J].计算机科学与探索,2007,1(3):255-267. 被引量:54
  • 5Deshpande A, Guestrin C, Madden S, Hellerstein J M, Hong W. Model-driven data acquisition in sensor networks// Proceedings of the 30th International Conference on Very Large Data Bases. Toronto, 2004:588-599
  • 6Madhavan J, Cohen S, Xin D, Halevy A, Jeffery S, Ko D, Yu C. Web-scale data integration: You can afford to pay as you go//Proceedings of the 33rd Biennial Conference on Innovative Data Systems Research. Asilomar, 2007:342-350
  • 7Liu Ling. From data privacy to location privacy: Models and algorithms (tutorial)//Proceedings of the 33rd International Conference on Very Large Data bases. Vienna, 2007: 1429- 1430
  • 8Samarati P, Sweeney L. Generalizing data to provide anonymity when disclosing information (abstract)//Proeeedings of the 17th ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems. Seattle, 1998:188
  • 9Cavallo R, Pittarelli M. The theory of probabilistic databases//Proceedings of the 13th International Conference on Very Large Data Bases. Brighton, 1987:71-81
  • 10Barbara D, Garcia-Molina H, Porter D. The management of probabilistic data. IEEE Transactions on Knowledge and Data Engineering, 1992, 4(5): 487-502

共引文献197

同被引文献35

  • 1杨浩,尹东,洪日昌.高分辨率遥感图像中桥梁自动识别方法研究[J].计算机仿真,2006,23(9):119-122. 被引量:7
  • 2骆剑承,明冬萍,沈占锋,汪闽,盛昊.高分辨率遥感影像桥梁特征提取方法研究[J].计算机应用研究,2006,23(10):151-153. 被引量:13
  • 3Cheng R, Kalashn I D, Prabhakar S. Evaluating probabilistic que- ries over imprecise data [C] ff Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data. New York.. ACM Press, 2003,551-562.
  • 4Soliman M A, Ilyas I F, Chang K C. Top-k query processing in uncertain databases [C]//Proeeedings of the 23rd IEEE Inter- national Conference on Data Engineering. Istanbul, 2007: 896-905.
  • 5Chen Jiang,Yi Ke. Dynamic structures for top-k queries on un- certain data [-C]//Proceedings of the 18th International Sympo- sium on Algorithms and Computation. Sendai, 2007 : 427-438.
  • 6Hua Ming,Pei J'ian, Zhang Wen-jie, et al. Efficiently answering probabilistic threshold top-k queries on uncertain data [C] ff Proceedings of the 24th IEEE International Conference on Data Engineering. 2008 : 1403-1405.
  • 7Jin Che-qing, Yi Ke, Chen Lei, et al. Sliding-window top-k que- ries on uncertain Streams EJ]. Proceedings of the 34th Interna- tional Conference on Very Large Data Bases, 2008, 1 (1) : 301- 312.
  • 8Hua Ming, Pei Jian, Zhang Wen-jie, et al. Ranking queries on uncertain data.. A probabilistie threshold approach C]//SIG- MOD. 2008 : 673-687.
  • 9Yi Ke, Li Fei-fei, Kollios G. Efficient Processing of Top-k Que-ries in Uncertain Databases with X-Relations ['J. Knowledge and Data Engineering, 2009,21 (1) : 1-14.
  • 10Liu Ling. From Data Privacy to Location Privacy: Models and Algorithms[C]//Proc. of the 33rd Int'l Conf. on Very Large Data Bases. New York, USA: ACM Press, 2007: 1429-1430.

引证文献4

二级引证文献6

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部