期刊文献+

面向不确定图的概率可达查询 被引量:11

Answering Probabilistic Reachability Queries over Uncertain Graphs
下载PDF
导出
摘要 图的可达性查询被广泛应用于生物网络、社会网络、本体网络、RDF数据库和XML数据库等.由于对数据操作时引入的噪声和错误使这些图数据具有不确定性,已经有大量的针对不确定RDF和XML数据库的研究.文中使用可能世界语义模型构建不确定图,基于该模型,研究了概率可达查询(PR).处理PR查询是#P完全问题,对此文中首先给出一个基本随机算法,可快速地估算出可达概率,并且该值有很高的精确度.进一步,文中为随机算法引入条件分布(称为"条件随机算法"),采用图的不相交路径集和割集作为条件概率分布,因此改进的随机算法可准确地并且是在多项式时间内处理查询.最后基于真实不确定图数据的大量实验结果验证了文中的设计. Graph reachability queries are widely used in biological networks,social networks,ontology networks,RDF and XML databases.Meanwhile,data extracted from those applications is inherently uncertain due to noise,incompleteness and inaccuracy,and many works have been proposed to study uncertain RDF and XML databases.This paper discusses the reachability queries over uncertain graphs,specifically a probabilistic reachability(PR) query over an uncertain graph using the possible world semantics.It is proved that processing PR query is a P-complete problem.The authors first propose a basic random algorithm to efficiently estimate the reachable probability with a high quality.To further improve the basic method,the authors introduce conditional distribution in random algorithm called conditional random algorithm(CRA),and compute the disjoint path set and cut set probabilities for the conditional distribution that is used in CRA,which helps us to find the querying results in polynomial time.Finally,the authors have verified the effectiveness of the proposed solutions for PR queries through extensive experiments on real uncertain graph datasets.
作者 袁野 王国仁
出处 《计算机学报》 EI CSCD 北大核心 2010年第8期1378-1386,共9页 Chinese Journal of Computers
基金 国家自然科学基金重点项目(60933001) 国家自然科学基金面上项目(60773221) 国家"八六三"高技术研究发展计划项目基金(2009AA01Z150) 国家自然科学基金(60803026)资助~~
关键词 不确定图 可能世界 条件随机算法 路径集 割集 uncertain graph possible world conditional random algorithm path set cut set
  • 相关文献

参考文献24

  • 1Nierman A,Jagadish H V.ProTDB:Probabilistic data in XML//Proceedings of the VLDB.Hong Kong,China,2002:646-657.
  • 2Senellart P,Abiteboul S.On the complexity of managing probabilistic XML data//Proceedings of the PODS.Beijing,China,2007:283-289.
  • 3Adar E,Re C.Managing uncertainty in social networks.IEEE Data Engineering Bulletin,2007,30(2):15-22.
  • 4Asthana S,King O D,Gibbons F D et al.Predicting protein complex membership using probabilistic network reliability.Genome Research,2004,14(6):1170-1175.
  • 5Chui H N,Sung W K,Wong L.Exploiting indirect neighbors and topological weight to predict protein function from protein-protein interactions.Bioinformatics,2007,22(13):47-58.
  • 6Jiang R,Tu Z,Chen T et al.Network motif identification in stochastic networks.PNAS,2006,103(25):9404-9409.
  • 7Saito R,Suzuki H,Hayashizaki Y.Interaction generality:A measurement to assess the reliability of a protein-protein interaction.Nucleic Acids Research,2002,30(5):1163-1168.
  • 8Dalvi N N,Suciu D.Management of probabilistic data:Foundations and challenges//Proceedings of PODS.Beijing,China,2007:1-12.
  • 9Khoussainova N,Balazinska M.Towards correcting input data errors probabilistically using integrity constraints//Proceedings of the MobiDE Workshop.Chicago,Illinois,USA,2006:43-50.
  • 10Jin R,Xiang Y,Ruan N.Efficiently answering reachability queries on very large directed graphs//Proceedings of SIGMOD.Vancouver,Canada,2008:595-608.

二级参考文献17

  • 1周水庚 蔚赵春 蒋豪良.图结构数据搜索的概念、问题与进展[J].中国计算机学会通讯,2007,3(8):59-65.
  • 2Shasha D, Wang T L J, Giugno R. Algorithmics and applications of tree and graph searching//Proceedings of the 21st ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems. Madison, 2002:39-52.
  • 3Yan X, Yu P S, Han J. Graph indexing: A frequent structure-based approach//Proeeedings of the 2004 ACM SIGMOD International Conference on Management of Data. Paris, 2004:335-346.
  • 4Saito R, Suzuki H, Hayashizaki Y. Interaction generality, a measurement to assess the reliability of a protein-protein interaction. Nucleic Acids Research, 2002, 30(5): 1163-1168.
  • 5李建中 于戈 周傲英.不确定性数据管理的要求与挑战[J].中国计算机学会通讯,2009,5(4):6-14.
  • 6高宏 张炜.不确定图数据管理研究现状[J].中国计算机学会通讯,2009,5(4):31-36.
  • 7Dalvi N N, Suciu D. Efficient query evaluation on probabilistic databases. VLDB Journal, 2007, 16(4): 523-544.
  • 8Soliman M A, Ilyas I F, Chang K C. Top-k query processing in uncertain databases//Proceedings of the 23rd International Conference on Data Engineering. Istanbul, 2007:896-905.
  • 9Hintsanen P. The most reliable subgraph problem//Proceedings of the llth European Conference on Principles and Practice of Knowledge Discovery in Databases. Warsaw, 2007: 471-478.
  • 10Hintsanen P, Toivonen H. Finding reliable subgraphs from large probabilistic graphs. Data Mining and Knowledge Discovery, 2008, 17(1): 3-23.

共引文献47

同被引文献127

  • 1Cheng J,Yu J,Lin X.Fast computing reachability labelings for large graphs with high compression rate//Proceedings of the EDBT.Nantes,France,2008:193-204.
  • 2Agrawal R,Borgida A,Jagadish H V.Efficient management of transitive relationships in large data and knowledge bases.ACM SIGMOD Record,1989,18(2):253-262.
  • 3Chen L,Gupta A,Kurul M.Stack-based algorithms for pattern matching on dags//Proceedings of the VLDB.Trondheim,Norway,2005:493-504.
  • 4Tribl S,Leser U.Fast and practical indexing and querying of very large graphs//Proceedings of the SIGMOD.Beijing,China,2007:845-856.
  • 5Benjelloun O,Sarma A D,Hayworth C.An introduction to ULDBs and the Trio system.IEEE Data Engineering Bulletin,2006,29(1):5-16.
  • 6Soliman M A,Ilyas I F,Chang K C.Top-k query processing in uncertain databases//Proceedings of the ICDE.Istanbul,2007:896-905.
  • 7Pei J,Jiang B,Lin X.Probabilistic skylines on uncertain data//Proceedings of the VLDB.Vienna,Austria,2007:15-26.
  • 8Nierman A,Jagadish H V.ProTDB:Probabilistic data in XML//Proceedings of the VLDB.Hong Kong,China,2002:646-657.
  • 9Senellart P,Abiteboul S.On the complexity of managing probabilistic XML data//Proceedings of the PODS.Beijing,China,2007:283-289.
  • 10Haase P,Volker J.Ontology learning and reasoning-dealing with uncertainty and inconsistency//Proceeding of the Workshop on Uncertainty Reasoning for the Semantic Web (URSW).New York,USA,2005:45-55.

引证文献11

二级引证文献18

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部