期刊文献+

Adding regular expressions to graph reachability and pattern queries 被引量:2

Adding regular expressions to graph reachability and pattern queries
原文传递
导出
摘要 It is increasingly common to find graphs in which edges are of different types, indicating a variety of relation- ships. For such graphs we propose a class of reachability queries and a class of graph patterns, in which an edge is specified with a regular expression of a certain form, ex- pressing the connectivity of a data graph via edges of var- ious types. In addition, we define graph pattern matching based on a revised notion of graph simulation. On graphs in emerging applications such as social networks, we show that these queries are capable of finding more sensible informa- tion than their traditional counterparts. Better still, their in- creased expressive power does not come with extra complex- ity. Indeed, (1) we investigate their containment and mini- mization problems, and show that these fundamental prob- lems are in quadratic time for reachability queries and are in cubic time for pattern queries. (2) We develop an algorithm for answering reachability queries, in quadratic time as for their traditional counterpart. (3) We provide two cubic-time algorithms for evaluating graph pattern queries, as opposed to the NP-completeness of graph pattern matching via subgraph isomorphism. (4) The effectiveness and efficiency of these al- gorithms are experimentally verified using real-life data and synthetic data. It is increasingly common to find graphs in which edges are of different types, indicating a variety of relation- ships. For such graphs we propose a class of reachability queries and a class of graph patterns, in which an edge is specified with a regular expression of a certain form, ex- pressing the connectivity of a data graph via edges of var- ious types. In addition, we define graph pattern matching based on a revised notion of graph simulation. On graphs in emerging applications such as social networks, we show that these queries are capable of finding more sensible informa- tion than their traditional counterparts. Better still, their in- creased expressive power does not come with extra complex- ity. Indeed, (1) we investigate their containment and mini- mization problems, and show that these fundamental prob- lems are in quadratic time for reachability queries and are in cubic time for pattern queries. (2) We develop an algorithm for answering reachability queries, in quadratic time as for their traditional counterpart. (3) We provide two cubic-time algorithms for evaluating graph pattern queries, as opposed to the NP-completeness of graph pattern matching via subgraph isomorphism. (4) The effectiveness and efficiency of these al- gorithms are experimentally verified using real-life data and synthetic data.
出处 《Frontiers of Computer Science》 SCIE EI CSCD 2012年第3期313-338,共26页 中国计算机科学前沿(英文版)
关键词 graph reachability graph pattern queries regu-lar expressions CONTAINMENT equivalence minimization graph reachability, graph pattern queries, regu-lar expressions, containment, equivalence, minimization
  • 相关文献

参考文献18

  • 1Cohen E,Halperin E,Kaplan H,Zwick U. Reachability and distance queries via 2-hop labels[J].SIAM Journal on Computing,2003,(05):1338-1355.doi:10.1137/S0097539702403098.
  • 2Jin R,Xiang Y,Ruan N,Fuhry D. 3-hop:a high-compression indexing scheme for reachability query[A].2009.813-826.
  • 3Wang H,He H,Yang J,Yu P S,Yu J X. Dual labeling:answering graph reachability queries in constant time[A].2006.75-86.
  • 4Agrawal R,Borgida A,Jagadish H V. Efficient management of transitive relationships in large data and knowledge bases[A].1989.253-262.
  • 5Jin R,Xiang Y,Ruan N,Wang H. Efficiently answering reachability queries on very large directed graphs[A].2008.595-608.
  • 6Jin R,Hong H,Wang H,Ruan N,Xiang Y. Computing label-constraint reachability in graph databases[A].2010.123-134.
  • 7Bruno N,Koudas N,Srivastava D. Holistic twig joins:optimal XML pattern matching[A].2002.310-321.
  • 8Chen L,Gupta A,Kurul M E. Stack-based algorithms for pattern matching on DAGs[A].2005.493-504.doi:10.1007/s00249-010-0621-z.
  • 9Cheng J,Yu J X,Ding B,Yu P S,Wang H. Fast graph pattern matching[A].2008.913-922.doi:10.1016/j.apnu.2009.04.010.
  • 10Tong H,Faloutsos C,Gallagher B,Eliassi-Rad T. Fast best-effort pattern matching in large attributed graphs[A].2007.737-746.

同被引文献49

  • 1HENDLER J. Web 3.0 emerging[J]. Computer, 2009, 42(1):111-113.
  • 2MANYIKA J, CHUI M, BROWN B, et al. Big Data: The Next Frontier for Innovation, Competition, and Productivity[M]. Mckinsey Global Institute. 2011.
  • 3CHAU M, CHEN H. Comparison of three vertical search spiders[J]. Computer, 2003, 36(5):56-62.
  • 4HOWE A E, DREILINGER D. Savvysearch: a meta-search engine that learns which search engines to query[J]. AI Magazine, 1997, 18(2): 19-25.
  • 5PAGE L, BRIN S, MOTWANIR, et al. The PageRank Citation Ranking: Bringing Order to the We[R]. 1999.
  • 6WILKINSON K, SAYERS C, KUNO H, et al. Efficient RDF storage and retrieval in jena2[A]. International Workshop on Semantic Web and Databases[C]. 2003.35-43.
  • 7ETZIONI O, KOK S, SODERLAND S, et al. Web-scale information extraction in knowltAll[A]. International World Wide Web Conference Proceedings[C]. 2004. 100-110.
  • 8YATES A, CAFARELLA M, BANKO M, et al. Textrurmer: open information extraction on the Web[A]. Proceedings of Human Language Technologies[C]. 2007.25-26.
  • 9WU W, LI H, WANG H, et al. Probase: a probabilistic taxonomy for text understanding[A]. Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data[C]. 2012.481-492.
  • 10FABIAN M, GJERGJI K, GERHARD W. YAGO: a core of semantic knowledge unifying wordnet and wikipedia[A]. International Conference on World Wide Web. 2007.697-706.

引证文献2

二级引证文献6

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部