期刊文献+

支持近似图查询的Why-Not问题解释方法 被引量:1

Explaining Why-Not Questions on Approximate Graph Queries
下载PDF
导出
摘要 why-not问题是为查询结果中的缺失元组找到合理的解释。解决数据库查询中的why-not问题不仅能够帮助用户更好地理解查询,而且能够提高数据库的质量和可用性。为了提高图数据库的可用性,提出了支持近似图查询的why-not问题解释方法。该解释方法不仅阐明了为什么why-not问题没有出现在查询结果中,而且给出了一些修改初始查询图的建议,使得why-not问题能够出现在修改后的查询图的查询结果中。该算法分两部分完成:第一部分为候选修改操作生成阶段,首先利用边频率信息提出候选操作集生成基本算法,接着利用图分解操作提出候选操作集生成改进算法,得到修改初始查询图的候选操作集;第二部分基于对查询图修改操作数最少的代价模型,分别采用贪心算法和回溯法选取候选操作,贪心算法设计了合理的贪心函数,回溯法构建了回溯剪枝树,并提出三种剪枝策略执行剪枝操作,最终选取的候选操作集即为支持近似图查询的why-not问题的合理解释。实验表明,该方法可以快速有效地为近似图查询中的why-not问题提供合理解释。 Why-not questions aim to seek clarifications on the missing tuples for query results. Answering why-not questions on database queries not only helps user better understand the query, but also improves data quality and the availability of database. To enhance the availability of graph database, this paper proposes an efficient explaining technique for why-not questions on approximate graph queries. This technique gives explanations for missing graphs, as well as some suggestions how to modify the initial query graph to let the missing graphs appear in the query result set. Algorithm for answering approximate graph queries includes two parts: the first part is a candidate generation phase, which proposes the basic algorithm according to frequency information of edges, and the improved algorithm by graph decomposition operation to acquire candidate operation set. And in the second part, in order to minimize the cost of modifying initial query graph, greedy algorithm and backtracking are proposed separately to select candidate operations. Reasonable greedy function is designed for greedy algorithm. The pruning trees based on candidate selection are built and three strategies for selecting candidate operations from the candidate set are proposed. The final candidate operation set includes the reasonable explanations for why-not questions on graph queries. According to experiments, the algorithm proposed in this paper can efficiently generate reasonable explanations for why-not questions on approximate graph queries.
出处 《计算机科学与探索》 CSCD 北大核心 2017年第12期1871-1885,共15页 Journal of Frontiers of Computer Science and Technology
基金 国家自然科学基金Nos.61572122 61322208 61272178 61532021 国家重点基础研究发展计划(973计划)Nos.61572122 61322208 61272178 61532021~~
关键词 近似图查询 why-not问题 回溯法 剪枝策略 approximategraphqueries why-notquestions backtracking pruningstrategy
  • 相关文献

参考文献1

二级参考文献80

  • 1金澈清,钱卫宁,周傲英.流数据分析与管理综述[J].软件学报,2004,15(8):1172-1181. 被引量:161
  • 2Wang Y Richard, Madnick Stuart E. A polygen model for heterogeneous database systems: The source tagging perspective//Proceedings of the 16th International Conference on Very Large Data Bases. Brisbane, Queensland, Australia, 1990:519-538.
  • 3Lanter D P. Design of a lineage-based meta-data base for GIS. Cartography and Geographic Information Systems, 1991, 18:255-261.
  • 4Woodruff A, Stonebraker M. Supporting fine-grained data lineage in a database visualization environment//Proceedings of the 13rd IEEE International Conference on Data Engineering. Birmingham, England, 1997:91-102.
  • 5Cui Y, Widom J, Wiener J L. Tracing the lineage of view data in a warehousing environment. The ACM Transactions on Database Systems, 2000, 25(2): 179-227.
  • 6Buneman P, Khanna S, Tan WC. Why and where, A characterization of data provenanee//Proceedings of the 17th International Conference on Data Engineering. London, UK 2001:316-330.
  • 7Simmhan Yogesh L, Plale Beth, Gannon Dennis. A survey of data provenance techniques. Computer Science Department: Indiana University, Bloomington IN: Technical Report IUB-CS-TR618, 2001.
  • 8Glavic Boris, Dittrich Klaus. Data provenance: A categorization of existing approaehes//Proceedings of the 6th MMC Workshop of BTW 2007. Aachen, Germany, 2007:227-241.
  • 9Glavic Boris, Alonso Gustavo. Perm: Processing provenance and data on the same data model through query rewriting// Proceedings of the 25th IEEE International Conference on Data Engineering. Shanghai, China, 2009: 174-185.
  • 10Kolaitis P G. Schema mappings, data exchange, and metadata management//Proceedings of the 24th ACM SIGMOD- SIGACT-SIGART Symposium on Principles of Database Systems. Baltimore, Maryland, USA, 2005:61- 75.

共引文献65

同被引文献6

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部