摘要
多关系频繁模式发现能够直接从复杂结构化数据中发现涉及多个关系的复杂频繁模式,避免了传统方法的局限。有别于主流基于归纳逻辑程序设计技术的方法,提出了基于合取查询包含关系的面向语义的精简化多关系频繁模式发现方法,具有理论与技术基础的新颖性,解决了两种语义冗余问题。实验表明,该方法在可理解性、功能、效率以及可扩展性方面具有优势。
Multi-relational data mining is one of rapidly developing subfields of data mining. Multi-relational frequent pattern discovery approaches directly look for frequent patterns that involve multiple relations from a relational database. While the state-of-the-art of multi-relational frequent pattern discovery approaches is based on the inductive logical programming techniques, we propose an approach to semantically condensed multi-relational frequent pattern discovery based on conjunctive query containment in terms of the theory and technique of relational database. With the novelty of the groundwork, the proposed approach deals with two kinds of semantically redundant problems. In theory and experiments, it shows that our approach improve the understandability, function, efficiency and scalabillty of the state-of-the-art of multi-relational frequent pattern discovery approaches.
出处
《中国工程科学》
2008年第9期47-53,共7页
Strategic Study of CAE
基金
国家自然科学基金资助项目(60675030)
国家科技成果重点推广计划资助项目(2003EC000001)
关键词
多关系数据挖掘
频繁模式发现
合取查询
精简化模式
multi-relational data mining
frequent pattern discovery
conjunctive query
condensed pattern