摘要
总结了用于数据挖掘的6种类型领域知识.据此,提出了索引知识加领域知识的2级知识组织方法,前者相当于知识目录,根据知识特征要素对后者编号.领域知识库包括规则、函数、层次知识和基本信息4个子库.讨论了多种类型领域知识的语法校验内容,指出领域规则的矛盾性具有结论矛盾、可信度矛盾和包含矛盾3种表现形式,冗余性具有等价冗余、包含冗余、条件冗余3种表现形式,并给出了校验算法.基于上述研究成果,开发了支持数据挖掘的知识库系统KB4DM.
Six types of domain knowledge for data mining were summarized. A method of two-level knowledge organization, that is a combination of index knowledge and domain knowledge, was proposed. The former, as the knowledge catalog, codes the latter according to the characteristics of knowledge. Domain knowledge base consists of rule, function, hierarchical knowledge and basic information sub-bases. Syntax check for all kinds of domain knowledge was discussed. It was pointed out that cohflict forms of domain rule included conclusion conflict, confidence conflict and inclusion conflict, and redundancy forms of domain rule included equivalence redundancy, inclusion redundancy and condition redundancy. Check algorithms were presented subsequently. A knowledge base system, named KB4DM (knowlege base for data mining), was implemented based on the research.
出处
《西南交通大学学报》
EI
CSCD
北大核心
2005年第3期406-411,416,共7页
Journal of Southwest Jiaotong University
基金
国家科技部项目(2002ED691036)资助
关键词
知识库
数据挖掘
知识组织
知识校验
知识发现
knowledge base
data mining
knowledge organization
knowledge check
knowledge discovery