摘要
根据专利领域本体构建的需求,提出一种专利领域本体概念语义层次获取方法。通过分析专利领域技术主题概念在形式化时的构词规律以及上下位关系的表现方式,利用相对修饰度和关联规则识别上下位关系。然后分析上下位关系的特性,总结关系冗余和关系冲突的消除规则,构建出专利领域概念语义层次图。实验结果表明,上下位关系识别方法具有较高的准确率和召回率,构建概念语义层次图的方法取得了较好的关系冗余和关系冲突的消除效果,证实了本文方法的有效性。
For the demand of ontology construction in patent domain, we propose a concept semantic hierarchy induction approach. For this purpose, the work isdiscomposed to two dimensions. First, we analyze the word-formation rules and hierarchical relation presentation forms of technology theme concepts in patent domain. Based on this, a relative decoration based approach and an association rule based approach areproposed to hierarchical relation extraction. Second, characteristics of hyponymy relations areanalyzed to achieve those rules to eliminate redundancies and conflictions in the extracted relation. The experimental result shows that the approach of hierarchical relation extraction can achieve high accuracy and recall rate, the approach of concept semantic hierarchy induction can achieve satisfied elimination result. The resultprovesthe validity of the approach in this paper.
出处
《情报学报》
CSSCI
北大核心
2014年第9期986-993,共8页
Journal of the China Society for Scientific and Technical Information
基金
国家自然科学基金项目“基于本体的专利自动标引研究”(61271304)
北京市教委科技发展计划重点项目暨北京市自然科学基金B类重点项目“面向领域的互联网多模态信息精准搜索方法研究”(KZ201311232037)
北京市属高等学校创新团队建设与教师职业发展计划项目(IDHT20130519)
关键词
专利领域
本体
上下位关系
概念语义层次
patent, domain ontology, hierarchy relationship, concept semantic hierarchy