基于实体-属性框架的领域知识库构建 (cited by: 2)

The construction of domain knowledge base under the entity-attribute frame
Abstract: A knowledge base is an indispensable basic resource for many natural language processing tasks. Building one remains difficult, and research on automatically constructing complex domain knowledge base systems is still at an exploratory stage. This paper proposes an automatic method for constructing a domain knowledge base under the entity-attribute frame. The method uses the information in an aerospace encyclopedia to automatically acquire hyponymy relations between terms and some entity-attribute relations. Hyponymy term pairs are extracted with a multi-strategy approach that combines three methods: suffix substring matching, automatic pattern construction, and nature extraction. Each method exploits a different kind of information in the encyclopedia that reflects hyponymy. The automatic pattern construction method achieves relatively good results without any manually labeled corpus. Attribute extraction uses a pattern matching method that relies on a manually labeled corpus. Experiments show that the system reaches an F-score of 76.01% for hyponymy extraction between terms, and above 75% for each attribute extracted.
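Source: Journal of Shenyang Aerospace University (《沈阳航空航天大学学报》), 2011, No. 2, pp. 69-73 (5 pages)
Funding: Key Science and Technology Research Project of the Ministry of Education (Project No. 207148); Liaoning Province University Innovation Team Support Program (Project No. 2007T139)
Keywords: domain knowledge base; entity-attribute frame; hyponymy; attribute; aerospace encyclopedia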
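
The abstract names suffix substring matching as one of the three hyponymy extraction strategies but gives no details. The following is a minimal sketch of one plausible reading, not the authors' implementation: a term is treated as a hyponym of another dictionary term when the latter is its longest proper suffix. The function names, the `min_suffix_len` parameter, the sample terms, and the gold pairs are all hypothetical, and the evaluation helper assumes the reported F-score is the balanced F1.

```python
def suffix_hyponym_pairs(terms, min_suffix_len=2):
    """Return (hyponym, hypernym) pairs found by suffix substring matching.

    Assumption: a term is a hyponym of the longest proper suffix of itself
    that is also an entry in the term list.
    """
    term_set = set(terms)
    pairs = []
    for term in sorted(term_set):
        # Walk proper suffixes from longest to shortest and stop at the first hit.
        for start in range(1, len(term) - min_suffix_len + 1):
            suffix = term[start:]
            if suffix in term_set:
                pairs.append((term, suffix))
                break
    return pairs


def f1_score(predicted, gold):
    """Balanced F1 of predicted pairs against a gold-standard set of pairs."""
    predicted, gold = set(predicted), set(gold)
    true_positives = len(predicted & gold)
    if true_positives == 0:
        return 0.0
    precision = true_positives / len(predicted)
    recall = true_positives / len(gold)
    return 2 * precision * recall / (precision + recall)


# Hypothetical aviation terms, chosen only to illustrate the matching rule.
sample_terms = ["发动机", "喷气发动机", "涡轮喷气发动机", "机翼", "后掠机翼", "雷达"]
sample_pairs = suffix_hyponym_pairs(sample_terms)

# Hypothetical gold pairs; the last one cannot be found by suffix matching.
sample_gold = {
    ("涡轮喷气发动机", "喷气发动机"),
    ("喷气发动机", "发动机"),
    ("后掠机翼", "机翼"),
    ("雷达", "设备"),
}
sample_f1 = f1_score(sample_pairs, sample_gold)
```

Suffix matching alone misses pairs whose hypernym does not appear in the surface form of the hyponym, which is consistent with the abstract's choice to combine it with pattern-based and definition-based strategies.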