期刊文献+

信息抽取中实体关系模式的可信度评估 被引量:1

Evaluating the Confidence of Relation Pattern in Information Extraction
下载PDF
导出
摘要 在基于Bootstrap的信息抽取技术中,为提高实体关系抽取模式的质量,需要对抽取模式的可信度进行评估。本文提出了根据模式的历史匹配记录来对其进行可信度评估的简单方法,并以此为基础对模式进行了优化合并。经过可信度评估的模式在对句子进行实体关系标注时,有效提高了标注的准确率。这说明该方法能够提高抽取模式的质量,对信息抽取系统的性能提高有一定价值。 In the information extraction technology based on Bootstrap method,in order to improve the quality of the relation extraction pattern,it's necessary to evaluate the confidence of the extraction pattern. This paper introduces a simple method of evaluating confidence by pattern's matching record,and based on this,the patterns are optimized and merged. When these patterns are used to label the relations of the sentences,the labeling accuracy is effectively increased. So,this method can increase the quality of the extraction pattern,and it's of some value in improving the performance of the information extraction system.
出处 《情报理论与实践》 CSSCI 北大核心 2009年第12期103-105,共3页 Information Studies:Theory & Application
基金 国家自然科学基金项目资助的成果之一 项目编号:70803048
关键词 信息抽取 关系模式 模式匹配 可信度 information extraction relation pattern pattern match confidence
  • 相关文献

参考文献8

  • 1AGICHTEIN E, GRAVANO L. Snowball: extracting relations from large plain-text collections [ C]. ACM DL 2000: 85-94.
  • 2AGICHTEIN E. Confidence estimation methods for partially supervised relation extraction [ C ]. SDM 2006.
  • 3CULOTTA A, MCCALLUM A. Confidence estimation for information extraction [ C ] // Proceedings of Human Language Technology Conference and North American Chapter of the Association for Computational Linguistics ( HLT-NAACL), 2004.
  • 4GANDRABUR S, FOSTER G. Confidence estimation for text prediction [ C ] //Proceedings of the Conference on Natural Language Learning (CoNLL) 2003.
  • 5DOWNEY D, ETZIONI O, SODERLAND S. A probabilistic model of redundancy in information extraction [ C ] //Proceedings of the 19th International Joint Conference on Artificial Intelligence ( IJCAI), 2005.
  • 6HASEGAWA T, SEKINE S, GRISHMAN R. Discovering relations among named entities from large corpora [ C ] //Proceedings of the Annual Meeting of Association of Computational Linguistics (ACL), 2004.
  • 7RILOFF E. Automatically generating extraction patterns from untagged text [ C ] // Proceedings of the Thirteenth National Conference on Artificial Intelligence, 1996.
  • 8ZHAO Shubin, GRISHMAN R. Extracting relations with integrated information using kernel methods [ C ] // Proceedings of the annual meeting of ACL, 2005.

同被引文献6

引证文献1

二级引证文献12

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部