期刊文献+

基于依存分析的开放式中文实体关系抽取方法 被引量:27

Open Chinese Entity Relation Extraction Method Based on Dependency Parsing
下载PDF
导出
摘要 实体关系抽取是信息抽取的组成部分,其目标是确定实体之间是否存在某种语义关系。由于中文语法错综复杂、表达方式灵活、语义多样等固有性质的限制,导致在中文中以动词作为关系表述容易引起实体间的关系含糊不清。为此,利用依存分析,提出一种开放式中文实体关系抽取方法。对输入的单句进行依存分析,通过依存分析输出的依存弧判断单句是否为动词谓语句,如果是动词谓语句则结合中文语法启发式规则抽取关系表述。根据距离确定论元位置,对三元组进行评估,输出符合条件的三元组。在Sogou CA和Sogou CS语料库上的实验结果表明,提出的方法适用于大规模语料库,具有较好的性能与可移植性。与基于卷积树核的无监督层次聚类方法相比,F值提高了16.68%。 Entity relation extraction is a part of the Information Extraction(IE).Its objective refers to determining whether there is a kind of semantic relationship between entities.To break the limitations of complex Chinese grammar,flexible expression and various semantic,which results in the vague relationship between entities simply using verbs as relational expressions in Chinese,this paper presents an open Chinese entity relation extraction method using dependency parsing.This method first does dependency parsing to the input sentence.Whether it is verb predicate sentence can be judged through the dependency arc by dependency parsing.If it is verb predicate sentence,relationship expression can be extracted combined with Chinese grammar heuristic rule.The location of the argument is determind according to the distance,evaluating the triples and outputting these qualified triples.Experimental results on SogouCA and SogouCS corpus show that the proposed method is suitable for large-scale corpus,and has good performance and portability.Contrast with unsupervised clustering method based on kernel tree,F-measure is increased by 16.68%.
作者 李明耀 杨静
出处 《计算机工程》 CAS CSCD 北大核心 2016年第6期201-207,共7页 Computer Engineering
基金 上海市科委基金资助项目(14511107000)
关键词 开放式信息抽取 中文实体关系抽取 依存分析 无监督 启发式规则 Open Information Extraction(OIE) Chinese entity relation extraction dependency parsing unsupervised heuristic rule
  • 相关文献

参考文献18

  • 1Bach N,Badaskar S.A Review of Relation Extraction[D].Pittsburgh,USA:Carnegie Mellon School,2007.
  • 2Banko M,Cafarella M J,Soderland S,et al.Open Information Extraction from the Web[C]//Proceedings of the 20th International Joint Conference on Artifical Intelligence.New York,USA:ACM Press,2007:2670-2676.
  • 3赵军,刘康,周光有,蔡黎.开放式文本信息抽取[J].中文信息学报,2011,25(6):98-110. 被引量:61
  • 4Wu Fei,Weld D S.Open Information Extraction Using Wikipedia[C]//Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics.New York,USA:ACM Press,2010:118-127.
  • 5Fader A,Soderland S,Etzioni O.Identifying Relations for Open Information Extraction[C]//Proceedings of Conference on Empirical Methods in Natural Language Processing.New York,USA:ACM Press,2011:1535-1545.
  • 6Etzioni O,Fader A,Christensen J,et al.Open Information Extraction:The Second Generation[C]//Proceedings of the 22nd International Joint Conference on Artificial Intelligence.Berlin,Germany:Springer,2011:3-10.
  • 7张素香,李蕾,秦颖,钟义信.基于Boot Strapping的中文实体关系自动生成[J].微电子学与计算机,2006,23(12):15-18. 被引量:3
  • 8Chen Jinxiu,Dong Hong.Relation Extraction Using Label Propagation Based Semi-supervised Learning[C]//Proceedings of the 21st International Conference on Computational Linguistics.Stroudsburg,USA:Association for Computational Linguistics,2006:129-136.
  • 9Cvitas A.Relation Extraction from Text Document[C]//Proceedings of the 34th International Convention on Manufactured Imports Promotion Organization.Washington D.C.,USA:IEEE Press,2011:23-27.
  • 10Kozareva Z,Hovy E.Not all Seeds are Equal:Measuring the Quality of Text Mining Seeds[C]//Proceedings of the Annual Conference of the North American Chapter of the Association for Computational Linguistics.Los Angeles,USA:Association for Computational Linguistics,2010:618-626.

二级参考文献112

  • 1车万翔,刘挺,李生.实体关系自动抽取[J].中文信息学报,2005,19(2):1-6. 被引量:115
  • 2李宝敏.基于语义的Internet研究[J].微电子学与计算机,2005,22(9):130-133. 被引量:4
  • 3袁里驰,钟义信.基于相似度的词聚类算法[J].微电子学与计算机,2005,22(8):93-95. 被引量:4
  • 4董静,孙乐,冯元勇,黄瑞红.中文实体关系抽取中的特征选择研究[J].中文信息学报,2007,21(4):80-85. 被引量:55
  • 5Kambhatla N. Combining lexical, syntactic and semantic features with Maximum Entropy models for ex- tracting relations[C]//ACL-2004(Poster) : 178-181.
  • 6Zhao S B and Grishman R. Extracting relations with integrated information using kernel-based methods [C] //ACL-2005:419-426.
  • 7Zhou G D, Su J, Zhang J and Zhang M. Exploring various knowledge in relation extraction[C] //ACL- 2005 : 427-434.
  • 8Jiang J and Zhai C X. A Systematic Exploration of the Feature Space for Relation Extraction[C]//NAACL- HLT-2007 : 113-120.
  • 9Zelenko D, Aone C and Richardella A. Kernel-based methods for relation extraction[J]. Journal of Machine Learning Research, 2003, 3(Feb) :1083-1106.
  • 10Culotta A and Sorensen J. Dependency tree kernels for relation extraction[C]//ACL-2004: 423-429.

共引文献102

同被引文献186

引证文献27

二级引证文献311

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部