期刊文献+

语言学组合特征在语义关系抽取中的应用 被引量:16

The Application of Combined Linguistic Features in Semantic Relation Extraction
下载PDF
导出
摘要 语义关系抽取是信息抽取中的一个重要的研究领域。目前基于特征向量的语义关系抽取已经很难通过发掘新的特征来提高抽取的性能。本文提出了一种特征组合方法,通过在各种词法、语法、语义的基本特征内部及特征之间进行合理的组合形成组合特征,使用基于支持向量机的学习方法,使得关系抽取的准确率和召回率得到了提高。在ACE2004语料库的7个关系大类和23个关系子类抽取实验中F值分别达到了66.6%和59.50%。实验结果表明通过对基本语言学特征进行组合所得到的组合特征能够显著地提高语义关系抽取的性能。 Semantic relation extraction is one of the important fields in information extraction research. The present feature vector based approach for semantic relation extraction can hardly be improved simply by mining new features, This paper presents a novel method through combining the diverse basic lexical, syntactic and semantic features to form new combined features. The experiments show that these combined features positively improve the precision and recall of the SVM based relation extraction. The F-score of relation extraction for the 7 major types and 23 subtypes in ACE 2004 corpora achieves 66.6% and 59.50% respectively.
出处 《中文信息学报》 CSCD 北大核心 2008年第3期44-49,63,共7页 Journal of Chinese Information Processing
基金 “863”国家高技术研究发展计划资助项目(2006AA01Z147) 国家自然科学基金资助项目(60673041)
关键词 计算机应用 中文信息处理 语义关系抽取 支持向量机 组合特征 computer application Chinese information processing semantic relation extraction support vector machine combined features
  • 相关文献

参考文献12

  • 1郑家恒,王兴义,李飞.信息抽取模式自动生成方法的研究[J].中文信息学报,2004,18(1):48-54. 被引量:22
  • 2ZHOU G D, SU J, ZHANG J, et al. Exploring various knowledge in relation extraction[A]. UnivofMichgan-AnnArbor,USA: 25-30. ACL' 2005 [C] . June, 2005. 427-434.
  • 3ZHANG M, SU J, WANG D M, et al. Discovering Relations from a Large Raw Corpus Using Tree Simi larity based Clustering[A]. IJCNLP'2005 [C]. Jeju island, Korea :LNCS, October, 2005. 378-389.
  • 4KAMBHATLA N. Combining lexical, syntactic and semantic features with Maximum Entropy models for extracting relations [A]. ACL ' 2004 (poster) [C]. Barcelona,Spain:21-26 July, 2004. 178-181.
  • 5ZHAO S B, GRISMAN R. Extracting relations with integrated information using kernel methods [A]. ACL' 2005[C]. USA : 25-30 UnivofMichgan-AnnArbor June 2005. 419-426.
  • 6ACE 2004. The Automatic Content Extraction (ACE) Projects, 2007 (2007-4-20). http//www, ldc. upenn. edu/ Projects/ACE/.
  • 7WANG T, LI Y Y, KALINA B, et al. Automatic Extraction of Hierarchical Relations from Text[A]. Proceedings of the Third European Semantic Web Conference (ESWC 2006) [C]. USA: Springer, 2006:401-416.
  • 8ZHANG M, ZHANG J, SU J, et al. A Composite Kernel to Extract Relations between Entities with both Flat and Structured Features [A]. ACL' 2006 [C]. Sydney: July, 2006. 825-832.
  • 9车万翔,刘挺,李生.实体关系自动抽取[J].中文信息学报,2005,19(2):1-6. 被引量:115
  • 10董静,孙乐,冯元勇,黄瑞红.中文实体关系抽取中的特征选择研究[J].中文信息学报,2007,21(4):80-85. 被引量:55

二级参考文献30

  • 1车万翔,刘挺,李生.实体关系自动抽取[J].中文信息学报,2005,19(2):1-6. 被引量:115
  • 2梁晗,陈群秀,吴平博.基于事件框架的信息抽取系统[J].中文信息学报,2006,20(2):40-46. 被引量:38
  • 3[1]Ellen Riloff. Automatically Constructing a Dictionary for Information Extraction Tasks[C]. In: Proceedings of the Eleventh National Conference on Artificial Intelligence, 811-816. AAAI Press/ The MIT Press, 1993.
  • 4[2]Stephen Soderland, David Fisher, Jonathan Aseltine, and Wendy Lehnert. CRYSTAL: Inducing a conceptual dictionary[C]. In: Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence, 1314-1319, 1995.
  • 5[3]Ellen Riloff. Automatically Generating Extraction Patterns from Untagged Text[C]. In: Proceedings of Thirteenth National Conference on Artificial Intelligence (AAAI-96), 1044-1049. 1996.
  • 6[4]Ellen Riloff, Rosie Jones. Learning Dictionaries for Information Extraction by Multi-Level Bootstrapping[C]. In: Proceedings of the Sixteenth National Conference on Artificial Intelligence (AAAI-99), Orlando FL. 1999.
  • 7[5]Roman Yangarber, Ralph Grishman, Pasi Tapanainen and Silja Huttunen. Unsupervised Discovery of Scenario-Level Patterns for Information Extraction[C]. In: Proceedings of Sixth Applied Natural Language Processing Conference (ANLP-2000), 282-289, Seattle WA. 2000.
  • 8In: Proceedings of the 6th Message Understanding Conference (MUC - 7) [ C ]. National Institute of Standars and Technology, 1998.
  • 9C. Aone and M. Ramos-Santacruz. Rees: A large-scale relation and event extraction system[A]. In: Proceedings of the 6th Applied Natural Language Processing Conference[C] ,pages 76- 83, 2000.
  • 10S. Miller, M. Crystal, H. Fox, L. Ramshaw, R. Schwartz, R. Stone, R. Weischedel, and the Annotation Group.Algorithms that learn to extract information-BBN: Description of the SIFT system as used for MUC[ A]. In: Proceedings of the Seventh Message Understanding Conference (MUC-7)[C], 1998.

共引文献155

同被引文献185

引证文献16

二级引证文献128

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部