期刊文献+

复杂中文文本的实体关系抽取研究 被引量:1

Entity Relation Extraction for Complex Chinese Text
下载PDF
导出
摘要 实体关系抽取是信息抽取研究领域中的重要研究课题之一。针对已有方法在处理复杂文本上的不足,提出了复杂中文文本的实体关系抽取方法。结合中文文本的语法特征,提出了7条抽取关系特征序列的启发式规则,并采用语义序列核和KNN机器学习算法结合的方法来分类和标注关系的类型。通过对ACE评测定义下的两个子类的实体关系抽取,关系抽取的平均F值达到了76%,明显高于传统的基于特征向量和最短依存路径核的方法。 Entity Relation Extraction is one of the important research fields in Information Extraction. Aiming at the problem of inefficiency of existing approaches dealing with entity relation extraction, this paper presented a novel approach. This new approach proposes seven heuristic rules to extract relation feature sequence through combining with grammar feature of Chinese text, and applies the semantic sequence kernel function with KNN learning algorithm to fulfill the entity relation extraction task. Experiments are carried out on two kinds of relation types defined in the ACE guidelines, results show that the new approach achieves an average F-score up to 76 %, significantly higher than the traditional feature-based approaches and traditional shortest path for dependency kernel approaches.
出处 《计算机科学》 CSCD 北大核心 2009年第8期208-211,共4页 Computer Science
基金 国家自然科学基金重点项目(60433020) 湖南省自然科学基金(06JJ50142) 湖南省国土资源厅科技计划项目(200718)资助
关键词 实体关系抽取 语法特征 启发式规则 语义序列核 Entity relation extraction, Grammar feature, Heuristic rule, Semantic sequence kernel
  • 相关文献

参考文献9

  • 1车万翔,刘挺,李生.实体关系自动抽取[J].中文信息学报,2005,19(2):1-6. 被引量:115
  • 2董静,孙乐,冯元勇,黄瑞红.中文实体关系抽取中的特征选择研究[J].中文信息学报,2007,21(4):80-85. 被引量:55
  • 3刘克彬,李芳,刘磊,韩颖.基于核函数中文关系自动抽取系统的实现[J].计算机研究与发展,2007,44(8):1406-1411. 被引量:58
  • 4Li Weigang Liu Ting Li Sheng.BOOTSTRAPPING FOR EXTRACTING RELATIONS FROM LARGE CORPORA[J].Journal of Electronics(China),2008,25(1):89-96. 被引量:5
  • 5Zhang Min, Zhong Guo - along, Aw Aiti. Exploring syntactic struetured feature over parse trees for relation extraetion using kernel methods [J]. Information Processing and Management, 2008,44:687-701.
  • 6Culotta A, Sorensen J. Dependency tree kernels for relation extraction[C] // Proceedings of the 42nd Annual Meetings of the Association for Computational Linguistics (ACL-04). Barcelona, Spain July, 2004 : 423-429.
  • 7BUnescu R C, Mooney R J. A Shortest Path Dependency Kernel for Relation Extraction[C]//Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing. 2005 : 724-731.
  • 8Huang Rui - hong, Sun Le, Feng Yuan- yong. Study of kernel - based Methods for Chinese Relation Extraetion[C]//the LNCS, Springer, AIRS' s 08. 2008: 698-604.
  • 9Che Wang-xiang, Jiang Jian-rnin, Su Zhong, et al. Improved-editdistance Kernel for Chinese Relation Extraction[C]//Proc. of the Second International Joint Conference on Natural Language Processing (IJCNLP-05). 2005.. 132-137.

二级参考文献53

  • 1车万翔,刘挺,李生.实体关系自动抽取[J].中文信息学报,2005,19(2):1-6. 被引量:115
  • 2姜吉发,王树西.一种自举的二元关系和二元关系模式获取方法[J].中文信息学报,2005,19(2):71-77. 被引量:5
  • 3曾兴杰,李芳,张冬茉.采用开放语料库的跨领域模式自动获取[J].计算机仿真,2005,22(4):259-263. 被引量:1
  • 4梁晗,陈群秀,吴平博.基于事件框架的信息抽取系统[J].中文信息学报,2006,20(2):40-46. 被引量:38
  • 5In: Proceedings of the 6th Message Understanding Conference (MUC - 7) [ C ]. National Institute of Standars and Technology, 1998.
  • 6C. Aone and M. Ramos-Santacruz. Rees: A large-scale relation and event extraction system[A]. In: Proceedings of the 6th Applied Natural Language Processing Conference[C] ,pages 76- 83, 2000.
  • 7S. Miller, M. Crystal, H. Fox, L. Ramshaw, R. Schwartz, R. Stone, R. Weischedel, and the Annotation Group.Algorithms that learn to extract information-BBN: Description of the SIFT system as used for MUC[ A]. In: Proceedings of the Seventh Message Understanding Conference (MUC-7)[C], 1998.
  • 8S. Soderland. Learning information extraction rules for semi-structured and free text[J]. Machine Learning, 1999. 34(1 - 3) :233 - 272.
  • 9N. Cristianini and J. Shawe-Taylor. An Introduction to Support Vector Machines[ M]. Cambridge University Press,Cambirdge University, 2000.
  • 10T. Zhang. Regularized winnow methods[A]. In: Advances in Neural Information Processing Systems 13[C], pages703 - 709, 2001.

共引文献170

同被引文献5

引证文献1

二级引证文献5

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部