
Chinese Cross-Document Co-reference Resolution Based on SVM Classification and Semantics (cited by: 5)
Abstract: Cross-Document Co-reference Resolution (CDCR) merges mentions that are distributed across different texts but refer to the same entity into co-reference chains. Traditional CDCR research mainly uses clustering methods to address the name disambiguation problem encountered in information retrieval. This paper recasts the clustering problem as a classification problem and uses a Support Vector Machine (SVM) classifier to resolve both name disambiguation and variant consolidation, both of which are prevalent in information extraction. The method can effectively integrate word-formation (morphological) and phonetic features of entity names with a variety of semantic features from inside and outside the text (collected from the corpus and the Internet). Experiments on a Chinese cross-document co-reference corpus show that, compared with clustering methods, the classification method improves both precision and recall.
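The classification view described in the abstract can be pictured with a minimal sketch: every pair of mentions is judged "same entity" or "different entity", and positive pairs are then merged into co-reference chains. Everything below is an illustrative assumption rather than the paper's implementation: the mentions, the two toy features, the hand-set weights in `same_entity` (a stand-in for a trained linear SVM decision function), and the union-find merging step are all hypothetical.

```python
from itertools import combinations

# Hypothetical mentions: (document id, entity name, context words).
MENTIONS = [
    ("d1", "Li Na", {"tennis", "open", "champion"}),
    ("d2", "Li Na", {"tennis", "final", "champion"}),
    ("d3", "Li Na", {"singer", "album", "song"}),
]


def pair_features(a, b):
    """Toy feature vector for a mention pair: name match and context overlap."""
    name_match = 1.0 if a[1] == b[1] else 0.0
    union = a[2] | b[2]
    overlap = len(a[2] & b[2]) / len(union) if union else 0.0
    return [name_match, overlap]


def same_entity(features, weights=(0.5, 2.0), bias=-1.0):
    """Stand-in for a trained linear SVM: sign of w.x + b (weights are made up)."""
    score = sum(w * x for w, x in zip(weights, features)) + bias
    return score > 0


def build_chains(mentions):
    """Merge positively classified pairs into chains with union-find."""
    parent = list(range(len(mentions)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    for i, j in combinations(range(len(mentions)), 2):
        if same_entity(pair_features(mentions[i], mentions[j])):
            parent[find(i)] = find(j)

    chains = {}
    for i, mention in enumerate(mentions):
        chains.setdefault(find(i), []).append(mention[0])
    return sorted(chains.values())


chains = build_chains(MENTIONS)
print(chains)
```

In this toy setting the two tennis-related mentions end up in one chain and the singer in another, which is the name disambiguation case. Variant consolidation would additionally need name-similarity features (for example morphological or phonetic ones) so that different surface names can be classified as the same entity.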
Source: Journal of Computer Applications (《计算机应用》), CSCD, Peking University Core Journal, 2013, No. 4, pp. 984-987 (4 pages)
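Funding: National Natural Science Foundation of China (60873150, 90920004); Natural Science Foundation of Jiangsu Province (BK2010219); Major Natural Science Project of Jiangsu Higher Education Institutions (11KJA520003)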
Keywords: cross-document co-reference resolution; information extraction; Support Vector Machine (SVM) classifier; semantic information; name disambiguation; variant consolidation

