期刊文献+

一种基于图划分的无监督汉语指代消解算法 被引量:19

A New Graph Clustering Algorithm for Chinese Noun Phrase Coreference Resolution
下载PDF
导出
摘要 指代消解是自然语言处理领域中的一个重要问题。针对当前中文指代标注训练语料非常缺乏的现状,本文提出一种无监督聚类算法实现对名词短语的指代消解。引入图对名词短语的指代消解问题进行建模,将指代消解问题转化为图划分问题,并引入一个有效的模块函数实现对图的自动划分,使得指代消解过程并不是孤立地对每一对名词短语分别进行共指决策,而是充分考虑了多个待消解项之间的相关性,并且避免了阈值选择问题。通过在ACE中文语料上的人称代词消解和名词短语消解实验结果表明,该算法是一种有效可行的无监督指代消解算法。 Coreference resolution plays an important role in natural language processing. Facing the fact that the Chinese training corpus for coreference resolution is heavily lacking, this paper presents a new unsupervised clustering algorithm for noun phrase coreference resolution. In this approach, the problem of coreference resolution is firstly converted as a graph clustering problem, and then an objective function called the modularity function, which allows automatic selection of the number of clusters, is selected for graph clustering. The proposed algorithm does not make pairwise coreference decisions independently of each other. The experimental results on the Chinese ACE training corpus demonstrate that the proposed method is a feasible unsupervised algorithm for noun phrase coreference resolution.
出处 《中文信息学报》 CSCD 北大核心 2007年第2期77-82,共6页 Journal of Chinese Information Processing
基金 国家863高技术研究发展计划资助项目(2006AA01Z143) 国家自然科学基金资助项目(60673043) 江苏省自然科学基金项目(BK2006117)
关键词 人工智能 自然语言处理 聚类 指代消解 模块函数 artificial intelligence natural language processing clustering coreference resolution modularity function
  • 相关文献

参考文献15

  • 1T. Morton. Coreference for NLP applications[A]. In:Proc. of the ACL[C]. 2000.
  • 2D. Zelenko, C. Aone, and J. Tibbetts. Coreference resolution for information extraction[A]. In: Proc. of the ACL Workshop on Reference Resolution and its Applications[C]. 2004. 9-16.
  • 3Kee van Deemter, Rodger Kibble. On Coreferring:Coreference in MUC and Related Annotation Schemes[J]. Computational Linguistics, 2000, 26(4): 615-623.
  • 4王厚峰.指代消解的基本方法和实现技术[J].中文信息学报,2002,16(6):9-17. 被引量:46
  • 5W. M. Soon, H. T. Ng, and D. Lim. A machine learning approach to coreference resolution of noun phrases[J]. Computational Linguistics, 2001, 27(4):521-544.
  • 6V. Ng and C. Cardie.. Improving machine learning approaches to coreference resolution[A]. In: Proc. of the ACL[C]. 2002. 104-111.
  • 7X. Yang, G. D. Zhou, J. Su, and C. L. Tan. Coreference resolution using competitive learning approach[A]. In: Proc. of the ACL 2003[C]. 176-183.
  • 8X. Luo, A. Ittycheriah, H. Jing, N. Kambhatla, and S. Roukos. A mention-synchronous coreference resolution algorithm based on the Bell tree[A]. In: Proc.of the ACL 2004[C]. 136-143.
  • 9王厚峰,梅铮.鲁棒性的汉语人称代词消解[J].软件学报,2005,16(5):700-707. 被引量:36
  • 10C. Cardie and K. Wagstaff.. Noun phrase coreference as clustering[A]. In: Proc. of EMNLP/VLC[C]. 1999. 82-89.

二级参考文献14

  • 1秦洪武.第三人称代词在深层回指中的应用分析[J].当代语言学,2001,3(1):55-64. 被引量:34
  • 2马彦华 黄昌宁 等.汉语中人称代词指代问题研究.1998年中文信息处理国际会议论文集[M].北京,1998..
  • 3郭志立.人称代词指代主体的辨析及其在摘要提取中的应用.1998年中文信息处理国际会议会论文集[M].北京清华大学出版社,1998.310-315.
  • 40uinlan, CA.5: Programs for Machine Learning[M], Morgan Kaufinann, 1993.
  • 5Michael PAUL, Kazuhide YAMAMU1D, Eiichiro SUMITA. Corpus-Based Anaphora Resolution Towards Antecedent Preference[A]. In: Proc. of the 37th ACL, Workshop Coreference and It's Application, Maryland, USA, 1999,47-52.
  • 6W M Soon, HTNg, C Y Lim. Corpus-based leamirg for noun phrase conference msolulion[A]. In: Proc. of the Joint Conf on Eaoirical Methods in NIP and Very Large Corpora. Maryland: Univetsity of Maryland, USA, 1999, 285-291.
  • 7M Vilain, J Aberdeen et al. A model-theoretic coreference scoring scheme. [A] In: Proc. of the 6th Message Understanding Corlf (MUC6), San Francisco: Morgan Kaufinann Publishers, 1995,45-52.
  • 8Caroline Gasperin and Renata Viera, Using word similarity lists for resolving indirect anaphora[A], ACI2004 workshop on coreference resolution, 2004.
  • 9Dmitry Zelenko, Chinatsu Aone, Jason Tibbetts, Coreference Resolution for Information Extraction [A], ACL2004 workshop on Coreference resolution, 2004.
  • 10许敏,王能忠,马彦华.汉语中指代问题的研究及讨论[J].西南师范大学学报(自然科学版),1999,24(6):633-637. 被引量:18

共引文献83

同被引文献263

引证文献19

二级引证文献77

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部