
Research on a Chinese Personal Name Disambiguation Method Based on Hierarchical Clustering (cited by: 1)

Chinese Personal Name Disambiguation: Based on Hierarchical Agglomerative Clustering Approach
Abstract: Personal name disambiguation has recently become an active research topic in natural language processing. Because of the complexity of the Chinese language, Chinese personal name disambiguation is considered more difficult than its English counterpart. Building on a hierarchical agglomerative clustering algorithm, this paper examines how Chinese personal name detection affects Chinese personal name disambiguation, and how effective features for disambiguation can be extracted automatically. Experiments show that feature integration is an effective way to improve system performance. Results in the evaluation jointly organized by the Chinese Information Processing Society of China (CIPS) and SIGHAN indicate that the proposed method is effective.
Source: Mind and Computation (《心智与计算》), 2010, No. 4, pp. 236-241 (6 pages)
Keywords: Chinese personal name disambiguation; clustering; Chinese personal name detection; feature extraction; feature integration
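
The approach named in the abstract, hierarchical agglomerative clustering over integrated features, can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the feature types (`words`, `entities`), the weighted-sum form of feature integration in `fuse`, average linkage with cosine similarity, the stopping `threshold`, and the toy documents.

```python
import math
from collections import Counter


def cosine(a, b):
    """Cosine similarity between two sparse feature-weight vectors."""
    dot = sum(a[k] * b[k] for k in a if k in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def fuse(feature_sets, weights):
    """Feature integration (assumed form): one vector with per-type weights."""
    fused = Counter()
    for name, feats in feature_sets.items():
        for f in feats:
            # Key by (type, value) so different feature types never collide.
            fused[(name, f)] += weights.get(name, 1.0)
    return fused


def hac(vectors, threshold):
    """Average-linkage agglomerative clustering.

    Repeatedly merges the most similar pair of clusters and stops once the
    best average similarity falls below `threshold`.
    """
    clusters = [[i] for i in range(len(vectors))]
    while len(clusters) > 1:
        best, pair = -1.0, None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                sims = [cosine(vectors[a], vectors[b])
                        for a in clusters[i] for b in clusters[j]]
                avg = sum(sims) / len(sims)
                if avg > best:
                    best, pair = avg, (i, j)
        if best < threshold:
            break
        i, j = pair
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]
    return [sorted(c) for c in clusters]


# Hypothetical documents that all mention the same personal name.
docs = [
    {"words": ["university", "professor", "research"], "entities": ["Beijing"]},
    {"words": ["professor", "research", "paper"], "entities": ["Beijing"]},
    {"words": ["match", "team", "goal"], "entities": ["Shanghai"]},
    {"words": ["team", "goal", "season"], "entities": ["Shanghai"]},
]
weights = {"words": 1.0, "entities": 2.0}  # assumed weights, not from the paper
vectors = [fuse(d, weights) for d in docs]
clusters = hac(vectors, threshold=0.3)
print(clusters)
```

Each resulting cluster is read as one real-world person sharing the name. The threshold decides how many people are found, so in practice it would have to be tuned on held-out data.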

