Chinese multi-document personal name disambiguation (cited by: 8)

Abstract: This paper presents a new approach to determining whether a personal name of interest refers to the same entity across documents. First, three vectors are formed for each text: the personal name Boolean vector, denoting whether a personal name occurs in the text; the biographical word Boolean vector, representing title, occupation and so forth; and the feature vector with real values. Then, by combining a heuristic strategy based on the Boolean vectors with an agglomerative clustering algorithm based on the feature vectors, the approach seeks to resolve multi-document personal name coreference. Experimental results on the 'Wang Gang' corpus show that the approach performs well.
Source: High Technology Letters (高技术通讯, English edition), indexed in EI and CAS, 2005, Issue 3, pp. 280-283 (4 pages)
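Funding: National High Technology Research and Development Program of China (863 Program); National Natural Science Foundation of China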
Keywords: personal name disambiguation; Chinese multi-document; heuristic strategy; agglomerative clustering; Chinese multi-document system; word processing software; Boolean vector
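
The two-stage idea in the abstract (a heuristic on the Boolean vectors combined with agglomerative clustering on the real-valued feature vectors) can be illustrated with a minimal Python sketch. The abstract does not give the heuristic rule, the similarity measure, the linkage criterion, or any threshold, so everything below is an assumption made for illustration: the rule "merge if two documents share at least `min_shared` names or biographical words", cosine similarity, average linkage, the `threshold` value, and the toy vocabularies and documents are all hypothetical and are not taken from the paper.

```python
from math import sqrt

def bool_vector(tokens, vocabulary):
    """Boolean vector: 1 if the vocabulary item occurs in the document, else 0."""
    present = set(tokens)
    return [1 if term in present else 0 for term in vocabulary]

def shared(u, v):
    """Number of positions where both Boolean vectors are 1."""
    return sum(a & b for a, b in zip(u, v))

def cosine(u, v):
    """Cosine similarity of two real-valued vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = sqrt(sum(a * a for a in u)) * sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def cluster(docs, min_shared=2, threshold=0.5):
    """Group document indices that are assumed to mention the same person.

    docs: list of dicts with Boolean vectors 'names' and 'bio' and a
    real-valued vector 'feat'. Both parameters are hypothetical knobs.
    """
    clusters = [[i] for i in range(len(docs))]

    def heuristic_link(a, b):
        # Assumed rule: some document pair shares enough names/biographical words.
        return any(
            shared(docs[i]["names"], docs[j]["names"])
            + shared(docs[i]["bio"], docs[j]["bio"]) >= min_shared
            for i in a for j in b
        )

    def avg_sim(a, b):
        # Average-linkage similarity over the real-valued feature vectors.
        sims = [cosine(docs[i]["feat"], docs[j]["feat"]) for i in a for j in b]
        return sum(sims) / len(sims)

    while len(clusters) > 1:
        best, pair = -1.0, None
        for x in range(len(clusters)):
            for y in range(x + 1, len(clusters)):
                a, b = clusters[x], clusters[y]
                # A heuristic match counts as maximal similarity.
                score = 1.0 if heuristic_link(a, b) else avg_sim(a, b)
                if score > best:
                    best, pair = score, (x, y)
        if best < threshold:
            break  # no remaining pair is similar enough to merge
        x, y = pair
        clusters[x] = clusters[x] + clusters[y]
        del clusters[y]
    return clusters

# Toy data (invented): tokens mix co-occurring names and biographical words.
name_vocab = ["Li Ming", "Zhang Wei", "Chen Hong"]
bio_vocab = ["professor", "actor", "mayor"]
raw = [
    (["Li Ming", "professor"], [1.0, 0.0, 0.0]),
    (["Li Ming", "professor", "Zhang Wei"], [0.0, 1.0, 0.0]),
    (["actor"], [0.9, 0.1, 0.0]),
    (["Chen Hong", "mayor"], [0.0, 0.0, 1.0]),
]
docs = [
    {"names": bool_vector(toks, name_vocab),
     "bio": bool_vector(toks, bio_vocab),
     "feat": feat}
    for toks, feat in raw
]
print(cluster(docs))
```

In this toy run, documents 0 and 1 are joined by the Boolean heuristic even though their feature vectors are orthogonal, while document 2 joins them only through feature similarity. That is the kind of complementarity the abstract suggests, though the paper's actual rules may differ.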