
Named entity disambiguation based on heterogeneous knowledge base (cited by: 9)
Abstract: To address Chinese named entity disambiguation in natural language processing, a hierarchical clustering method based on heterogeneous knowledge bases is proposed. A Chinese information extraction (IE) system extracts content from knowledge bases such as Chinese Wikipedia to build entity information objects (entity profiles) that contain person attributes and entity relations. These profiles are then disambiguated by hierarchical clustering, run as a distributed computation on the Hadoop platform. The paper studies how the choice of person-entity features used in the similarity measure, and the use of knowledge bases such as Wikipedia, affect the disambiguation results. The results show that adding the encyclopedia knowledge base raises the F-measure from 91.33% to 92.68%.
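Authors: Ning Bo (宁博), Zhang Feifei (张菲菲)
Source: Journal of Xi'an University of Posts and Telecommunications (《西安邮电大学学报》), 2014, No. 4, pp. 70-76 (7 pages)
Funding: Scientific Research Program of the Shaanxi Provincial Department of Education, natural science funded project (12JK0938)
Keywords: person name disambiguation (entity disambiguation); Wikipedia; Chinese information extraction; hierarchical clustering; entity information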
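
The clustering step described in the abstract can be illustrated with a small, single-machine sketch. This is not the authors' implementation: the Jaccard similarity, the average-linkage rule, the `threshold` value, and the toy profiles are all assumptions made for illustration, and the Hadoop distribution of the computation is not shown. Each profile is a hypothetical set of attribute and relation features for one mention of the same ambiguous name.

```python
from itertools import combinations


def jaccard(a, b):
    """Jaccard similarity between two feature sets (0.0 if both are empty)."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def cluster_similarity(c1, c2, profiles):
    """Average-linkage similarity between two clusters of profile indices."""
    total = sum(jaccard(profiles[i], profiles[j]) for i in c1 for j in c2)
    return total / (len(c1) * len(c2))


def agglomerative(profiles, threshold=0.3):
    """Repeatedly merge the most similar pair of clusters.

    Stops when the best available similarity falls below `threshold`
    (an assumed value, not one taken from the paper).
    """
    clusters = [[i] for i in range(len(profiles))]
    while len(clusters) > 1:
        best_sim, best_pair = -1.0, None
        for a, b in combinations(range(len(clusters)), 2):
            sim = cluster_similarity(clusters[a], clusters[b], profiles)
            if sim > best_sim:
                best_sim, best_pair = sim, (a, b)
        if best_sim < threshold:
            break
        a, b = best_pair  # a < b, so deleting index b leaves index a valid
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]
    return [sorted(c) for c in clusters]


# Hypothetical profiles for four mentions of one ambiguous person name.
profiles = [
    {"occupation:professor", "org:university_a", "city:city_x"},
    {"occupation:professor", "org:university_a", "colleague:person_w"},
    {"occupation:singer", "album:album_b", "city:city_y"},
    {"occupation:singer", "album:album_b"},
]
clusters = agglomerative(profiles)
print(clusters)
```

Each resulting cluster groups the mentions judged to refer to the same real-world person. Adding features drawn from an encyclopedia knowledge base would enlarge the feature sets being compared, which is the kind of effect the abstract reports on.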

References (16)

  • 1. Chen Ying. Research on named entity disambiguation based on Wikipedia [D]. Beijing: Beijing Institute of Technology, 2011: 29-35. (in Chinese)
  • 2. Bunescu R, Pasca M. Using encyclopedic knowledge for named entity disambiguation [C]//Proceedings of the 11th Conference of the European Chapter of the Association for Computational Linguistics (EACL-06), 2006: 9-16.
  • 3. Dredze M, McNamee P, Rao D, et al. Entity disambiguation for knowledge base population [C]//Proceedings of the 23rd International Conference on Computational Linguistics, 2010: 277-285.
  • 4. Cucerzan S. Large-scale named entity disambiguation based on Wikipedia data [C]//Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL), 2007: 708-716.
  • 5. Fangtao Li, Zhicheng Zheng, Fan Bu, et al. THU QUANTA at TAC 2009 KBP and RTE track [C]//Text Analysis Conference (TAC), 2009: 136-147.
  • 6. Zhao Fei, Zhou Tao, Zhang Liang, Ma Minghui, Liu Jinhu, Yu Fei, Zha Yilong, Li Ruiqi. A survey of Wikipedia research [J]. Journal of University of Electronic Science and Technology of China, 2010, 39(3): 321-334. (in Chinese) Cited by: 38
  • 7. Han Xianpei, Zhao Jun. Structural semantic relatedness: a knowledge-based method to named entity disambiguation [C]//Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, 2010: 50-59.
  • 8. Zhang Haisu, Ma Daming, Deng Zhilong. Research on a Wikipedia-based semantic knowledge base and its construction method [J]. Application Research of Computers, 2011, 28(8): 2807-2811. (in Chinese) Cited by: 26
  • 9. Soon W M, Ng H T, Lim D C Y. A machine learning approach to coreference resolution of noun phrases [J]. Computational Linguistics, 2001, 27(4): 521-544.
  • 10. Manning C D, Raghavan P, Schütze H. Introduction to Information Retrieval [M]. Cambridge: Cambridge University Press, 2008: 21-27.

Secondary references (80)

Co-citing literature (130)

Co-cited literature (81)

  • 1. Wang Yongsheng. A word sense disambiguation algorithm based on an improved Lesk algorithm [J]. Microcomputer & Its Applications, 2013, 32(24): 69-71. (in Chinese) Cited by: 4
  • 2. Wu Yunfang, Jin Peng, Guo Tao. Coarse-grained word sense disambiguation based on dictionary attribute features [J]. Journal of Chinese Information Processing, 2007, 21(2): 3-8. (in Chinese) Cited by: 10
  • 3. Sun Jigui, Liu Jie, Zhao Lianyu. Clustering algorithms research [J]. Journal of Software, 2008(1): 48-61. (in Chinese) Cited by: 1072
  • 4. Guha V, Garg A. Disambiguating people in search [C]//The Thirteenth International World Wide Web Conference. 2004: 22-32.
  • 5. Artiles J, Gonzalo J, Verdejo F. A testbed for people searching strategies in the WWW [C]//Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. New York, 2005: 569-570.
  • 6. Chen Ying, Jin Peng, Li Wenjie, et al. Exploration of personal name disambiguation in Chinese news [C]//CIPS-SIGHAN Joint Conference on Chinese Language Processing. 2010: 20-26.
  • 7. He Zhengyan, Wang Houfeng, Li Sujian. The Task 2 of CIPS-SIGHAN 2012 named entity recognition and disambiguation in Chinese bakeoff [C]//CIPS-SIGHAN Joint Conference on Chinese Language Processing. 2012: 108-114.
  • 8. Han X, Zhao J. CASIANED: web personal name disambiguation based on professional categorization [C]//2nd Web People Search Evaluation Workshop (WePS 2009), 18th WWW Conference. 2009: 2-5.
  • 9. Long Chong, Shi Lei. Web person name disambiguation by relevance weighting of extended feature sets [C]//CLEF (Notebook Papers/LABs/Workshops). 2010: 1-13.
  • 10. Guerreiro J, Goncalves D, de Matos D M. Towards a fair comparison between name disambiguation approaches [C]//Proceedings of the 10th Conference on Open Research Areas in Information Retrieval. 2013: 17-20.

Citing literature (9)

Secondary citing literature (24)
