期刊文献+

基于多知识库的短文本实体链接方法研究——以Wikipedia和Freebase为例 被引量:7

Entity Linking Method for Short Texts with Multi-Knowledge Bases:Case Study of Wikipedia and Freebase
原文传递
导出
摘要 【目的】基于多知识库进行实体链接,解决基于单一知识库的实体链接覆盖度低的问题。【方法】首先生成文本的n-gram并利用词性和多个指称–实体字典获取候选指称,然后生成指称组合并保留覆盖度最大且不被其他组合包含的指称组合,接着生成候选实体序列并利用多知识库信息计算实体序列的相关度,最后选择相关度最大的实体序列为最终结果。【结果】以Wikipedia和Freebase为例的实验结果表明,基于Wikipedia+Freebase的实体链接准确率、召回率、F值分别达到71.81%、76.86%、74.25%。【局限】基于词性过滤n-gram缺乏理论依据,数据集FACC1具有高准确率和低召回率的特点。【结论】利用多个知识库的实体信息,能够提升实体链接效果。 [Objective] This paper proposes an entity linking method using multi-knowledge bases, aiming at solving the problem of low coverage caused by entity linking with single knowledge base. [Methods] First, we generated n-gram of input text and obtained candidate mentions using part of speech and multi-mention-entity dictionary. Second, we generated and retained mention combinations of highest coverage which are not contained by other mention combinations. Third, we generated entity sequences and calculated their relevence degree using information from multi-knowledge bases. We listed entity sequence with the highest relevence degree as the final result. [Results] This case study showed that the Precision, Recall, and F-value of the entity linking based on Wikipedia+Freebase reaches 71.81%, 76.86%, and 74.25% respectively. [Limitations] Filtering n-gram based on part of speech lacked theoretical foundation, and the FACC1 dataset featured high precision but low recall. [Conclusions] Utilizing entity information from multi-knowledge bases can improve the performance of entity linking.
出处 《现代图书情报技术》 CSSCI 2016年第6期1-11,共11页 New Technology of Library and Information Service
基金 国家自然科学基金面上项目"基于语言模型的通用实体检索建模及框架实现研究"(项目编号:71173164) 武汉大学与中国科技信息研究所合作项目"科学文献的语义功能识别与深度利用"的研究成果之一
关键词 实体链接 知识库 WIKIPEDIA Freebase Entity linking Knowledge base Wikipedia Freebase
  • 相关文献

参考文献1

二级参考文献51

  • 1Hachey B, Radford W, Nothman J, et al. Evaluating entity linking with wikipedia [ J ]. Artificial Intelligence, 2013, 194(4) : 130-150.
  • 2Dill S, Eiron N, Gibson D, et al. SemTag and Seeker: Bootstrapping the Semantic Web via Automated Semantic Annotation[ C l// Proceedings of the 12th International Conference on World Wide Web, Budapest, Hungary, 2003 : 178-186.
  • 3Bunescu R C, Pasca M. Using Encyclopedic Knowledge for Named entity Disambiguation[ C]//Proceedings of the llth Conference of the European Chapter of the Association for Computational Linguistics, Trento, Italy, 2006 : 9-16.
  • 4Bollacker K, Evans C, Paritosh P, et al. Freebase: a Collaboratively Created Graph Database for Structuring Human Knowledge [ C ]//Proceedings of the 2008 ACM SIGMOD international Conference on Management of Data, Vancouver, BC, Canada, 2008: 1247-1249.
  • 5Suchanek F M, Kasneci G, Weikum G. Yago: a Core of Semantic Knowledge [ C ]//Proceedings of the 16th International Conference on World Wide Web, Banff, Alberta, Canada, 2007: 697-706.
  • 6Pantel P, Fuxman A. Jigs and Lures: Associating Web Queries with Structured Entities [ C ]//Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, USA, 2011 : 83-92.
  • 7Lin T, Mausam Etzioni O. Entity Linking at Web Scale [ C]//Proceedings of the Joint Workshop on Automatic Knowledge Base Construction and Web-scale Knowledge Extraction, Montreal, Canada, 2012: 84-88.
  • 8Balasuriya D, Ringland N, Nothman J, et al. Named Entity Recognition in Wikipedia[ C ]//Proceedings of the 2009 Workshop on The People's Web Meets NLP, Suntec, Singapore, 2009: 10-18.
  • 9Toral A, Munoz R. A Proposal to Automatically Build and Maintain Gazetteers for Named Entity Recognition by Using Wikipedia[ C ]//Proceedings of the Workshop on NEW TEXT Wikis and Blogs and Other Dynamic Text Sources, Trento, Italy, 2006: 56-61.
  • 10Kazama J I, Torisawa K. Exploiting Wikipedia as External Knowledge for Named Entity Recognition [ C ]//Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Prague, 2007 : 698-707.

共引文献18

同被引文献89

引证文献7

二级引证文献40

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部