
Review of Named Entity Disambiguation Studies (命名实体消歧研究综述)
Cited by: 1
Abstract: Entity disambiguation is the process of linking an identified entity mention to its corresponding entry in a given knowledge base. Its task is to use contextual information to resolve the ambiguity that arises when a single named entity mention corresponds to multiple entity concepts. It plays an important role in knowledge graph construction, where information must be extracted accurately from massive data, and it is a fundamental task in natural language processing. This paper reviews research on entity disambiguation techniques. First, it describes the domestic and international research background and comprehensively surveys the related theory, including named entity recognition, candidate entity generation, and candidate entity ranking. Second, it gives a detailed overview of what entity disambiguation means and what its research covers, and analyzes the characteristics of that research. Third, it divides the implementation methods into three categories and summarizes the datasets involved, discusses from four aspects the difficulties in the field and the ways to improve disambiguation accuracy, and summarizes the advantages and disadvantages of the methods along with the evaluation metrics, with the aim of offering new ideas for improving disambiguation performance. Finally, it summarizes the applications and development prospects of entity disambiguation techniques.
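The candidate entity generation and candidate entity ranking stages named in the abstract can be illustrated with a small sketch. This is not a method from the paper: the toy knowledge base `KB`, the alias table `ALIASES`, and all function names below are hypothetical, and bag-of-words cosine similarity merely stands in for whatever ranking model a real system would use.

```python
from collections import Counter
import math

# Hypothetical toy knowledge base: entity id -> short description.
KB = {
    "Apple_Inc.": "american technology company that designs the iphone and mac computers",
    "Apple_(fruit)": "edible fruit produced by the apple tree grown in orchards",
}

# Hypothetical alias table: lowercased surface form -> candidate entity ids.
ALIASES = {"apple": ["Apple_Inc.", "Apple_(fruit)"]}


def bag_of_words(text):
    """Lowercase, whitespace-tokenize, and count tokens."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two token-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def generate_candidates(mention):
    """Candidate entity generation: look the mention up in the alias table."""
    return ALIASES.get(mention.lower(), [])


def rank_candidates(mention, context):
    """Candidate entity ranking: score each candidate against the context."""
    ctx = bag_of_words(context)
    scored = [
        (cosine(ctx, bag_of_words(KB[entity])), entity)
        for entity in generate_candidates(mention)
    ]
    return sorted(scored, reverse=True)


def disambiguate(mention, context):
    """Return the top-ranked entity id, or None if there are no candidates."""
    ranked = rank_candidates(mention, context)
    return ranked[0][1] if ranked else None


print(disambiguate("Apple", "Apple released a new iphone and updated its mac computers"))
print(disambiguate("apple", "she picked a ripe apple from the tree in the orchard"))
```

Under these assumptions, the first call resolves to `Apple_Inc.` and the second to `Apple_(fruit)`, because each context shares more tokens with the corresponding description. A mention missing from the alias table yields `None`, which corresponds to the case where no entry in the knowledge base can be linked.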
Authors: LI Xin-yu (李欣宇); ZHAO Zhen (赵震) (School of Information Science and Technology, Bohai University, Jinzhou 121013, China)
Source: Computer Technology and Development (《计算机技术与发展》), 2024, No. 2, pp. 1-8 (8 pages)
Funding: National Natural Science Foundation of China (61976027); Basic Scientific Research Project of the Education Department of Liaoning Province (LJKZ1028); Bohai University 2021 Graduate Education and Teaching Reform Project (YJG20210022).
Keywords: entity disambiguation; named entity recognition; knowledge graph; natural language processing; review
