期刊文献+

基于重排序的迭代式实体对齐 被引量:9

Iterative Entity Alignment via Re-Ranking
下载PDF
导出
摘要 现有的知识图谱无法避免地存在不完整这一问题.缓解此问题的可行方法是引入外部知识图谱中的知识.在此过程中,实体对齐是最关键的步骤.当前最先进的实体对齐解决方案主要依靠知识图谱的结构信息来判断实体的等价性,但在真实世界知识图谱上,大部分实体只具有较低的节点度数以及微少的结构信息.此外,标注数据的缺乏也大大限制了实体对齐模型的效果.为解决上述问题,提出将不受节点度数影响的实体名信息与结构信息相结合,从更全面的角度实现实体对齐.在此基本框架上,利用基于课程学习的迭代训练方法从易至难地选择高置信度结果加入到训练数据中,扩增标注数据的规模.最后使用词移距离模型进一步改进实体名信息的利用方式,并对前序对齐结果重排序,提升实体对齐准确率.在跨语言以及单语言实体对齐任务上的实验结果表明,提出的实体对齐方法性能远好于当前最好的方法. Existing knowledge graphs(KGs)inevitably suffer from the problem of incompleteness.One feasible approach to tackle this issue is by introducing knowledge from other KGs.During the process of knowledge integration,entity alignment(EA),which aims to find equivalent entities in different KGs,is the most crucial step,as entities are the pivots that connect heterogeneous KGs.State-of-the-art EA solutions mainly rely on KG structure information for judging the equivalence of entities,whereas most entities in real-life KGs are in low degrees and contain limited structural information.Additionally,the lack of supervision signals also constrains the effectiveness of EA models.In order to tackle aforementioned issues,we propose to combine entity name information,which is not affected by entity degree,with structural information,to convey more comprehensive signals for aligning entities.Upon this basic EA framework,we further devise a curriculum learning based iterative training strategy to increase the scale of labelled data with confident EA pairs selected from the results of each round.Moreover,we exploit word mover's distance model to optimize the utilization of entity name information and re-rank alignment results,which in turn boosts the accuracy of EA.We evaluate our proposal on both cross-lingual and mono-lingual EA tasks against strong existing methods,and the experimental results reveal that our solution outperforms the state-of-the-arts by a large margin.
作者 曾维新 赵翔 唐九阳 谭真 王炜 Zeng Weixin;Zhao Xiang;Tang Jiuyang;Tan Zhen;Wang Wei(Science and Technology on Information Systems Engineering Laboratory,National University of Defense Technology,Changsha 410073;Collaborative Innovation Center of Geospatial Technology(Wuhan University),Wuhan 430079;School of Computer Science and Engineering,The University of New South Wales,Sydney,Australia,2052)
出处 《计算机研究与发展》 EI CSCD 北大核心 2020年第7期1460-1471,共12页 Journal of Computer Research and Development
基金 国家自然科学基金项目(61872446,61902417,71690233,71971212) 湖南省自然科学基金项目(2019JJ20024) 湖南省研究生科研创新项目(CX20190033)。
关键词 实体对齐 课程学习 迭代训练 重排序 知识图谱对齐 entity alignment curriculum learning iterative training re-ranking knowledge graph alignment
  • 相关文献

参考文献4

二级参考文献33

  • 1BERNERS-LEE T, ttENDLER J, 1,ASSILA O. The semantic Web [J]. Scientific American, 2001, 284(5): 28-37.
  • 2BOLLACKER K, EVANS C, PARITOSH P, et al. Frcebase: a eol- lahoratively created graph database tor structuring human knowledge [ C]//ACM S1GMOD 2008: Proceedings of the 2008 Association thr Compming Machinery' s Special Interest Group on Management of Data. New York: ACM 2008: 1247-1250.
  • 3LEHMANN J, ISELE R, JAKOB M, et al. DBpedia-a large- scale, muhilingual knowledge base extracted ii'om wikipedia [ J]. Semantic Weh, 2015(2) : 167 - 195.
  • 4BIEGA J, KUZEY E, SUCHANEK F M, Inside YAGO2s: a trans parent information extraction architecture [ C]// Proceedings of the22nd International Conference on Worhl Wide Web Conferetace. New York: ACM, 2013:325-328.
  • 5PHIl,POT A, HOVY E, PANTFI, P. The Omega ont,Jlogy [ C]// OntoLex-05: Proceedings of the 2nd International Joint Conference on Natural Language Processing Workshop on Ontologies and Lexical Resources. Cambridge, UK: Cambridge University Press, 2005:59 -66.
  • 6LI M, SHI Y, WANG Z, et al. Building a large-scale cross-lingual knowledge base from heterogcneous online wikis [ M] // Natural Lan- guage Processing and Chinese Co,nputing. Berlin: Springer, 2015: 413 -420.
  • 7MADHU G, GOVARDHAN A, RAJINIKANTI-I T V. Intelligent se- mantic Web search engines: a brief survey [ J]. International Journal of Web & Semantic Technology, 201 1, 2(1) : 34 -42.
  • 8HAN X, SUN L. A generative entity-mention model for linking enti- ties with knowledge base [ C]// ACL-HLT 2011 :Proct, edings of the 49th Annual Meeting of the Association for Computational Linguis- tics: Human Languagechnologies-Volume 1. Stroudsburg, PA: Association for Computational Linguistics, 2011:945 -954.
  • 9NOV O. What motivates wikipedians [ J]. Communications of the ACM, 2007, 50(11) : 60 - 64.
  • 10SLEEMAN J, FININ T. Computing FOAF co-reference re.lations with rules and machine learning [ C]// SDoW-2010: Proceedings of the 3rd International Workshop on Social Data on the Web. Ber- lin: Springer, 2010:1 - 11.

共引文献88

同被引文献34

引证文献9

二级引证文献22

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部