期刊文献+

Topological Features Based Entity Disambiguation 被引量:1

Topological Features Based Entity Disambiguation
原文传递
导出
摘要 This work proposes an unsupervised topological features based entity disambiguation solution. Most existing studies leverage semantic information to resolve ambiguous references. However, the semantic information is not always accessible because of privacy or is too expensive to access. We consider the problem in a setting that only relationships between references are available. A structure similarity algorithm via random walk with restarts is proposed to measure the similarity of references. The disambiguation is regarded as a clustering problem and a family of graph walk based clustering algorithms are brought to group ambiguous references. We evaluate our solution extensively on two real datasets and show its advantage over two state-of-the-art approaches in accuracy. This work proposes an unsupervised topological features based entity disambiguation solution. Most existing studies leverage semantic information to resolve ambiguous references. However, the semantic information is not always accessible because of privacy or is too expensive to access. We consider the problem in a setting that only relationships between references are available. A structure similarity algorithm via random walk with restarts is proposed to measure the similarity of references. The disambiguation is regarded as a clustering problem and a family of graph walk based clustering algorithms are brought to group ambiguous references. We evaluate our solution extensively on two real datasets and show its advantage over two state-of-the-art approaches in accuracy.
出处 《Journal of Computer Science & Technology》 SCIE EI CSCD 2016年第5期1053-1068,共16页 计算机科学技术学报(英文版)
基金 This work is supported by the National Basic Research 973 Program of China under Grant No. 2012CB316201, the Fundamental Research Funds for the Central Universities of China under Grant No. N120816001, and the National Natural Science Foundation of China under Grant Nos. 61472070 and 61402213.
关键词 entity disambiguation topological feature CLUSTERING random walk with restarts entity disambiguation, topological feature, clustering, random walk with restarts
  • 相关文献

参考文献37

  • 1Ferreira A A, Gonqalves M A, Laender A H. A brief sur- vey of automatic methods for author name disambiguation. ACM SIGMOD Record, 2012, 41(2): 15-26.
  • 2Han H, Giles L, Zha H, Li C, Tsioutsiouliklis K. Two super- vised learning approaches for name disambiguation in au- thor citations. In Proc. the 4th ACM/IEEE Joint-CS Con- ference on Digital Libraries, June 2004, pp.296-305.
  • 3Han H, Zha H, Giles C L. Name disambiguation in au- thor citations using a k-way spectral clustering method. In Proc. the 5th ACM/IEEE-CS Joint Conference on Digital Libraries, June 2005, pp.334-343.
  • 4Bhattacharya I, Getoor L. A latent Dirichlet model for un- supervised entity resolution. In Proc. the 2006 SIAM Con- ference on Data Mining, April 2006.
  • 5Shu L, Long B, Meng W. A latent topic model for com- plete entity resolution. In Proe. the 25th IEEE Interna- tional Conference on Data Engineering, March 29-April 2, 2009, pp.880-891.
  • 6Song Y, Huang J, Councill I G, Li J, Giles C L. Efficient topic-based unsupervised name disambiguation. In Proe, the 7th ACM/IEEE-CS Joint Conference on Digital Li- braries, June 2007, pp.342-351.
  • 7Kataria S S, Kumar K S, Rastogi R R, Sen P, Sengamedu S H. Entity disambiguation with hierarchical topic models. In Proc. the 17th A CM SIGKDD International Conference on Knowledge Discovery and Data Mining, August 2011, pp.1037-1045.
  • 8Tang J, Fong A C M, Wang B, Zhang J. A unified prob- abilistic framework for name disambiguation in digital li- brary. IEEE Transactions on Knowledge and Data Engi- neering, 2012, 24(6): 975-987.
  • 9Sen P. Collective context-aware topic models for entity dis- ambiguation. In Proe. the 21st International Conference on World Wide Web, April 2012, pp.729-738.
  • 10Cen L, Dragut E C, Si L, Ouzzani M. Author disambigua- tion by hierarchical agglomerative clustering with adaptive stopping criterion, in Proc. the 36th International ACM SIGIR Conference on Research and Development in Infor- mation Retrieval, July 28-Aug. 1, 2013, pp.741-744.

同被引文献15

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部