期刊文献+

Disambiguating Authors by Pairwise Classification 被引量:1

Disambiguating Authors by Pairwise Classification
原文传递
导出
摘要 Name ambiguity is a critical problem in many applications, in particular in online bibliography sys-tems, such as DBLP, ACM, and CiteSeerx. Despite the many studies, this problem is still not resolved and is becoming even more serious, especially with the increasing popularity of Web 2.0. This paper addresses the problem in the academic researcher social network ArnetMiner using a supervised method for exploiting all side information including co-author, organization, paper citation, title similarity, author's homepage, web constraint, and user feedback. The method automatically determines the person number k. Tests on the researcher social network with up to 100 different names show that the method significantly outperforms the baseline method using an unsupervised attribute-augmented graph clustering algorithm. Name ambiguity is a critical problem in many applications, in particular in online bibliography sys-tems, such as DBLP, ACM, and CiteSeerx. Despite the many studies, this problem is still not resolved and is becoming even more serious, especially with the increasing popularity of Web 2.0. This paper addresses the problem in the academic researcher social network ArnetMiner using a supervised method for exploiting all side information including co-author, organization, paper citation, title similarity, author's homepage, web constraint, and user feedback. The method automatically determines the person number k. Tests on the researcher social network with up to 100 different names show that the method significantly outperforms the baseline method using an unsupervised attribute-augmented graph clustering algorithm.
出处 《Tsinghua Science and Technology》 SCIE EI CAS 2010年第6期668-677,共10页 清华大学学报(自然科学版(英文版)
基金 supported by the National Natural Science Foundation of China (Nos.70771043,60873225,and 60773191) supported by the National Natural Science Foundation of China (No.60773061) the Natural Science Foundation of Jiangsu Province (No.BK2008381) supported by the National High-Tech Research and Development (863) Program ofChina (No.2009AA01Z138)
关键词 disambiguating pairwise classification arnetminer disambiguating pairwise classification arnetminer
  • 相关文献

参考文献16

  • 1http://portal.acm.org, 2010.
  • 2Bunescu C, Pasca M. Using encyclopedic knowledge for named entity disambiguation. In: Proceedings of the l lth Conference of the European Chapter of the Association for Computational Linguistics. Trento, Italy, 2006: 9-16.
  • 3Cucerzan S. Large-scale named entity disambiguation based on Wikipedia data. In: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning. Prague. Czech Republic. 2007:708-716.
  • 4Han H, Giles L, Zha H, et al. Two supervised learning approaches for name disambiguation in author citations. In: Proceedings of the 4th ACM/IEEE-CS Joint Conference on Digital Libraries. Tuscon, USA, 2004: 296-305.
  • 5Han H, Zha H, Giles L. Name disambiguation in author citations using a k-way spectral clustering method. In: Proceedings of the 5th ACM/IEEE-CS Joint Conference on Digital Libraries. Denver, USA, 2005: 334-343.
  • 6Tan F, Kan Y, Lee D. Search engine driven author disambiguation. In: Proceedings of the 6th ACM/IEEE-CS Joint Conference. Chapel HJLll, USA, 2006:314-315.
  • 7Yin X, Han J, Yu S. Object distinction: Distinguishing objects with identical names. In: Proceedings of the 23rd International Conference on Data Engineering. Istanbul, Turkey, 2007: 1242-1246.
  • 8Bekkerman R, McCallum A. Disambiguating web appearances of people in a social network. In: Proceedings of the 14th International Conference on World Wide Web. Chiba, Japan, 2005: 463- 470.
  • 9Mann S, Yarowsky D. Unsupervised personal name disambiguation. In: Proceedings of the Seventh Conference on Natural Language Learning at HLT-NAACL. Edmonton, Canada, 2003: 33-40.
  • 10Minkov E, Cohen W, Ng Y. Contextual search and name disambiguation in email using graphs. In: Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. Washington D. C., USA, 2006: 27-34.

同被引文献14

  • 1Making a match [ OL 1. [ 2013 - 10 - 05 ]. http ://himmelfarbli- brary, wordpress, com/2012/10/18/matching -who - is - who - in - research/.
  • 2O RCID community [ OL 1. [ 2013 - 10 - 05 ]. http ://orcid. org/a- bout/community.
  • 3Why ProQuest Scholar Universe? [ OLd. [2013 - 10 -05 1. ht-tp ://www. refworks - cos. com/cosscholaruniverse/.
  • 4如何从IsIWebofKnowledge平台向ResearcherID添加我的著作歹Ⅱ表?[OL].[2013-10-27].http://ip-science.thomson-reuters.eom.cnfmedia/wok511.pdf.
  • 5如何从EndnoteWeb向ResearcherID中添加我的著作列表[OL].[2013-10-27].http://ip-science.thomsonreuters.eom.cn/media/wok512.pdf.
  • 6About VIVO [ OL ]. [ 2014 - 02 - 09 ]. http ://vivo. cornell, edu/.
  • 7Overview[ OL]. [ 2014 - 02 - 09 ]. http ://academic. research, mi- crosoft, com/SilverlightInstall.
  • 8ThuRID服务目标[0L].[2014-02-09].http://rid.1ib.tsing-hua.edu.cn/.
  • 9AbouttheHub[OL].[2014-02-09].http://hub.hku.hk/.
  • 10李伟钢.巴西人才计划与科研管理的技术支持[0L].[2014-01-12].http://blog.sciencenet.cn/blog一652078-655383.ht.m1.

引证文献1

二级引证文献17

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部