
Person's name disambiguation based on person-related social communities
Abstract: Personal names are highly ambiguous, so the results of searching for a person's name are usually a mixture of Web pages about different people who share that name. Person's name disambiguation is the process of using context to distinguish the different person entities behind the same name. This paper proposes a person's name disambiguation method based on relevant communities, using a modified Espresso algorithm to discover the relevant community for each Web page. The community found for each page, which enlarges that page's set of names, is then applied in a two-stage person's name disambiguation algorithm, and the approach is tested on the WePS-2 test dataset. The experimental results show that the method is effective.
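The abstract does not spell out the community discovery or the two-stage algorithm, so the following is only a minimal sketch of the general idea of grouping pages by the people they mention together. Everything in it is an assumption made for illustration: the Jaccard overlap measure, the `threshold` value, the single-link grouping, and the toy page data are hypothetical and are not taken from the paper. It does not implement Espresso or the authors' two-stage procedure.

```python
from itertools import combinations


def jaccard(a: set, b: set) -> float:
    """Jaccard overlap between two sets of related person names."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def cluster_pages(page_names: dict, threshold: float = 0.2) -> list:
    """Group pages whose related-name sets overlap (single-link via union-find).

    page_names maps a page id to the set of names found around the query name.
    threshold is an illustrative cut-off, not a value from the paper.
    """
    parent = {page: page for page in page_names}

    def find(x):
        # Follow parent links to the cluster root, shortening the path as we go.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    # Link every pair of pages whose name sets are similar enough.
    for p, q in combinations(page_names, 2):
        if jaccard(page_names[p], page_names[q]) >= threshold:
            parent[find(p)] = find(q)

    # Collect pages by their cluster root.
    clusters = {}
    for page in page_names:
        clusters.setdefault(find(page), set()).add(page)
    return sorted(clusters.values(), key=lambda c: sorted(c))


# Hypothetical toy input: three pages returned for the same queried name.
pages = {
    "page1": {"Alice Chen", "Bob Li", "Carol Wu"},
    "page2": {"Bob Li", "Carol Wu", "Dan Xu"},
    "page3": {"Eve Zhao", "Frank Sun"},
}
clusters = cluster_pages(pages)
print(clusters)
```

In this toy input, `page1` and `page2` share two of four distinct names and are grouped as one person, while `page3` shares none and stays separate. A richer name set per page gives this kind of overlap test more evidence to work with, which is the intuition behind enlarging the name sets before clustering.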
Authors: LI Qi (李琦), MA Jun (马军)
Source: Journal of Shandong University (Natural Science) (《山东大学学报(理学版)》), 2012, No. 3, pp. 33-37 (5 pages). Indexed in CAS, CSCD, and the Peking University Core Journals list.
Funding: National Natural Science Foundation of China (60970047, 61103151, 61173068); Doctoral Program Fund of the Ministry of Education (20110131110028)
Keywords: social network; community; person's name disambiguation; Web people search; clustering









