期刊文献+

基于聚类的人名消歧研究综述 被引量:2

A Summary:Research Name Disambiguation of Clustering
下载PDF
导出
摘要 人名消歧问题属于文本聚类范围,但有其自身的特殊性,即参与聚类的文本集采用向量空间模型表示以后具有较高的维度,导致数据在聚类过程中效率低下、计算内存开销过高。为了深入分析人名消歧研究中聚类算法的整体应用情况,从中国知网期刊数据库收集2006-2018年10月相关文献进行了统计和分析,介绍了利用聚类算法进行人名消歧研究的一般流程,阐述了聚类算法在人名消歧研究的应用、聚类评价指标和聚类结果评价,详细介绍相关研究成果及代表文献,为研究人员提供参考和借鉴。 Name disambiguation belongs to the scope of text clustering,but it has its own particularity:the set of text clustering represented by vector space model has a higher dimension,which leads to inefficiency and high computational memory in clustering process. In order to deeply analyze the overall application of clustering algorithm in the research of name disambiguation,the paper collected the related literature from the database of CNKI from October 2006 to October 2018 to statistics and analyze. Also,introduces the general process of using clustering algorithm in the researching name disambiguation,expounds the application of clustering evaluation in researching name disambiguation,clustering evaluation and evaluation of clustering result. Finally,the paper introduces in detail research results and representative literature,which provides reference for researchers of name disambiguation.
作者 展金梅 陈君涛 ZHAN Jinmei;CHEN Juntao(Qiongtai Normal University,Haikou 571127,China;Hainan College of Economics and Business,Haikou 571127,China)
出处 《现代信息科技》 2019年第10期88-91,共4页 Modern Information Technology
基金 海南省高等学校科学研究项目:聚类集成算法在中文文本中人名消歧的应用研究(项目编号:Hnky2018-78)资助,属其阶段性研究成果之一
关键词 聚类 人名消歧 研究综述 clustering name disambiguation research summary
  • 相关文献

参考文献10

二级参考文献98

  • 1张猛,王大玲,于戈.一种基于自动阈值发现的文本聚类方法[J].计算机研究与发展,2004,41(10):1748-1753. 被引量:16
  • 2刘远超,王晓龙,刘秉权.一种改进的k-means文档聚类初值选择算法[J].高技术通讯,2006,16(1):11-15. 被引量:23
  • 3ICTCLAS-分词-中文分词-汉语分词[EB/OL].[2009-07-18].http://ictclas.org/.
  • 4罗会兰,孔繁胜,李一啸.聚类集成中的差异性度量研究[J].计算机学报,2007,30(8):1315-1324. 被引量:36
  • 5CHOI J D,L EE K,LOGINOV A,et al.Efficient and precise data race detection for multithreaded object-oriented programs[C]//Proceeding of the 2002 ACM SIGPLAN Conference on Programming Language Design and Implementation.Berlin,2002:258-269.
  • 6Fleischman M.B,Hovy E.Multi-document Person Name Resolution[C]//Proceedings of ACL-42 Reference Resolution Workshop,Barcelona,Spain,2004,7.
  • 7Chen Y,Martin J.Towards Robust Unsupervised Personal Name Disambiguation[C]//Proceedings of the EMNLP and CoNLL,Prague,2007:190-198.
  • 8Artiles J,Gonzalo J,Sekine S.The SemEval-2007 WePS Evaluation:Establishing a benchmark for the Web People Search Task[C]//Proceedings of the 4th International Workshop on Semantic Evaluations 2007,Prague,June,2007:64-69.
  • 9Shingo O,Issei S,Minoru Y.Person Name Disambiguation in Web Pages Using Social Network[J].Compound Words and Latent Topics.PAKDD,2008:260-271.
  • 10Malin B, Airoldi E, Carley K M. A Network Analysis Model for Disambiguation of Names in Lists[ J]. Computational & Mathematical Organization Theory, 2005,11 (2) :119 - 139.

共引文献46

同被引文献11

引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部