一种新的重名消解算法在保险领域中的应用研究被引量：3

Application of data mining in customer name disambiguation of insurance field

下载PDF

导出

摘要研究客户重名消解问题。针对以往重名消解方法如文本聚类的方法需考虑大量无用词汇并需要人工设定阈值以及类别数量,而基于信息抽取的人物相关属性相似度方法对于人物信息的抽取具有依赖性,提出了一种改进的重名消解算法。该算法首先对具有相同标志的客户进行属性匹配,合并匹配成功的标志;然后进行链接分析,对客户合作网的结构进行分析,将具有相同标志并与同一个代理人实体合作的客户归为一个客户实体,并把具有相同合作对的信息加以分析合并;最后通过原子团簇分析法进行聚类分析。仿真实验结果表明,所提改进算法对中文字符串的匹配处理进行了优化,执行效率高,适合于以大量数据为特征的保险领域的重名消解。 This paper researched the solution to customer name disambiguation of the field of insurance. Aiming at the former name disambiguation methods such as text clustering method need to be considered in a lot of useless words, manually set the threshold, and gave he numbers of type, and the method of character-related properties of similarity based on information extraction depends on the character information, proposed a new name disambiguation method. Firstly, applied the same attribute matching, merging the identity of a successful match and then used link analysis, analyzed structural analysis of customers network, the entities had the same identity and classified cooperate with the same policy to a customer entity, merged the same cooperating information. Finally, analyzed cluster analysis cluster. Experiment results show that the proposed method can optimize the chinese text string matching process and have the high implementation efficiency, especially suitable for large amounts of data to the insurance sector is characterized by digestion of the same name.

作者姚宇峰

机构地区常熟理工学院计算机科学与工程学院

出处《计算机应用研究》 CSCD 北大核心 2012年第3期994-997,共4页 Application Research of Computers

基金常熟理工学院青年教师科研启动基金资助项目(QZ0912)

关键词重名消解数据挖掘保险领域实体 name disambiguation data mining insurance field entity

分类号 TP391 [自动化与计算机技术—计算机应用技术] TP311 [自动化与计算机技术—计算机软件与理论]

引文网络
相关文献

参考文献6

1郎君,秦兵,宋巍,刘龙,刘挺,李生.基于社会网络的人名检索结果重名消解[J].计算机学报,2009,32(7):1365-1374. 被引量：32
2陈农心,张效严.数据挖掘技术在证券分析系统的应用研究[J].计算机仿真,2010,27(10):301-305. 被引量：3
3BOLLEGALA D,MATSUO Y, 1SHIZUKA M. Disambiguating personal names on the Web using automatically extracted key phrases [ C ]// Proc of the 17th European Conference on Artificial Intelligence. Riva del Garda, Italy :IOS Press,2011:553-557.
4WANG Hou-feng. Cross-document transliterated persona1 name core- ference resolution [ C]//Lecture Notes in Computer Science, vol 3614. 2005.
5Wang Houfeng（王厚峰）,Mei Zheng.Chinese multi-document personal name disambiguation[J].High Technology Letters,2005,11(3):280-283. 被引量：8
6于满泉.面向人物追踪的知识挖掘研究[D].北京:中国科学院计算技术研究所,2009.

二级参考文献7

1Wang Houfeng（王厚峰）,Mei Zheng.Chinese multi-document personal name disambiguation[J].High Technology Letters,2005,11(3):280-283. 被引量：8
2张晓莉,王苗,罗文劼.数据结构与算法[M].北京:机械工业出版社,2008:97-142.
3Han Jiawei, Micheline Kamber. Data Mining Concepts and Techniques[ M]. San Marco : Morgan Kaufmarm Publishers Inc, 2001.
4HanJiawei,MichalineKamber[加]著,范明,等译.数据挖掘概念与技术[M].北京:机械工业出版社,2001-8.
5敖富江译.数据仓库、挖掘和可视化[M].北京:清华大学出版社,2004-10.
6李旭.基于改进神经网络的WEB数据挖掘研究[J].计算机仿真,2008,25(6):99-102. 被引量：3
7单承戈.决策支持系统问题模型的可视化构造方法[J].计算机应用研究,2000,17(9):25-47. 被引量：19

共引文献36

1Fei Shu.Limitations of citation analysis on the measurement of research impact:A summary[J].Data Science and Informetrics,2021,1(3):37-49.
2杨高明,李敬兆,张顺香,周华平.社会网络社区识别方法研究[J].大庆师范学院学报,2013,33(3):1-4.
3郑倩冰,朱培栋,朱政坚.基于在线社会网络的信息存储与搜索机制研究[J].计算机研究与发展,2011,48(S1):143-146.
4吴斌,徐超群,王文彬,吴巍.基于链接的作者重名处理方法研究与应用[J].计算机科学,2008,35(3):197-199. 被引量：5
5郎君,秦兵,宋巍,刘龙,刘挺,李生.基于社会网络的人名检索结果重名消解[J].计算机学报,2009,32(7):1365-1374. 被引量：32
6杨欣欣,李培峰,朱巧明,王英帅.一种基于改进的K-means算法的人名消歧系统的设计与实现[J].计算机与数字工程,2010,38(8):10-12. 被引量：5
7施佺,肖仰华,温文灏,朱乾钱,王恒山.基于Mapreduce的大规模社会网络提取方法研究[J].计算机应用研究,2011,28(1):145-148. 被引量：4
8郑倩冰,朱培栋,王永文,徐明.基于在线社会网络的网络协议增强机制研究[J].计算机科学,2011,38(6):81-83. 被引量：1
9王英帅,李培峰,朱巧明.一种基于LDA和上下文摘要的Web人名消歧方法[J].计算机应用与软件,2011,28(7):13-15. 被引量：3
10陈晨,王厚峰.基于社会网络的跨文本同名消歧[J].中文信息学报,2011,25(5):75-82. 被引量：13

同被引文献32

1Golubin A Y. Pareto-optimal insurance policies in the models with a premium based on the actuarial value[J]. Journal of Risk and Insurance, 2006,73(6):469-487..
2Shen Wei,Wang Jianyong,Han Jiawei.Entity linking with a knowledge base:issues,techniques,and solutions[J].IEEE Trans on Knowledge and Data Engineering,2014,27(2):443-460.
3Shen Wei,Wang Jianyong,Luo Ping,et al.LINDEN:linking named entities with knowledge base via semantic knowledge[C]//Proc of the 21st International Conference on World Wide Web.2012.
4Guo Yuhang,Qin Bing,Li Yuqin,et al.Improving candidate generation for entity linking[C]//Proc of Natural Language Processing and Information Systems.2013:225-236.
5Guo Yuhang,Qin Bing,Liu Ting,et al.Microblog entity linking by leveraging extra posts[C]//Proc of Conference on Empirical Methods in Natural Language Processing.2013.
6Shen Wei,Wang Jianyong,Luo Ping,et al.Linking named entities in Tweets with knowledge base via user interest modeling[C]//Proc of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.2013:68-76.
7Blei D M,Ng A Y,Jordan M I.Latent dirichlet allocation[J].The Journal of Machine Learning Research,2003,3(1):993-1022.
8Han Xianpei,Sun Le.An entity-topic model for entity linking[C]//Proc of Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning.[S.l.]:Association for Computational Linguistics,2012.
9Li Yang,Wang Chi,Han Fangqiu,et al.Mining evidences for named entity disambiguation[C]//Proc of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.2013.
10Newman D,Chemudugunta C,Smyth P.Statistical entity-topic models[C]//Proc of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.Philadelphia,PA,USA:ACM Press,2006:680-686.

引证文献3

1宋骏豪.一种基于遗传算法的保险方案智能推荐方法[J].计算机与现代化,2013(7):80-83. 被引量：1
2宋俊,李禹恒,黄宇,陈昊,付琨.一种基于用户兴趣的微博实体链接方法[J].计算机应用研究,2016,33(7):2079-2082. 被引量：1
3冯钧,柳菁铧,孔盛球.融合多特征的中文集成实体链接方法[J].计算机与现代化,2019(1):69-74.

二级引证文献2

1李春梅,刘海雄.遗传算法在政府采购系统中的应用[J].电子设计工程,2015,23(16):43-46. 被引量：1
2王旭阳,姜喜秋.基于上下文信息的中文命名实体消歧方法研究[J].计算机应用研究,2018,35(4):1072-1075. 被引量：7

1李琦,马军.基于人物相关社区的重名消解研究[J].山东大学学报（理学版）,2012,47(3):33-37. 被引量：5
2刘显敏,李建中.高效的实体匹配结果消解算法[J].计算机研究与发展,2013,50(S1):239-247.
3郎君,秦兵,宋巍,刘龙,刘挺,李生.基于社会网络的人名检索结果重名消解[J].计算机学报,2009,32(7):1365-1374. 被引量：32
4李刚,史向东.基于Google搜索结果的重名消解方法[J].信息与电脑（理论版）,2011(2):125-126. 被引量：1
5罗泽飞.浅析汽车在日常维护中要注意的若干问题[J].科技风,2013(2):113-113. 被引量：1
6朱云霞.中文文献题录数据作者重名消解问题研究[J].图书情报工作,2014,58(23):143-148. 被引量：8
7郑轶.基于条件随机场的人物信息抽取[J].计算技术与自动化,2015,34(4):132-136. 被引量：3
8刘万伟,周倜,李梦君,李舟军.一种基于进程代数的安全协议验证消解算法[J].计算机工程与科学,2006,28(7):14-16. 被引量：1
9胡晓,胡洁,彭颖红,李晟,曹兆敏.语义级知识融合中的冲突消解方法[J].上海交通大学学报,2009,43(11):1730-1733. 被引量：2
10刘兵,臧天阳,张晶.一种中文字符串近似匹配查询技术研究[J].电脑编程技巧与维护,2013(14):6-6.

计算机应用研究

2012年第3期

浏览历史

内容加载中请稍等...

一种新的重名消解算法在保险领域中的应用研究被引量：3

参考文献6

二级参考文献7

共引文献36

同被引文献32

引证文献3

二级引证文献2

相关作者

相关机构

相关主题

浏览历史

一种新的重名消解算法在保险领域中的应用研究 被引量：3

参考文献6

二级参考文献7

共引文献36

同被引文献32

引证文献3

二级引证文献2

相关作者

相关机构

相关主题

浏览历史

一种新的重名消解算法在保险领域中的应用研究被引量：3