期刊文献+

基于场景信息融合的中文姓名识别方法研究 被引量:1

Research of Chinese name identification method based on scene information fusion
下载PDF
导出
摘要 为克服传统的先分词再识别方法的缺点,提出了一种基于场景信息融合的姓名识别方法。该方法结合中文姓名的特点,综合考虑上下文信息、词本身信息、词典信息和姓名自身信息等场景资源对中文名实体的影响,将它们作为姓名识别的依据,同时引入了证据理论,通过场景资源信息的融合,最终识别出人名。通过对互联网上随机抽取的大规模真实语料的开放测试表明,该方法可以取得较高的召回率并同时保证较高的准确率。 To overcome the defects of traditional name identification algorithms with automatic segmentation at first,a name identification method based on scene information fusion is presented.Combining the characteristics of Chinese names,the scene information, such as the context, word, dictionary, names, is used as the basis of name identification.And then, the evidence theory is introduced,and the names are identified by scene information fusion.The open tests on real data sets randomly selected from the internet show that it is an effective method to improve the result of the identification with high recall rate and accuracy rate are guaranteed.
出处 《计算机工程与应用》 CSCD 北大核心 2009年第34期147-151,共5页 Computer Engineering and Applications
基金 国家自然科学基金No.60972045 南京邮电大学引进人才科研基金No.NY207148~~
关键词 姓名识别 场景信息融合 自动分词 证据理论 name identification scene information fusion automatic segmentation evidence theory
  • 相关文献

参考文献9

  • 1Zhang Hua-ping, Liu Qun, Zhang Hao.Automatic recognition of Chinese unknown words based on roles tagging[C]//Prceedings of the First SIGHAN Workshop on Chinese Language Processing, 2002, 18: 71-77.
  • 2黄德根,马玉霞,杨元生.基于互信息的中文姓名识别方法[J].大连理工大学学报,2004,44(5):744-748. 被引量:12
  • 3贾宁,张全.基于最大熵模型和规则的中文姓名识别[J].计算机工程与应用,2007,43(35):1-4. 被引量:6
  • 4Zhang Yue-jie,Zhang Tao.Me-based Chinese person name and location name recognition model[C]//International Conference on Machine Learning and Cybernetics, 2007,6: 3442-3447.
  • 5Telmoudi A,Chakhar S.Data fusion application from evidential databases as a support for decision making[J].Information and Software Technology, 2004,46(8) : 547-555.
  • 6Srivastava R P,Liu L.Applications of belief function in business decisions : A review [J].Information Systems Frontiers, 2003,5 (4) : 359-378.
  • 7Jones R W,Lowe A,Harrison M J.A framework for intelligent medical diagnosis using the theory of evidence[J].Knowledge- Based Systems, 2002,15 : 77-84.
  • 8Otman B,Fakhri K,Zhu Hong-wei.Connectionist-based dempstershafer evidential reasoning for data fusion[J].IEEE Transactions on Neural Networks,2005,16(6) : 1513-1530.
  • 9Beynon M J.Understanding local ignorance and non-specificity in the DS/AHP method of muti-criteria decision making[J].European Journal of Operational Research, 2005,163 (2) : 403-417.

二级参考文献12

  • 1季姮,罗振声.基于统计和规则的中文姓名自动辨识[J].语言文字应用,2001(1):14-18. 被引量:13
  • 2王振华,孔祥龙,陆汝占,刘绍明.结合决策树方法的中文姓名识别[J].中文信息学报,2004,18(6):10-15. 被引量:15
  • 3CHURCH K W, HANKS P. Word association norms, mutual information, and lexicography [J]. Comput Linguist, 1990,16(1):22-29.
  • 4CHURCH K W, GALE W, HANKSP, et al. Using Statistics in Lexical Analysis. Lexical Acquisition: Exploiting On-Line Resources to Build a Lexicon [M]. ZERNIK U. New Jersey: Lawrence Erlbaum, 1991. 115-164.
  • 5Adwait R.Maximum entropy models for natural language ambiguity resolution[D].University of Pennsylvania, 1998.
  • 6Jin Rong,Yan Rong,Zhang Jian.A faster iterative scaling algorithm for conditional exponential model [C]//Proceedings of the Twentieth International Conference on Machine Learning (ICML-2003), Washington DC,2003.
  • 7郑家恒,谭红叶.基于变换的中文姓名识别技术探讨[C]//中文信息处理国际会议,北京,1998.
  • 8中国社会科学院语言文字应用研究所.姓氏人名用字分析统计[M].北京:语文出版社,1991.
  • 9张跃,姚天顺.基于结合性自动识别中文姓名[J].小型微型计算机系统,1997,18(10):43-48. 被引量:9
  • 10刘秉伟,黄萱菁,郭以昆,吴立德.基于统计方法的中文姓名识别[J].中文信息学报,2000,14(3):16-24. 被引量:48

共引文献15

同被引文献15

  • 1张晓艳,王挺,陈火旺.命名实体识别研究[J].计算机科学,2005,32(4):44-48. 被引量:66
  • 2俞鸿魁,张华平,刘群,吕学强,施水才.基于层叠隐马尔可夫模型的中文命名实体识别[J].通信学报,2006,27(2):87-94. 被引量:157
  • 3周俊生,戴新宇,尹存燕,陈家骏.基于层叠条件随机场模型的中文机构名自动识别[J].电子学报,2006,34(5):804-809. 被引量:112
  • 4李丽双,黄德根,陈春荣,杨元生.SVM与规则相结合的中文地名自动识别[J].中文信息学报,2006,20(5):51-57. 被引量:32
  • 5贾宁,张全.基于最大熵模型的中文姓名识别[J].计算机工程,2007,33(9):31-33. 被引量:5
  • 6RICHMAN A E, SCHONE P. Mining Wiki resources for multilingual named entity recognition, ACL-08 [ EB/OL]. [ 2009 - 12 - 12]. http://aclweb, org/anthology-new/P/P08/P08-1001, pdf.
  • 7FU GUOHONG, LUKE K-K. Chinese named entity recognition using lexicalized HMMs [ J]. ACM SIGKDD Explorations Newsletter, 2005, 7(1): 19-25.
  • 8DAVID N, SATOSHI S. A survey of named entity recognition and classification [ J]. Linguisticae Investigationes, 2007, 30( 1 ) : 3 - 26.
  • 9XIONG DEYI, LIU QUN, LIN SHOUXUN. Maximum entropy based phrase reordering model for statistical machine translation [ C]// Proceedings of the 21st International Conference on Computational Linguistics and the 44th Annual Meeting of the Association for Com- putational Linguistics. Morristown, N J: Association for Computational Linguistics, 2006:521 - 528.
  • 10BENAJIBA Y, DIAB M, ROSSO P, et al. Arabic named entity recognition: An SVM-based approach [ EB/OL]. [ 2009 - 12 - 12]. http://eref, uqu. edu. sa/files/eref2/folder6/fl31, pdf.

引证文献1

二级引证文献8

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部