期刊文献+

维吾尔语中汉族人名的识别及翻译 被引量:13

Recognition and Translation for Chinese Names in Uighur Language
下载PDF
导出
摘要 该文研究了一种维吾尔语中汉族人名的识别和翻译方法。该方法在词典等传统方法的基础上,运用语言模型实现维语中的汉族人名的识别和翻译。针对维语人名的构词和拼写特点,增加了名词词缀识别预处理模块,补充了维语字母到汉语拼音的映射规则,有效提高了人名识别的正确率及召回率。在1 000句含有汉族人名的维语语料上进行测试,汉族人名识别的正确率和召回率分别达到75.2%和91.5%。 Name translation in the minority languages is still in its infancy.This paper presents a method for recognizing and translating Chinese Names in Uighur Language.In addition to using the traditional rule approach,we use Uighur and Chinese language models to recognize and translate Chinese names in Uighur Language.On this basis,we add the appropriate rules and algorithms to solve the problem of names with noun affixes and incomplete rules.This improves the accuracy of translation and the recall rate.We test the translation system with 1000 random sentences with Chinese names.The results show that the accuracy can reach 75.2% and the recall rate can reach 91.5%.
出处 《中文信息学报》 CSCD 北大核心 2011年第4期82-87,共6页 Journal of Chinese Information Processing
基金 国家自然科学基金重点资助项目(60736014) 国家自然科学基金资助项目(60873167)
关键词 语言模型 名词词缀 拼写规则 人名识别及翻译 language model noun affixes spelling rules recognition and translation of names
  • 相关文献

参考文献8

  • 1宋柔,朱宏.基于语料库和规则库的人名识别法[C]//陈力为.计算语言研究与应用.北京:北京语言学院出版社,1993.
  • 2罗智勇,宋柔.现代汉语自动分词中专名的一体化、快速识别方法[C]//Ji Dong-Hong.国际中文电脑学术会议,新加坡,2001:323-328.
  • 3张华平,刘群.基于角色标注的中国人名自动识别研究[J].计算机学报,2004,27(1):85-91. 被引量:101
  • 4Zhang Huaping, Liu Qun, Yu Hongkui, et al. Chinese named entity recognition using role model[J]. The International Journal of Computational Linguistics and Chinese Language Processing, 2003, 8(2) : 29-60.
  • 5吕雅娟,赵铁军,杨沐昀,于浩,李生.基于分解与动态规划策略的汉语未登录词识别[J].中文信息学报,2001,15(1):28-33. 被引量:43
  • 6Wu Youzheng, Zhao Jun, Xu Bo, et al. Chinese named entity recognition based on multiple feature [C]//Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing(HLT/EMNLP), Vancouver, 2005: 427-434.
  • 7衣马木艾山.阿布都力克木,吐尔地.托合提,艾斯卡尔.艾木都拉.基于规则的维吾尔人名汉文机器翻译算法研究[J].计算机应用与软件,2010,27(8):86-87. 被引量:9
  • 8张秀玲.汉维语人名文化异同之比较[J].新疆大学学报(哲学社会科学版),2009,37(6):136-139. 被引量:10

二级参考文献18

  • 1杜绍源.新疆维吾尔族人名初探[J].中央民族大学学报(哲学社会科学版),1983,10(3):68-73. 被引量:5
  • 2孙茂松,黄昌宁,高海燕,方捷.中文姓名的自动辨识[J].中文信息学报,1995,9(2):16-27. 被引量:87
  • 3艾山.吾买尔,吐尔根.伊布拉音.英文维文人名机器翻译算法的研究与实现[J].新疆大学学报(自然科学版),2007,24(1):97-101. 被引量:8
  • 4罗智勇,宋柔.现代汉语自动分词中专名的一体化、快速识别方法[C]//Ji Dong-Hong.国际中文电脑学术会议,新加坡,2001:323-328.
  • 5阿卜杜外力·佐尔冬,尼加提·马木提,麦麦提·阿希木,等.维吾尔人名汉文写法手册[M].新疆电子出版社,2000,10:1-172.
  • 6Ji Heng, Luo Zhen-Shen. Inverse name frequency model and rules based on Chinese name identifying. In: Huang ChangNing, Zhang Pu ed.. Natural Language Understanding and Machine Translation. Beijing: Tsinghua University Press,2001, 123 - 128( in Chinese)(季姮,罗振声.基于反比概率模型和规则的中文姓名自动辨识系统.见:黄昌宁,张普编.自然语言理解与机器翻译.北京:清华大学出版社,2001,123-128)
  • 7Zhen Jia-Heng, Liu Kai-Ying. Discussion on strategy of surname and personal name processing in Chinese word segmentation. In: Chen Li-Wei ed.. Research and Application of Computational Linguistics. Beijing: Beijing Institute of Linguistics and Culture Press, 1993(in Chinese)(郑家恒刘开瑛.自动分词系统中姓氏人名的处理策略探讨.见:陈力为编.计算语言研究与应用.北京:北京语言学院出版社,1993)
  • 8Song Rou, Zhu Hong et al.. Approach of personal name recognition based on corpus and rules. In: Chen Li Wei ed.. Research and Application of Computational Linguistics. Beijing:Beijing Institute of Linguistics and Culture Press, 1993(in Chinese)(宋柔,朱宏等.基于语料库和规则库的人名识别法.见:陈力为编.计算语言研究与应用.北京:北京语言学院出版社,1993)
  • 9Wang Sheng, Huang De-Gen, Yang Yuan-Sheng. Chinese person name recognition based on mixture of statistics and rules.In: Huang Chang-Ning, Dong Zhen-Dong ed.. Corpora of Computational Linguistics. Beijing: Tsinghua University Press, 1999 (in Chinese)(王省,黄德根,杨元生.基于统计和规则相结合的中文姓名识别.见:黄昌宁,董振东编.计算语言学文集.北京:清华大学出版社,1999)
  • 10Chen Xiao-He. Automatic Analysis of Modern Chinese. Beijing: Beijing University Linguistics and Culture Press, 2000,104-114(in Chinese)(陈小荷.现代汉语自动分析.北京:北京语言文化大学出版社, 2000, 104-114 )

共引文献142

同被引文献93

引证文献13

二级引证文献43

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部