期刊文献+

基于规则和统计相结合的西里尔蒙古文到传统蒙古文转换方法 被引量:3

Combining of Rules and Statistics for Cyrillic Mongolian to Traditional Mongolian Conversion
下载PDF
导出
摘要 西里尔蒙古文与传统蒙古文分别是蒙古国与中国使用的蒙古文,西里尔蒙古文到传统蒙古文的转换工作不仅给两国同胞的交流带来更多的便利,而且对蒙古族的科学、文化和教育发展具有重要意义。本文结合规则与统计模型的优点,研究了西里尔蒙古文到传统蒙古文的转换方法。本文首先采用基于规则的方法对西里尔蒙古文集内词进行转换,其次对集外词的转换采用了基于联合序列模型的方法,并采用N-gram语言模型解决了一个西里尔蒙古文单词对应多个传统蒙古文单词的问题。实验结果表明,该系统单词转换错误率低至4.12%,基本达到了实用要求。 Cyrillic Mongolian and Traditional Mongolian are used in Mongolia and China, respectively. Cyrillic Mongolian to Traditional Mongolian conversion not only will bring more convenience to exchanges between the two countries, but also has great significance for scientific, cultural and educational development of Mongolian. This paper proposes a highly efficient Cyrillic Mongolian to Traditional Mongolian conversion method. It adopts the rule based approach to convert the words in the vocabulary, and the statistical model to convert the out of-vocabulary words. A large part of Cyrillic Mongolian words correspond more than one candidates in Traditional Mongolian, which is solved by the N-gram language model. Experimental results show that the word error rate is as low as 4. 12%, meeting the practical requirement.
出处 《中文信息学报》 CSCD 北大核心 2017年第3期156-162,共7页 Journal of Chinese Information Processing
基金 国家自然科学基金(61563040) 内蒙古自然科学基金(2016D06) 内蒙古大学高层次人才引进科研项目资助
关键词 西里尔蒙古文 传统蒙古文 转换 规则 联合序列模型 Cyrillic Mongolian Traditional Mongolian conversion rules joint sequence model
  • 相关文献

参考文献2

二级参考文献14

  • 1清格尔泰.蒙古语语法[M].呼和浩特:内蒙古人民出版社,1992.
  • 2中玄致え.モンゴlレ語電子化計画.[2009-01-21].http://texa.human.is.tohoku.ac.jp/-chigenlmd_cnt.J.htm#contents.
  • 3DulaMan.传统蒙古文在线文本数据库的构造法与在文本检索系统中的应用.[2011-01-30].http://www.docin.com/p-44530763.html.
  • 4Li Hao,Sarina B.The study of comparison and conversion about traditional Mongolian and Cyrillic Mongolian[C]//2011 4th International Conference on Intelligent Networks and Intelligent Systems,2011:199-202.
  • 5Zhao Lili,Men Jia,Zhang Congpin,et al.A combination of statistical and rule-based approach for Mongolian lexical analysis[C]//2010 International Conference on Asian Language Processing,Harbin,2010:7-10.
  • 6Bisani M,Ney H.Joint sequence models for grapheme-tophoneme conversion[J].Speech Communication,2008,50(5):434-451.
  • 7Wang D.Out-of-vocabulary spoken term detection[D].[S.l.]:University of Edinburgh,2010:85-110.
  • 8嘎拉桑朋斯格.基立尔蒙古文学习读本[M].呼和浩特:内蒙古教育出版社,2006.
  • 9图门吉日嘎拉.现代蒙古语[M].呼和浩特:内蒙古大学出版社,2009.
  • 10舍·却玛.蒙古文、基里尔文正字法比较研究[M].呼和浩特:内蒙古教育出版社,2010.

共引文献5

同被引文献16

引证文献3

二级引证文献12

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部