Abstract
To address Mongolian grapheme-to-phoneme (G2P) conversion, this paper proposes two methods: a rule-based method and a method based on the joint-sequence model. Experimental results show that the joint-sequence method performs significantly better than the rule-based method. The Mongolian G2P conversion system built on the joint-sequence model achieves a word error rate of 16.32% and a phoneme error rate of only 3.37%, which meets the requirements of practical use.
Source
Application Research of Computers (《计算机应用研究》)
CSCD
PKU Core (北大核心)
2013, No. 6, pp. 1696-1700 (5 pages)
Funding
National Natural Science Foundation of China (61263037, 71163029)
Inner Mongolia Natural Science Foundation, Major Project (2011ZD11)
Keywords
Mongolian
grapheme-to-phoneme conversion (G2P)
joint-sequence models
joint multigram
co-segmentation