Research on grapheme to phoneme conversion for Mongolian (蒙古文字母到音素转换方法的研究)

Cited by: 4


Abstract: This paper addresses grapheme-to-phoneme (G2P) conversion for Mongolian and proposes two methods: a rule-based method and a method based on the joint-sequence model. Experimental results show that the joint-sequence-model method performs significantly better than the rule-based method. The G2P system built on the joint-sequence model achieves a word error rate of 16.32% and a phoneme error rate of only 3.37%, which meets the requirements of practical use.
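The two figures reported in the abstract can be illustrated with a minimal sketch. It assumes the usual G2P definitions, which this record does not spell out: a word counts as wrong if its predicted phoneme sequence differs from the reference in any position, and the phoneme error rate is the total edit distance divided by the total number of reference phonemes. The function names and the toy data are hypothetical and are not taken from the paper.

```python
from typing import List, Sequence, Tuple


def edit_distance(ref: Sequence[str], hyp: Sequence[str]) -> int:
    """Levenshtein distance between two phoneme sequences."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def g2p_error_rates(
    pairs: List[Tuple[Sequence[str], Sequence[str]]]
) -> Tuple[float, float]:
    """Return (word_error_rate, phoneme_error_rate) for (reference, hypothesis) pairs."""
    wrong_words = 0
    phoneme_errors = 0
    total_phonemes = 0
    for ref, hyp in pairs:
        dist = edit_distance(ref, hyp)
        wrong_words += 1 if dist > 0 else 0
        phoneme_errors += dist
        total_phonemes += len(ref)
    return wrong_words / len(pairs), phoneme_errors / total_phonemes


# Made-up placeholder symbols, not real Mongolian transcriptions.
toy_pairs = [
    (["a", "b", "c"], ["a", "b", "c"]),       # exact match
    (["d", "e", "f", "g"], ["d", "e", "g"]),  # one deletion
    (["h", "i"], ["h", "j"]),                 # one substitution
    (["k"], ["k"]),                           # exact match
]
wer, per = g2p_error_rates(toy_pairs)
print(f"WER = {wer:.2%}, PER = {per:.2%}")  # prints "WER = 50.00%, PER = 20.00%"
```

Under these definitions a single wrong phoneme makes the whole word wrong, which is why a word error rate is typically several times larger than the corresponding phoneme error rate.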

Authors: Feilong Bao (飞龙), Guanglai Gao (高光来), Xueliang Yan (闫学亮)

Affiliation: College of Computer Science, Inner Mongolia University (内蒙古大学计算机学院)

Source: Application Research of Computers (《计算机应用研究》), indexed in CSCD and the Peking University Core Journals list, 2013, No. 6, pp. 1696-1700 (5 pages)

Funding: National Natural Science Foundation of China (61263037, 71163029); Major Project of the Natural Science Foundation of Inner Mongolia (2011ZD11)

Keywords: Mongolian; grapheme-to-phoneme conversion (G2P); joint-sequence models; joint multigram; co-segmentation

Classification: TP391.1 [Automation and Computer Technology: Computer Application Technology]

Citation network

References (12)

1. MENG H M, SENEFF S, ZUE V W. Phonological parsing for bi-directional letter-to-sound/sound-to-letter generation [C]//Proc of Workshop on Human Language Technology. 1994: 289-294.
2. TORKKOLA K. An efficient way to learn English grapheme-to-phoneme rules automatically [C]//Proc of IEEE International Conference on Acoustics, Speech, and Signal Processing. 1993: 199-202.
3. BAGSHAW P C. Phonemic transcription by analogy in text-to-speech synthesis: novel word pronunciation and lexicon compression [J]. Computer Speech & Language, 1998, 12(2): 119-142.
4. MENG H. A hierarchical lexical representation for bi-directional spelling-to-pronunciation/pronunciation-to-spelling generation [J]. Speech Communication, 2001, 33(3): 213-239.
5. BISANI M, NEY H. Multigram-based grapheme-to-phoneme conversion for LVCSR [C]//Proc of INTERSPEECH. 2003: 933-936.
6. BELLEGARDA J R. Unsupervised, language-independent grapheme-to-phoneme conversion by latent analogy [J]. Speech Communication, 2005, 46(2): 140-152.
7. WANG Dong. Out-of-vocabulary spoken term detection [D]. Edinburgh: University of Edinburgh. 2010.
8. TAYLOR P. Hidden Markov models for grapheme to phoneme conversion [C]//Proc of INTERSPEECH. 2005: 1973-1976.
9. BISANI M, NEY H. Joint sequence models for grapheme-to-phoneme conversion [J]. Speech Communication, 2008, 50(5): 434-451.
10. BAO Fei-long, GAO Guang-lai. Improving of acoustic model for the Mongolian speech recognition system [C]//Proc of Chinese Conference on Pattern Recognition. 2009: 616-620.

Co-cited literature (11)

1. Feilong Bao, Guanglai Gao. The Research on Mongolian Spoken Term Detection Based on Confusion Network [C]//Proceedings of the Chinese Conference on Pattern Recognition (CCPR2012). Beijing, 2012: 606-612.
2. Feilong Bao, Guanglai Gao. Improving of Acoustic Model for the Mongolian Speech Recognition System [C]//Proceedings of the Chinese Conference on Pattern Recognition (CCPR2009). Nanjing, 2009: 616-620.
3. Feilong Bao, Guanglai Gao, Xueliang Yan. Segmentation-based Mongolian LVCSR Approach [C]//Proceedings of the 38th International Conference on Acoustics, Speech, and Signal Processing (ICASSP2013). Vancouver, 2013: 8136-8139.
4. J Mamou, B Ramabhadran and O Siohan. Vocabulary independent spoken term detection [C]//Proceedings of the ACM-SIGIR'07. Amsterdam, 2007: 615-622.
5. Ville T. Turunen and Mikko Kurimo. Indexing Confusion Networks for Morph-based Spoken Document Retrieval [C]//Proceedings of the ACM-SIGIR'07. Amsterdam, 2007: 631-638.
6. D Wang. Out-of-vocabulary spoken term detection [D]. Ph.D. dissertation, University of Edinburgh. 2010.
7. G Gosztolya and L Toth. Spoken term detection based on the most probable phoneme sequence [C]//Proceedings of the 2011 International Symposium on Applied Machine Intelligence and Informatics (SAMI) (IEEE). Slovakia, 2011: 101-106.
8. L Mangu, E Brill, and A Stolcke. Finding consensus in speech recognition: word error minimization and other applications of confusion networks [J]. Computer Speech and Language, 2000, 14(4): 373-400.
9. Young S, et al. The HTK book (Revised for HTK version 3.4.1) [M]. Cambridge University. 2009.
10. A Stolcke. SRILM - An Extensible Language Modeling Toolkit [C]//Proceedings of Intl. Conf. Spoken Language Processing. Denver, Colorado, 2002.

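Citing literature (4)

1. Feilong, Gao Guanglai, Bao Yulai (飞龙, 高光来, 鲍玉来). Research on a Mongolian spoken keyword detection method based on phoneme confusion networks [J]. Journal of Chinese Information Processing (中文信息学报), 2015, 29(1): 178-182.
2. Sarengaowa, Murengaowa (萨仁高娃, 牧仁高娃). Feature extraction of long vowels and diphthongs for building a Mongolian pronunciation dictionary [J]. Inner Mongolia Social Sciences, Mongolian edition (内蒙古社会科学（蒙文版）), 2020, 0(1): 99-103.
3. Sarengaowa (萨仁高娃). On issues concerning the syllable correspondence between written and spoken Mongolian [J]. China Mongolian Studies, Mongolian edition (中国蒙古学（蒙文）), 2020, 48(5): 22-28.
4. Wu Zecheng, Feilong, Zhang Hui, Wang Haibo (吴则诚, 飞龙, 张晖, 王海波). A non-parallel Mongolian voice conversion method based on fine-grained prosody modeling and conditional CycleGAN [J]. Journal of Signal Processing (信号处理), 2021, 37(10): 1825-1834. Cited by: 1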

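Second-level citing literature (1)

1. Wang Cuiying (王翠英). Research on synthetic speech conversion based on deep learning [J]. Automation & Instrumentation (自动化与仪器仪表), 2023(7): 196-200. Cited by: 2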

