期刊文献+

面向大词汇量的维吾尔语连续语音识别研究 被引量:7

Research on large vocabulary continuous speech recognition for Uyghur
下载PDF
导出
摘要 近年来大词汇量连续语音识别技术得到了迅速的发展,国内外研究机构加大了对汉语和英语语音识别技术的研究,然而,维吾尔语语音识别技术的研究工作最近才起步。建立了面向大词汇量的维吾尔语语音语料库,研究了维吾尔语声学模型和语言模型建模技术、解码技术,进行了面向大词汇量的维吾尔语连续语音识别实验。对维吾尔语大词汇量连续语音识别技术进一步发展中存在的问题进行了讨论。 The technology of Large Vocabulary Continuous Speech Recognition(LVCSR) has developed quickly, and many scientific institutions have reinforced the speech recognition research on the Mandarin Chinese and English. However, the study of Uyghur speech recognition technology has started recently. This paper introduces the research on main aspect of Uyghur LVCSR system, such as construction of Uyghur speech corpus, acoustic and language modeling techniques, decoding techniques, and performed experiments for Uyghur LVCSR. At the end, the issues affecting Uyghur LVCSR system are discussed in detail.
出处 《计算机工程与应用》 CSCD 2013年第9期115-119,共5页 Computer Engineering and Applications
基金 国家自然科学基金(No.61063024) 新疆大学联合科研项目(No.XY110122)
关键词 维吾尔语 语音语料库 大词汇 识别技术 Uyghur language speech corpus large vocabulary recognition technology
  • 相关文献

参考文献8

二级参考文献17

  • 1徐波,史晓东,刘群,宗成庆,庞薇,陈振标,杨振东,魏玮,杜金华,陈毅东,刘洋,熊德意,侯宏旭,何中军.2005统计机器翻译研讨班研究报告[J].中文信息学报,2006,20(5):1-9. 被引量:10
  • 2石现峰,张学智,张峰.基于HTK的语音识别系统设计[J].计算机技术与发展,2006,16(10):37-38. 被引量:23
  • 3郑方 吴文虎 等.CDCPM及其在语音识别中的应用[J].软件学报,1996,7(10):69-75.
  • 4方晓华.现代维语教程(上册,语音篇)[M].乌鲁木齐,新疆师范大学,1987..
  • 5王昆仑 樊志锦 吐尔洪江 等.维吾尔语综合语音数据库系统[A]..哈尔滨工业大学第五届全国人机语音通讯学术会议论文集.NCMMSC-96[C].,1996.366.C.
  • 6BROWN P, COCKE J, PIETRA S, et al. A statistical approach to machine translation[J]. Computational Linguistics, 1990, 16(2):79 -85.
  • 7KOEHN P, OCH F J, MARCU D. Statistical phrase-based translation[ C] // Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language. Morristown, N J: Association for Computational Linguistics, 2003:48 -54.
  • 8OCH F J, NEY H. Discriminative training and maximum entropy models for statistical machine translation[ C]// Proceedings of the 40th Annual Meeting on Association for Computational Linguistics. Morristown, NJ: Association for Computational Linguistics, 2001: 295 - 302.
  • 9STOLKE A. Srilm - An extensible language modeling toolkit [ EB / OL]. [ 2008 - 09 - 20]. http://web, iti. upv. es/-evidal/ students/doct/sht/transp/srlim2p, pdf.
  • 10OCH F J, NEY H, A systematic comparison of various statistical alignment models[ J]. Computational Linguistics, 2003, 29(!) : 19 - 51.

共引文献25

同被引文献84

  • 1热依曼.吐尔逊,吾守尔.斯拉木,努尔麦麦提.多文种手机混合输入/输出技术及实现[J].计算机工程与科学,2006,28(4):103-104. 被引量:5
  • 2郑方.连续无限制语音流中关键词识别方法研究[D],1997.
  • 3A Hauptmann,H Wactlar.Indexing andSearch of Multimodal Information[A].Proceedings of IEEE International Conference of Acoustics Speech and Signal Processing,Munich,Germany,1997[C]:195-198.
  • 4G J.E Jones,J.T.Foote,K Sparck Jones et al.Video mail retrieval:the Effect of Word Spotting Accuracy on Precision[A].International Conference on Acoustics,Speech,and Signal Processing 1995[C].ICASSP' 95,1995,1(1):309-312P.
  • 5GOOG-411[DB/OL],http://en.wikipedia.org/wiki/ GOOG-411,2008,12.
  • 6Hsin-min Wang.Mandarin Spoken Document Retrieval Based on Syllable Lattice Matching[J].Pattem Recog nition Letters.2000:615-624P.
  • 7L.Mangu,E.Brill,A.Stolcke.Finding Consensus in Speech Recognition:Word Error Minimization and Other Applications of Confusion Networks[J].Computer Speech And Language,2000,14:373-400.
  • 8Ville T.Turunen,Mikko Kurimo.Indexing Confusion Network for Morph-based Spoken document Retrieval[A],Proceedings of the SIGIR[C]//2007:631-638.
  • 9F K Soong,W K Lo,S Nakamura.Generalized Word Posterior Probablity(GWPP) for Measuring Reliability of Recognized Words[A].Proceeding of the SWIM2004,2004:127-128.
  • 10F Wessel,R Schluter,K Macherey et al.Confidence Maesures for Large Vocabulary Continuous Speech Recognition[A].IEEE Transactions on Speech and Audio Processing,2001,9(3):288-298.

引证文献7

二级引证文献21

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部