期刊文献+

Tibetan Multi-Dialect Speech Recognition Using Latent Regression Bayesian Network and End-To-End Mode

下载PDF
导出
摘要 We proposed a method using latent regression Bayesian network (LRBN) toextract the shared speech feature for the input of end-to-end speech recognition model.The structure of LRBN is compact and its parameter learning is fast. Compared withConvolutional Neural Network, it has a simpler and understood structure and lessparameters to learn. Experimental results show that the advantage of hybridLRBN/Bidirectional Long Short-Term Memory-Connectionist Temporal Classificationarchitecture for Tibetan multi-dialect speech recognition, and demonstrate the LRBN ishelpful to differentiate among multiple language speech sets.
出处 《Journal on Internet of Things》 2019年第1期17-23,共7页
  • 相关文献

参考文献3

二级参考文献19

  • 1李永宏,孔江平,于洪志.藏语文-音自动规则转换及其实现[J].清华大学学报(自然科学版),2008,48(S1):621-626. 被引量:19
  • 2共确降措.论藏文[J].西藏研究,1997(3):94-108. 被引量:7
  • 3郑方,吴文虎,方棣棠.连续无限制语音流中关键词识别的研究现状[C].第四届全国人机语音通讯学术会议论文集,1996.
  • 4Steve Y.The HTK Book(for HTK Version 3.4)[D].Cambridge,UK:Engineering Department of Cambridge University,2009.
  • 5Rabiner L,Juang Biing-Hwang.Fundamentals of Speech Recognition[M].阮平望,译.北京:清华大学出版社,1993.
  • 6Dahl G E, Yu D, Deng L, et al. Context-Dependent Pre-trained Deep Neural Networks for Large Vocabulary Speech Recognition.IEEE Trans on Audio, Speech, and Language Processing, 2012, 20 ( 1 ) : 30-42.
  • 7Hinton G E, Osindero S, Teh Y W. A Fast Learning Algorithm for Deep Belief Nets. Neural Computation, 2006, 18(7) : 1527-1554.
  • 8Beulen K, Ney H. Automatic Question Generation for Decision Tree Based State Tying//Proc of the IEEE International Conference on Acoustics, Speech and Signal Processing. Seattle, USA, 1998, II: 805 -805.
  • 9Singh R, Raj B, Stern R M. Automatic Clustering and Generation of Contextual Questions for Tied States in Hidden Markov Models // Proc of the IEEE International Conference on Acoustics, Speech and Signal Processing. Phoenix, USA, 1999, I: 117-120.
  • 10Huang J T, Li J Y, Yu D, et al. Cross-Language Knowledge Trans- fer Using Muhilingual Deep Neural Network with Shared Hidden Layers//Proc of the IEEE International Conference on Acoustics, Speech and Signal Processing. Vancouver, Canada, 2013 : 7304- 7308.

共引文献28

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部