期刊文献+

Sichuan dialect speech recognition with deep LSTM network 被引量:4

原文传递
导出
摘要 In speech recognition research,because of the variety of languages,corresponding speech recognition systems need to be constructed for different languages.Especially in a dialect speech recognition system,there are many special words and oral language features.In addition,dialect speech data is very scarce.Therefore,constructing a dialect speech recognition system is difficult.This paper constructs a speech recognition system for Sichuan dialect by combining a hidden Markov model(HMM)and a deep long short-term memory(LSTM)network.Using the HMM-LSTM architecture,we created a Sichuan dialect dataset and implemented a speech recognition system for this dataset.Compared with the deep neural network(DNN),the LSTM network can overcome the problem that the DNN only captures the context of a fixed number of information items.Moreover,to identify polyphone and special pronunciation vocabularies in Sichuan dialect accurately,we collect all the characters in the dataset and their common phoneme sequences to form a lexicon.Finally,this system yields a 11.34%character error rate on the Sichuan dialect evaluation dataset.As far as we know,it is the best performance for this corpus at present.
出处 《Frontiers of Computer Science》 SCIE EI CSCD 2020年第2期378-387,共10页 中国计算机科学前沿(英文版)
基金 the National Key R&D Program of China(2016YFC0801800) General Program of the National Natural Science Foundation of China(Grant No.61772353) the Key Program of the National Natural Science Foundation of China(Grant No.61332002) and Fok Ying Tung Education Foundation(151068).
  • 相关文献

同被引文献27

引证文献4

二级引证文献15

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部