期刊文献+

面向电力行业的热词语音识别技术 被引量:3

Hot Word Speech Recognition Technology for Power Industry
下载PDF
导出
摘要 随着智能电网的高速发展,电网业务中对语音识别的需求也在不断增加。然而面向公共领域的语音识别技术很难识别出电网特有的专业信息词汇,使得电力行业语音的识别准确率不高。基于此,设计一种面向电力行业的热词语音识别技术,技术首先构建CTC声学模型将语音信息转化为基本音素信息,再利用电力行业热词库构建针对电力数据的Transformer语言模型,最后通过语言模型和发音字典将基本音素信息解码为中文信息,并通过基于南网信息系统语料库的实验验证该方法的有效性。 With the rapid development of the smart grid,the demand for voice recognition in the grid business is also increasing.However,the voice recognition technology for the public domain is difficult to recognize the professional information vocabulary unique to the power grid,which makes the voice recognition accuracy of the power industry not high.Based on this,this paper designs a hot word speech recognition technology for the power industry.The technology first builds a CTC acoustic model to convert voice information into basic phoneme infor⁃mation,then uses the power industry hot lexicon to build a Transformer language model for power data,and finally decodes the basic pho⁃neme information into Chinese information through the language model and pronunciation dictionary.The effectiveness of this method is verified by experiments based on the corpus of the State Grid Information System.
作者 张云翔 李智诚 ZHANG Yun-xiang;LI Zhi-cheng(Shenzhen Power Supply Bureau Co.,Shenzhen 518001)
出处 《现代计算机》 2020年第22期14-17,共4页 Modern Computer
关键词 自然语言处理 语音识别 CTC TRANSFORMER Natural Language Processing Speech Recognition CTC Transformer
  • 相关文献

参考文献8

二级参考文献45

  • 1何新,王晓兰,周献中.汉语语音识别中的一种音节分割方法[J].火力与指挥控制,2004,29(6):94-96. 被引量:5
  • 2刘晓明,覃胜,刘宗行,江泽佳.语音端点检测的仿真研究[J].系统仿真学报,2005,17(8):1974-1976. 被引量:21
  • 3李金宝,屈百达,徐宝国,周小祥.基于自适应子带功率谱熵的语音端点检测算法[J].计算机工程与应用,2007,43(12):57-58. 被引量:5
  • 4赵恒,李冬梅,张玉宏.MATLAB环境下的基于GMM模型的说话人识别系统[J].微计算机信息,2007,23(31):261-263. 被引量:6
  • 5Geoffrey E. Hinton,Simon Osindero,Yee-Whye Teh.A Fast Learning Algorithm for Deep Belief Nets[J]. Neural Computation . 2006 (7)
  • 6Nair V,Hinton G E.Rectified linear units improve Restricted Boltzmann machines. Proceedings of the 27th International Conferenceon Machine Learning . 2010
  • 7Vincent P,Larochelle H,Bengio Y, et al.Extracting and composing robust features with denoising autoencoders. The 25th International Conference on Machine learning (ICML 2008) . 2008
  • 8Srivastava N,Hinton G,Krizhevsky A, et al.Dropout:A simple way to prevent neural networks from overfitting. The Journal of Machine Learning Research . 2014
  • 9Xiaodong Cui,Mohamed Afify,Yuqing Gao,Bowen Zhou.??Stereo hidden Markov modeling for noise robust speech recognition(J)Computer Speech & Language . 2013 (2)
  • 10Bengio Y,Yao L,Alain G,et al.Generalized denoising autoencoders as generative models. Advances in Neural Information Processing Systems . 2013

共引文献107

同被引文献20

引证文献3

二级引证文献4

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部