摘要
为了改进传统的语音识别方法,使其达到低时延、高识别率的目的,以前馈序列记忆网络(feed-forward sequential memory network,FSMN)为研究主线,对语音识别的发展历程,FSMN的各种变体及其相应的应用状况进行综述,结果表明FSMN在语言建模任务上,使用wiki 9数据集能使Perplexity值降到90,并且训练周期比RNN快2倍.未来语音识别的发展趋向于更高识别性能和更多语种.
In order to improve the traditional speech recognition method to achieve the purpose of low latency and high recognition rate,a review was made of applications of(feed-forward sequential memory network,FSMN)in terms of the development of speech recognition,the various variants of FSMN with feedforward sequence memory network as the research main line.The results showed that FSMN can reduce the Perplexity to 90 by use of the wiki9 data set on solving language modeling tasks,and the training cycle is 2 times faster than by use of RNN.Future developing trend is directed to a higher recognition performance in speech recognition and multilingual speech recognition.
作者
付婧
罗建
龙彦霖
苗晨
程玉勤
FU Jing;LUO Jian;LONG Yanlin;MIAO Chen;CHENG Yuqin(School of Computer Science, China West Normal University, Nanchong,Sichuan 637009, China)
出处
《内江师范学院学报》
2020年第4期41-51,共11页
Journal of Neijiang Normal University
基金
国家自然科学基金项目(61871330)
四川省教育厅自然科学重点项目(18ZA0468,14ZA0123)
西华师范大学英才科研基金项目(17YC155,17YC157)。