Sichuan dialect speech recognition with deep LSTM network (cited by: 4)

Abstract: Because languages vary so widely, speech recognition research must construct a corresponding recognition system for each language. Dialects are especially difficult: they contain many special words and colloquial features, and dialect speech data is very scarce. This paper constructs a speech recognition system for Sichuan dialect by combining a hidden Markov model (HMM) and a deep long short-term memory (LSTM) network. We created a Sichuan dialect dataset and, using the HMM-LSTM architecture, implemented a speech recognition system for it. Unlike a deep neural network (DNN), which captures context only from a fixed number of input items, the LSTM network can overcome this limitation. Moreover, to accurately identify polyphonic characters and words with special pronunciations in Sichuan dialect, we collect all the characters in the dataset together with their common phoneme sequences to form a lexicon. Finally, the system yields an 11.34% character error rate on the Sichuan dialect evaluation dataset. As far as we know, this is the best performance reported for this corpus at present.
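
For readers unfamiliar with the metric, character error rate (CER) is conventionally the character-level edit distance between the recognized text and the reference transcript, divided by the reference length. The abstract does not say which scoring tool the authors used, so the following is only a minimal sketch under that conventional definition. The function names `edit_distance` and `character_error_rate` and the sample sentences are hypothetical and do not come from the paper.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance: minimum substitutions, insertions, deletions."""
    # prev[j] holds the distance between the processed prefix of ref and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]  # distance from ref[:i] to the empty hypothesis
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # delete r from the reference
                            curr[j - 1] + 1,      # insert h into the reference
                            prev[j - 1] + cost))  # substitute, or match if equal
        prev = curr
    return prev[-1]


def character_error_rate(ref, hyp):
    """CER = edit distance / number of reference characters (assumed definition)."""
    if not ref:
        raise ValueError("reference transcript must be non-empty")
    return edit_distance(ref, hyp) / len(ref)


if __name__ == "__main__":
    # Hypothetical example: one substituted and one dropped character.
    reference = "今天天气很好"
    hypothesis = "今天天汽好"
    print(f"CER = {character_error_rate(reference, hypothesis):.2%}")
```

In practice the per-utterance edit distances are usually summed over the whole evaluation set before dividing by the total number of reference characters, rather than averaging per-utterance rates.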

Authors: Wangyang YING, Lei ZHANG, Hongli DENG

Affiliations: Machine Intelligence Laboratory; Education and Information Technology Center

Source: Frontiers of Computer Science (中国计算机科学前沿, English edition); indexed in SCIE, EI, CSCD; 2020, Issue 2, pp. 378-387 (10 pages)

Funding: the National Key R&D Program of China (2016YFC0801800), the General Program of the National Natural Science Foundation of China (Grant No. 61772353), the Key Program of the National Natural Science Foundation of China (Grant No. 61332002), and the Fok Ying Tung Education Foundation (151068).

Keywords: speech recognition; Sichuan dialect; HMM-DNN; HMM-LSTM; Sichuan dialect lexicon

Classification: TP391 [Automation and Computer Technology: Computer Application Technology]

