Robust Speech Recognition System Using Conventional and Hybrid Features of MFCC, LPCC, PLP, RASTA-PLP and Hidden Markov Model Classifier in Noisy Conditions (Cited by: 7)

Abstract: In recent years, the accuracy of speech recognition (SR) has been one of the most active areas of research. Although SR systems work reasonably well in quiet conditions, they still suffer severe performance degradation in noisy conditions or over distorted channels. More robust feature extraction methods are therefore needed to achieve better performance in adverse conditions. This paper investigates the performance of conventional and new hybrid speech feature extraction algorithms based on Mel Frequency Cepstrum Coefficients (MFCC), Linear Prediction Coding Coefficients (LPCC), perceptual linear prediction (PLP), and RASTA-PLP in noisy conditions, using a multivariate Hidden Markov Model (HMM) classifier. The behavior of the proposed system is evaluated on the TIDIGITS speech corpus, recorded from 208 different adult speakers, in both the training and testing processes. The theoretical basis for the speech processing and classifier procedures is presented, and the recognition results are reported in terms of word recognition rate.

Authors: Veton Z. Kepuska, Hussien A. Elharati

Affiliation: Electrical & Computer Engineering Department

Source: Journal of Computer and Communications, 2015, No. 6, pp. 1-9 (9 pages)

Classification: TN91 [Electronics and Telecommunications: Communication and Information Systems]
