变异特征加权的异常语音说话人识别算法被引量：5

Speaker Recognition Algorithm for Abnormal Speech Based on Abnormal Feature Weighting

下载PDF

导出

摘要常用的加权算法难以跟踪非常态语音特征的变异,为此,文中提出了一种变异特征加权的异常语音说话人识别算法.首先统计大量正常语音各阶MFCC特征的概率分布,建立正常语音特征模板;然后用测试语音特征与正常语音特征模板之间的K-L距离和欧氏距离来度量语音的变异程度,确定K-L加权因子和欧氏加权因子;最后利用加权因子对测试语音的MFCC特征进行加权,并将加权后的特征输入高斯混合模型进行异常语音说话人识别.实验结果表明,文中提出的K-L加权和欧氏加权的异常语音说话人识别算法的整体识别率分别为46.61%和42.25%,而基于各阶特征对说话人识别贡献的加权算法和不加权算法的整体识别率分别为39.68%和36.36%. As the commonly-used weighting algorithm is inefficient in tracking the abnormal feature of abnormal speech,a speaker recognition algorithm for abnormal speech is proposed based on the abnormal feature weighting.In this algorithm,first,a feature template of normal speech is established by computing the probability distribution of MFCC features of each order in a large number of normal speech samples.Then,the K-L distance and the Euclidean distance are used to measure the differences between a given test speech and the normal speech templates and to further determine the K-L and the Euclidean weighting factors.Finally,the two weighting factors are used to weight the MFCC features of the test speech,and the weighted MFCC features are input in the Gaussian mixture model for the speaker recognition with abnormal speech.Experimental results show that the global recognition rates of the speaker recognition algorithms based on the K-L weighting and the Euclidean weighting are respectively 46.61% and 42.25%,while those of the algorithms with and without the weighting of speaker recognition contribution of each order feature are respectively only 39.68% and 36.36%.

作者何俊李艳雄贺前华李威

机构地区华南理工大学电子与信息学院

出处《华南理工大学学报（自然科学版）》 EI CAS CSCD 北大核心 2012年第3期106-111,共6页 Journal of South China University of Technology(Natural Science Edition)

基金国家自然科学基金资助项目(60972132 61101160) 广东省自然科学基金团队项目(9351064101000003) 广东省自然科学基金博士科研启动项目(10451064101004651) 华南理工大学中央高校基本科研业务费专项资金资助项目(2011ZM0029)

关键词异常语音说话人识别变异特征加权 K-L距离加权因子 abnormal speech speaker recognition abnormal feature weighting K-L distance weighting factor

分类号 TN912.3 [电子电信—通信与信息系统]

引文网络
相关文献

参考文献12

1Rashid R A,Mahalin N H,Sarijari M A,et al.Securitysystem using biometric technology design and implementa-tion of voice recognition system[C]∥Proceedings of In-ternational Conference on Computer and CommunicationEngineering.Kuala Lumpur:IEEE,2008:898-902.
2杨继臣,贺前华,潘伟锵,徐益君,李艳雄.一种改进的BIC说话人改变检测算法[J].华南理工大学学报（自然科学版）,2009,37(9):47-51. 被引量：5
3张磊,韩纪庆,王承发.变异语音处理的研究进展[J].电子学报,2003,31(3):411-418. 被引量：3
4Alpan A,Maryn Y,Kacha A,et al.Multi-band dysperio-dicity analyses of disordered connected speech[J].SpeechCommunication,2011,53(1):131-141.
5Maciel C D,Pereira J C,Stewart D.Identifying healthyand pathologically affected voice signals[J].IEEE SignalProcessing Magazine,2010,27(1):120-123.
6Togneri R,Pullella D.An overview of speaker identifica-tion:accuracy and robustness issues[J].Circuits andSystems Magazine,2011,11(2):23-61.
7Garner Philip N.Cepstral normalisation and the signal tonoise ratio spectrum in automatic speech recognition[J].Speech Communication,2011,53(8):991-1001.
8Yang Hong-wu,Liu Ya-li,Huang De-zhi.Speaker recogni-tion based on beighted Mel-cepstrum[C]∥Proceedingsof the Fourth International Conference on Computer Sci-ences and Convergence Information Technology.Seoul:IEEE,2009:200-203.
9Weng Zufeng,Li Lin,Guo Donghui.Speaker recognitionusing weighted dynamic MFCC based on GMM[C]∥Proceedings of International Conference on Anti-Counter-feiting Security and Identification in Communication.Chendu:IEEE,2010:285-288.
10Kullback S,Leibler R.On information and sufficiency[J].Annals of Mathematical Statistics,1951,30(3):79-86.

二级参考文献22

1张家騄.超音段特征间的相互作用[J].声学学报,1993,18(4):263-271. 被引量：3
2韩纪庆,张磊,王承发.心理紧张情况下的Robust语音识别方法[J].计算机科学,2000,27(9):44-46. 被引量：1
3吕成国张磊韩纪庆等.G-Stress和Lombard效应作用下的变异语音语谱图[J].高技术通讯增刊,2000,:223-226.
4Kaiser J F.On a simple algorithm to calculate the ‘energy'' of a signal [A]..I CASSP''90 [C].USA:IEEE Press,1990.381-384.
5潘胜昔刘加江金涛等.基于多模式及集成判决的稳健电话语音识别算法研究[A].王承发张凯.第五届全国人机语音通讯学术会议论文集[C].,1998.154-159.
6马永林.[D].哈尔滨:哈尔滨工业大学工学,20 01.
7马永林韩纪庆张磊等.应力影响下的变异语音分类[A]..863计划智能计算机主题学术会议论文集[C].,2001.374-378.
8Margarita Kotti,Luis Gustaro. Automatic speaker segmentation using muhiple feature and distance measure:a comparison of three approaches [ C ]//Proceedings of IEEE International Conference on Multimedia and Expo. Toronto : IEEE ,2006 : 1 101-1 104.
9Amit S Malegaonkar, Aladdin M Ariyaeeinia, Perasiriyan Sivakumaran. Efficient speaker change detection using adapted Gaussian mixture models [ J ]. IEEE Transactions on Audio, Speech and Language Processing, 2007, 15 (6) :1 859-1 869.
10Soonil kwon, Shrikanth Narayanan. Unsupervised speaker indexing using generic models[J].IEEE Transactions on Speech and Audio Processing ,2005,13 ( 5 ) : 1004-1013.

共引文献6

1杨继臣,吴裕玲,苏杰华.基于核密度估计的说话人改变检测[J].仲恺农业工程学院学报,2012,25(3):40-41.
2何俊,贺前华,张清华,孙国玺,肖明,左敬龙.基于共同向量的非常态语音说话人识别算法[J].计算机工程与科学,2014,36(8):1599-1603.
3吴伟,李艳雄,王梓里,陈祝允.基于语速差异的新闻发布会中首要说话人检测[J].计算机工程与应用,2015,51(4):222-225.
4李威,贺前华,李艳雄.一种多说话人角色聚类方法[J].华南理工大学学报（自然科学版）,2015,43(1):21-27. 被引量：2
5张阳,刘景天,姜囡.气体变声语音的声学特征变异分析研究[J].光电技术应用,2019,34(2):40-45.
6王方丽,傅嘉俊.基于Python的BIC语音分割算法的实现与应用[J].计算机与数字工程,2020,48(4):763-766. 被引量：2

同被引文献66

1王立媛,刘玉萍,肖青,祁金刚.胎儿心率信号的替代数据分析[J].长春理工大学学报（自然科学版）,2007,30(1):72-75. 被引量：2
2Dibazar A A, Park H O, Berger T W. Nonlinear dynamic modeling of impaired voice [ C]//Proceedings of 2010Annual International Conference of the IEEE Engineering in Medicine and Biology Society. Buenos Aires:IEEE, 2010:2770-2773.
3Tavares R, Brunet N, Costa S C, et al. Combining entropy measurements and cepstral analysis for pathological voice assessment [ C ]//Proceedings of 2011 ISSNIP Biosignals and Biorobotics Conference. Vitoria : IEEE, 2011 : 1 -5.
4Thomas M, Gudnason J, Naylor P. Estimation of glottal closing and opening instants in voiced speech using the YAGA algorithm [J]. IEEE Transactions on Audio, Speech, and Language Processing,2012,20 ( 1 ) : 82- 91.
5Arias-Londono J D, Godino-Llorente J I, Saenz-Lechon N, et al. Automatic detection of pathological voices using complexity measures, noise parameters, and Mel-Cepstral coefficients [ J ]. IEEE Transactions on Biomedical Engi- neering,2011,58 (2) :370-379.
6Maciel C D, Pereira J C, Stewart D. Identifying healthy and pathologically affected voice signals [ J ]. IEEE Signal Processing Magazine, 2010,27 ( 1 ) : 120-123.
7Brockmann Meike, Drinnan Michael J, Storck Claudio, et al. Reliable Jitter and Shimmer measurements in voice clinics : the relevance of vowel, gender, vocal intensity, and fundamental frequency effects in a typical clinical task [J]. Journal of Voice,201 1,25( 1 ) :44-53.
8Kasuya H,Endo Y, Saliu S. Novel acoustic measurements of Jitter and Shimmer characteristics from pathologic voice [ C] //Proceedings of the Third European Conference on Speech Communication and Technology. Berlin:Anne Bon- neau, 1993 : 1973-1976.
9Kasuya H, Ogawa S, Mashima K, et al. Normalized noise energy as an acoustic measure to evaluate pathologic voice [J]. Journal of the Acoustical Society of America, 1986, 80(5) :1329-1334.
10FrohlichM, Michaelis D, Werner Strube H. Acoustic" brea- thiness measures" in the description of pathologic voices [ C ]// Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing. Seattle : IEEE, 1998 : 937- 940.

引证文献5

1贺前华,何俊,李艳雄,王志峰.基于相关维数的病变连续语音检测算法[J].华南理工大学学报（自然科学版）,2012,40(6):1-5. 被引量：1
2朱华虹,贺前华,李艳雄,张雪源.基于随机映射的声纹模板保护方法[J].华南理工大学学报（自然科学版）,2013,41(5):48-54. 被引量：1
3何俊,贺前华,张清华,孙国玺,肖明,左敬龙.基于共同向量的非常态语音说话人识别算法[J].计算机工程与科学,2014,36(8):1599-1603.
4李江,赵雅琼,包晔华.基于混沌和替代数据法的中风病人声音分析[J].浙江大学学报（工学版）,2015,49(1):36-41. 被引量：3
5张小恒,谢文宾,李勇明.多类型语音特征进化选择算法[J].计算机工程与应用,2016,52(14):150-155.

二级引证文献5

1姚建新.不同黄芪剂量的补阳还五汤治疗缺血性中风的临床观察[J].陕西中医,2015,36(9):1110-1112. 被引量：14
2陈慧,芮贤义.基于VC++的汽车语音驾驶助手的设计与实现[J].电声技术,2016,40(8):36-39. 被引量：1
3翟晓雪,张皓.非线性动力学分析方法在神经康复领域中的应用进展[J].中国康复医学杂志,2019,34(4):483-486. 被引量：8
4丁勇,李佳慧,唐士杰,王会勇.基于随机映射技术的声纹识别模板保护[J].计算机研究与发展,2020,57(10):2201-2208. 被引量：4
5何立,庞善民.结合年龄监督和人脸先验的语音-人脸图像重建[J].浙江大学学报（工学版）,2022,56(5):1006-1016.

1张艳.异常语音话务浅析与治理[J].河南科技,2015,34(23).
2徐晓瑶,刘娟,杨东.多径信道下MPSK信号调制识别算法的研究[J].电子技术应用,2010,36(2):103-105. 被引量：4
3贺前华,何俊,李艳雄,王志峰.基于相关维数的病变连续语音检测算法[J].华南理工大学学报（自然科学版）,2012,40(6):1-5. 被引量：1
4张雅楠,刘震.基于EMD算法与神经网络的声纹检测识别系统[J].科技创新与生产力,2014(5):91-92. 被引量：1
5马建忠,李海涛,胡地荣.无下采样轮廓波广义高斯纹理检索系统[J].信阳师范学院学报（自然科学版）,2010,23(4):606-609. 被引量：1
6马建华,刘宏伟,保铮.基于小波变换的雷达高分辨距离像识别[J].西安电子科技大学学报,2005,32(6):895-900. 被引量：3
7刘家学,高倩,吴仁彪.基于特征模板的高距离分辨率雷达像自动目标识别[J].信号处理,2003,19(1):28-32. 被引量：3
8孙慧.上行带宽动态分配:ImmenStar又出新招[J].通信世界,2006(27B):25-25.
9韩韬,陶智,顾济华,赵鹤鸣,李玲.基于BP神经网络的耳语音转换为正常语音的研究[J].通信技术,2009,42(2):152-155. 被引量：3
10陈红艳,马上,王海江.新的噪声污染灰度图像边缘检测方法[J].计算机工程与应用,2010,46(11):183-185. 被引量：1

华南理工大学学报（自然科学版）

2012年第3期

浏览历史

内容加载中请稍等...

变异特征加权的异常语音说话人识别算法被引量：5

参考文献12

二级参考文献22

共引文献6

同被引文献66

引证文献5

二级引证文献5

相关作者

相关机构

相关主题

浏览历史

变异特征加权的异常语音说话人识别算法 被引量：5

参考文献12

二级参考文献22

共引文献6

同被引文献66

引证文献5

二级引证文献5

相关作者

相关机构

相关主题

浏览历史

变异特征加权的异常语音说话人识别算法被引量：5