自适应抗噪的清/浊/静音判决算法

Adaptive anti-noise unvoiced / voiced / silence detection algorithm

下载PDF

导出

摘要清/浊/静音判决(UVS)是语音压缩、合成以及识别中的一个重要参数。为了解决传统判决方法训练过程复杂,导致语音编码效率低的问题,给出一种无训练过程的判决方法。提取基于循环平均幅度差的特征参量,利用判决参数间的相关性,自适应调整阈值,实现清/浊/静音判决。该判决方法具有很好的抗噪声干扰能力,有效提高判决的准确率。测试结果表明:该算法简化了清/浊/静音判决的计算量,清音误判率降低了10%,浊音误判率保持在4%以内;将该算法应用于低速率语音编码方案MELP(mixed excitation linear prediction)0.6 kbps的清浊音判决中,解码后的合成语音质量优于原始MELP编码方案,PESQ分数提高0.3,具有较好的可懂度和自然度。 The Unvoiced/ Voiced/ Silence detection UVS provides a preliminary acoustic segment which is a key parameter in speech compression synthesis and recognition.The complication of traditional UVS methods？？ training procedure causes low efficiency of speech vocoder.To solve this problem a UVS detection without training proceeding is proposed in this paper.After new characteristic parameters of unvoiced and voiced signal are extracted adaptable threshold is proposed based on the correlation of those parameters.With its perfect an？ti？noise ability the correct rate of this detection improves sharply.The simulation result shows that this algorithm not only simplifies the unvoiced/ voiced/ silence detection but also efficiently decreases 10% of unvoiced and maintains lower than 4% of voiced discrimination error.The improved 0.6 kbps MELP vocoder applying this detection algorithm gets a 0.3 higher PESQ score and better synthetic speech performance compared with original vocoder which produces good natural and intelligible speech.

作者李荣芸赵晓群徐静云

机构地区同济大学电子与信息工程学院

出处《燕山大学学报》 CAS 北大核心 2015年第2期133-138,共6页 Journal of Yanshan University

基金国家自然科学基金资助项目(61271248)

关键词模式识别清/浊/静音判决自适应阈值低信噪比低速率语音编码 pattern recognition unvoiced/voiced/silence detection adaptive threshold low SNR low bit-rate speech coding

分类号 TN912 [电子电信—通信与信息系统]

引文网络
相关文献

参考文献21

1HUANG CHIEN-UN, MATSUDA S, HORI C. Feature nonnalization using MVAW processing for spoken language recognition [C]//2013 Asia-Pacific Signal and Infonnation Processing Association Annual Summit and Conference, Kaohsiung, Taiwan, 2013: 1-4.
2WU BING-FEI, WANG KUN-CHING.Robust endpoint detection alg0- rithm based on the adaptive band-partitioning spectral entropy in adverse environments [J] .IEEE Transactions on Speech and Audio Processing, 2005, 13{ 5) : 762- 775.
3Wu Gin-Der, Huang Pang-Hsuan.A vectorization-optimization-method based type- 2 fuzzy neural network for noisy data classification [J]. IEEE Transactions on Fuzzy Systems, 2013, 21( 1): 1-15.
4CHOMORIlG, ZHANG ZE. Research on endpoint detection for mongolian speech based on support vector machine [C]/ 12011 International Conference on Intelligence Science and Information Engineering, Wuhan, China, 2011: 290-294.
5VUPPALA A K, YADAV 1, CHAKRABARTI S, et aI.Vowel onset point detection for low bit rate coded speech [J].IEEE! ACM Transactions on Audio, Speech, and language Processing.2012, 20( 6) : 1894- 1903.
6MING J, HAZEN T J, GLASS J R, et aI.Robust speaker recognition in noisy conditions [J] . IEEE! ACM Transactions on Audio, Speech, and Language Processing, 2007, 15( 5): 1711-1723.
7LASKOWSKI K.Contrasting emotion-bearing laughter types in multiparticipant vocal activity detection for meetings [CJIIIEEE International Conference on Acoustics, Speech and Signal Processing, Taipei, Taiwan, 2009: 4765-4768.
8SHAO C, BOUCHARD M.Efficient classification of noisy speech using neural networks [C]IISeventh International Symposiun on Signal Processing and Its Applications, 2003: 357-360.
9XU HAmAN, DAI.SGAARD P, TAN ZHENG-HUA, et aI.Noise condition-dependent training based on noise classification and SNR estimation [J]. IEEE! ACM Transactions on Audio, Speech, and Language Processing,2007,15 (8):2431-2443.
10BERlTElli F, CASALE S, SERRANO S.Adaptive V/UV speech detection based on acoustic noise estimation and classification [J] .Elec- Ironies Letters, 2007,43( 4): 249-251.

二级参考文献14

1成新民,曾毓敏,赵力.一种改进的AMDF求取语音基音的方法[J].微电子学与计算机,2005,22(11):162-164. 被引量：16
2刘建,郑方,吴文虎.基于幅度差平方和函数的基音周期提取算法[J].清华大学学报（自然科学版）,2006,46(1):74-77. 被引量：22
3A．V奥本海姆黄建国等（译）.离散时间信号处理[M].北京:科学出版社,1998..
4杨行逡迟惠生等.语音信号数字处理[M].北京：电子工业出版社,1995..
5Wolfgang Hess. Pitch Determination of Speech Signals [ M ]. New York: Springer-Verlag, 1983.
6Ross M J, et al. Average magnitude difference function pitch extractor[J]. IEEE Trans on Acoustics, Speech, and Signal Processing, 1974,22(5) :353 - 362.
7Thomas W Parsons. Voice and Speech Processing [ M]. New York:Mc-Graw-Hill, 1986.
8ROSS M J,SHAFFER H L,COHEN A,et al.Average magnitude difference function pitch extractor[J].IEEE Transactions on Acoustics,Speech and Signal Processing,1974,22(5):353-362.
9顾良,刘润生.高性能汉语语音基音周期估计[J].电子学报,1999,27(1):8-11. 被引量：19
10宗源,曾毓敏,孙永熙,郑瑞.基于EMD的AMDF基音检测改进算法[J].南京师范大学学报（工程技术版）,2013,13(1):62-67. 被引量：6

共引文献50

1张超琼,苗夺谦,岳晓冬.基于高斯混合模型的语音性别识别[J].计算机应用,2008,28(S2):360-362. 被引量：1
2李娟娟,俞一彪,薛广荣.说话人性别识别系统的DSP实现[J].现代电子技术,2005,28(24):37-39. 被引量：1
3赵彦平,赵晓晖.用于语音端点检测的鲁棒性特征提取新方法[J].吉林大学学报（工学版）,2006,36(1):77-81. 被引量：6
4刘建,郑方,吴文虎.基于幅度差平方和函数的基音周期提取算法[J].清华大学学报（自然科学版）,2006,46(1):74-77. 被引量：22
5李飞,覃爱娜,赖旭芝.过渡音的基音周期检测方法[J].中南大学学报（自然科学版）,2006,37(4):786-789. 被引量：1
6刘建,郑方,邓菁,吴文虎.基于混合幅度差函数的基音提取算法[J].电子学报,2006,34(10):1925-1928. 被引量：16
7罗亚飞,鲍长春.基于DCT分带谱熵与信号分解的高精度基音检测算法[J].电子学报,2007,35(1):13-22. 被引量：5
8余伶俐,蔡自兴,陈明义.语音信号的情感特征分析与识别研究综述[J].电路与系统学报,2007,12(4):76-84. 被引量：27
9徐明,陈知困,黄云森.基于FFT-ACF和候选值估计的基音周期提取方法[J].深圳大学学报（理工版）,2007,24(4):388-392. 被引量：2
10王佑民,赵杰,江城.从存在伴奏的歌曲中提取歌声基音的时域算法[J].电子工程师,2007,33(11):33-36. 被引量：1

1肖尹浩.基于LMS算法的自适应抗噪通话系统的研究[J].农业网络信息,2006(9):22-24.
2朱益厅,李永明,陈弘毅.一种多带清浊音判决方法[J].微电子学与计算机,1999,16(5):1-4. 被引量：3
3刘庆华,唐宁,黄冰.自适应语音抗噪技术的实时实现[J].桂林电子工业学院学报,2001,21(1):66-69. 被引量：4
4张祎.基于软信息的迭代译码算法[J].消费电子,2012(07X):143-143.
5陈小利,徐金甫.基于小波变换和时域波形的基音检测算法[J].现代电子技术,2011,34(1):77-79. 被引量：4
6李艳玲,李兵兵,刘明骞.瑞利衰落信道下MQAM信号的盲识别方法[J].华中科技大学学报（自然科学版）,2012,40(4):76-79. 被引量：3
7李爱平,党幼云.VQ声纹识别算法和实验[J].西安工程科技学院学报,2007,21(6):848-851. 被引量：1
8魏广英.一种改进的基于ATeager能量和循环平均幅度差的基音检测[J].福建电脑,2008,24(2):96-96.
9周志杰,胡光锐.采用非线性网络实现清浊音判决[J].南京航空航天大学学报,1998,30(1):47-51. 被引量：4
10薛胜尧.基于改进型双门限语音端点检测算法的研究[J].电子设计工程,2015,23(4):78-81. 被引量：21

燕山大学学报

2015年第2期

浏览历史

内容加载中请稍等...

自适应抗噪的清/浊/静音判决算法

参考文献21

二级参考文献14

共引文献50

相关作者

相关机构

相关主题

浏览历史