一种基于奇异谱的语音激活检测方法被引量：1

A method of voice activity detection based on spectrum of singular value

下载PDF

导出

摘要为了提高语音激活检测在低信噪比环境中的检测性能,提出了一种基于奇异谱的语音激活检测方法。首先用多窗口方法计算每一帧语音信号的相关矩阵;然后对相关矩阵进行奇异值分解;利用奇异值可以反映有用信号和噪声分布情况的特性,将每一帧语音信号经过加权处理后的最大奇异值与自适应阈值进行比较进行语音激活检测。该方法原理简单,易于硬件实现,通过实验仿真表明,在低信噪比环境下,和基于对数能量方法相比,本文方法也能够很好的区分语音段和非语音段,有良好的检测性能。 In order to improve the performance of voice activation detection at low SNR（Signal to Noise Ratio）, we proposed a detection approach of voice activity based on singular spectrum. Firstly, we calculate the correlation matrix for each frame of speech signal with multi-window approach; then performed singular value decomposition to the correlation matrix; due to the singular value reflects the characteristics of the useful signal and noise distribution, we can perform activity detection through comparing the weighted maximum singular value of each frame of speech signal with the adaptive threshold value. This method is simple and can be easily implemented in hardware. The simulation indicates that compared with energy method based on logarithm, in low SNR environment, this approach can better distinguish speech segments with non-voice segment better.

作者曹亮张天骐周圣胡然

机构地区重庆邮电大学信号与信息处理重庆市重点实验室

出处《应用声学》 CSCD 北大核心 2013年第2期137-143,共7页 Journal of Applied Acoustics

基金国家自然科学基金项目(61071196 61102131) 教育部新世纪优秀人才支持计划项目(NCET-10-0927) 信号与信息处理重庆市市级重点实验室建设项目(CSTC2009CA2003) 重庆市杰出青年基金项目(CSTC2011jjjq40002) 重庆市自然科学基金项目(CSTC2009BB2287 CSTC2010BB2398 CSTC2010BB2409 CSTC2010BB2411)资助

关键词语音激活检测 Slepian数据窗离散扁椭圆序列相关矩阵奇异值分解自适应阈值 Voice activity detection, Slepian data window, Discrete prolate spheroidal sequences,Correlation matrix, Singular value decomposition, Adaptive threshold

分类号 TN912.34 [电子电信—通信与信息系统]

引文网络
相关文献

参考文献12

1LEE H, YOOK D. Space-time voice activity detection[J]. IEEE Transaction Signal Process, 2009, 55(3)" 1471-1476.
2MARZNZIK M, KOLLMEIER B. Speech pause detection for noise spectrum estimation by tracking power envelope dynamics[J]. IEEE Transaction on Speech and Audio Processing. 2002, 10(2): 109-118.
3LI Qi, ZHANG Jinsong, TSAI A, et al. Robust endpoint detection and energy normalization for real-time speech and speaker recognition[J]. IEEE Transaction on Speech and Audio Processing.2002, 10(3): 146-157.
4朱晓晶,侯旭初,崔慧娟,唐昆.基于LPCC和能量熵的端点检测[J].电讯技术,2010,50(6):41-45. 被引量：6
5GAZOR S, ZHANG W. A soft voice activity detector based on a laplacian-gaussian model[J]. IEEE Transaction on Signal and Audio Processing. 2003, 11(5): 498-505.
6CHANG J H, NAM S K, MITRA S K. Voice activity detection based on multiple statistical models[J]. IEEE Transactions on Signal Processing. 2006, 54(6): 1965-1976.
7RIGOZO N R, ECHER E, NORDEMANN D J R, et al. Comparative study between for classical spectral analysis methods[J]. Applied Mathematics and Computation, 2005, 168( 1 ): 411-430.
8ALLEN B, OTTEWILL A. Multi-taper spectral analysis in gravitational wave data analysis[J]. General Relativity and Gravitation, 2000, 32(3): 385-398.
9BANSAL A R, DIMRI V P, SAGAR G V. Depth estimation from gravity data using the maximum entropy method and the multi taper method[J]. Pure and Applied Geophysics. 2006, (163): 1417-1434.
10WU Bingfei, WANG Kunching. Rubost Endpoint detection algorithm based on the adaptive band-partitioning spectral entropy in adverse environments[J]. IEEE Transaction on Speech and Audio Processing, 2005, 13(5): 762-775.

二级参考文献30

1刘晓明,覃胜,刘宗行,江泽佳.语音端点检测的仿真研究[J].系统仿真学报,2005,17(8):1974-1976. 被引量：21
2李晔,张仁智,崔慧娟,唐昆.低信噪比下基于谱熵的语音端点检测算法[J].清华大学学报（自然科学版）,2005,45(10):1397-1400. 被引量：37
3侯周国,钱盛友,姚畅.短时域语音端点检测中谱熵算法的改进[J].计算机工程与应用,2006,42(21):55-56. 被引量：3
4Junqua J C,Mak B,Reaves B.A robust algorithm for word boundary detection in the presence of noise[J].IEEE Transactions on Speech and Audio Processing,1994,2(3):406-412.
5Beritelli F,Casale S,Ruggeri G,et al.Performances evaluation and comparision of G.729/AMR/fuzzy voice activity detectors[J].IEEE Signal Processing Letters,2002,9(3):85-88.
6Pencak J,Neloson D.The NP speech activity detection algorithm[C]//Proceedings of 1995 International Conference on Acoustics,Speech and Signal Processing.Detroit,MI,USA:[s.n.],1995:381-384.
7Reynolds D,Rose R.Robust text-independent speaker identification using Gaussian mixture speaker models[J].IEEE Transactions on Speech and Audio Processing,1995,3(1):72-83.
8Reynolds D A,Quatieri T F,Dunn R B.Speaker Verification Using Adapted Gaussian Mixture Models[J].Digital Signal Processing,2000,10(1):19-41.
9Dempster A D,Laird N M,Rubin D B.Maximum likelihood from incomplete data via the EM algorithm[J].Journal of the Royal Statistical Society,1977,39(2):1-37.
10Gish H,Schmid M.Text-Independent Speaker Identification[J].IEEE Signal Processing Magazine,1994,11(4):18-32.

共引文献41

1彭柏,许刚.利用改进的LF模型进行语音嗓音源合成[J].电声技术,2006,30(5):53-57.
2李圆,赵振东,杨超.基于多带谱相减的语音端点检测算法[J].通信技术,2007,40(11):353-355. 被引量：4
3刘华平,李昕,徐柏龄,姜宁.语音信号端点检测方法综述及展望[J].计算机应用研究,2008,25(8):2278-2283. 被引量：40
4李晋,刘甫,王玲,许慧燕.改进的语音端点检测技术[J].计算机工程与应用,2009,45(24):133-135. 被引量：9
5赵欢,王纲金,赵丽霞.一种新的对数能量谱熵语音端点检测方法[J].湖南大学学报（自然科学版）,2010,37(7):72-77. 被引量：17
6贺怀清,高金枝.两类噪声谱估计方法的对比分析[J].计算机工程与应用,2010,46(23):154-158. 被引量：3
7刘柏森,卢志茂,申丽然,金辉.基于希尔伯特-黄变换的低信噪比语音端点检测[J].吉林大学学报（工学版）,2011,41(3):844-848. 被引量：7
8王景芳.实时语音端点鲁棒检测[J].计算机工程与应用,2011,47(20):147-150. 被引量：4
9马飞,徐海锋.高质量通信信号时频图处理及应用[J].计算机与网络,2011,37(8):50-52.
10王振寰,高炜,梁立.基于一阶有限差分商的带噪语音端点检测方法[J].昆明学院学报,2011,33(3):53-54.

同被引文献7

1戴元红,陈鸿昶,乔德江,李乐.基于短时能量比的语音端点检测算法的研究[J].通信技术,2009,42(2):181-183. 被引量：10
2吕卫强,黄荔.基于短时能量加过零率的实时语音端点检测方法[J].兵工自动化,2009,28(9):69-70. 被引量：15
3赵新燕,王炼红,彭林哲.基于自适应倒谱距离的强噪声语音端点检测[J].计算机科学,2015,42(9):83-85. 被引量：15
4王晓华,屈雷.基于时频参数融合的自适应语音端点检测算法[J].计算机工程与应用,2015,51(20):203-207. 被引量：7
5纪振发,杨晖,李然,金银超.基于短时自相关及过零率的语音端点检测算法[J].电子科技,2016,29(9):52-55. 被引量：13
6洪奕鑫,张浩川,余荣,吴哲顺.语音端点检测在实时语音截取中的应用[J].无线互联科技,2017,14(22):50-53. 被引量：3
7王群,曾庆宁,郑展恒.低信噪比下语音端点检测算法的改进研究[J].科学技术与工程,2017,17(21):50-56. 被引量：8

引证文献1

1苗晓孔,张雄伟.采用骨导语音自适应的语句分割方法[J].应用声学,2019,38(1):68-75.

1杨海博,王海燕,申晓红.奇异谱技术在混沌背景下微弱信号检测中的应用[J].计算机测量与控制,2012,20(3):593-595. 被引量：5
2梁峰,杨勇,曹军勤,张凡.一种新型实用的语音激活检测方法[J].计算机与网络,2012,38(19):59-61.
3Cirrus Logic最新智能音频编解码提供先进音频特性[J].单片机与嵌入式系统应用,2017,17(1):88-88.
4刘福星,何选森.三阶累积量的语音激活检测方法[J].计算机工程与应用,2011,47(17):137-139. 被引量：2
5陈明义,李微,黎华.基于小波包变换的自适应门限的语音激活检测[J].计算机仿真,2009,26(3):340-342.
6宋喆,张德民,张天骐.一种改进的基于子带谱熵的语音激活检测方法[J].重庆邮电大学学报（自然科学版）,2009,21(6):725-730. 被引量：3
7张敏.一种基于分带谱熵的语音激活检测算法[J].微型机与应用,2010,29(20):43-45. 被引量：1
8吴健国,黄建军.消除特定谐波的短数据窗数字滤波器[J].重庆电业,1991(1):52-54.
9李勇发,左小清,杨芳,徐晶.基于小波奇异谱及SVDD的轴承故障检测方法[J].轴承,2016(8):46-49. 被引量：4
10齐峰岩,鲍长春.一种具有鲁棒性的语音激活检测方法[J].信号处理,2005,21(z1):172-175.

应用声学

2013年第2期

浏览历史

内容加载中请稍等...

一种基于奇异谱的语音激活检测方法被引量：1

参考文献12

二级参考文献30

共引文献41

同被引文献7

引证文献1

相关作者

相关机构

相关主题

浏览历史

一种基于奇异谱的语音激活检测方法 被引量：1

参考文献12

二级参考文献30

共引文献41

同被引文献7

引证文献1

相关作者

相关机构

相关主题

浏览历史

一种基于奇异谱的语音激活检测方法被引量：1