
Multi-channel voice activity detection in low signal-to-noise ratio environment (cited by: 1)
Abstract Traditional voice activity detection (VAD) algorithms use only the time-frequency information of the signal, so their accuracy degrades in low signal-to-noise ratio (SNR) environments, especially when the noise is non-stationary. Multi-channel speech signals carry rich spatial information that can supplement the time-frequency information and thereby improve detection accuracy. Building on research into multi-channel spatial features, this paper uses the covariance matrix of the received array signals to propose a new multi-channel VAD algorithm based on the covariance matrix maximum eigenvalue (CMME). First, the maximum eigenvalue of the covariance matrix of each frame is extracted as the detection feature and used to track the speech signal. Then a double-threshold method decides whether the current frame is a speech frame. Experiments on the VCTK corpus and a laboratory corpus show that, compared with the Mel energy ratio algorithm and the improved energy zero-entropy algorithm, the proposed algorithm achieves higher detection accuracy and is more robust at a low SNR of -5 dB and in non-stationary noise.
Authors XIAO Si (肖思); GONG Jie (龚杰); LI Baoqing (李宝清) (School of Microelectronics, University of Chinese Academy of Sciences, Beijing 100049, China; Key Laboratory of Microsystem Technology, Shanghai Institute of Microsystem and Information Technology, Chinese Academy of Sciences, Shanghai 201800, China)
Source Journal of University of Chinese Academy of Sciences (《中国科学院大学学报(中英文)》), 2023, Issue 5, pp. 687-693 (7 pages). Indexed in CAS, CSCD, and the Peking University Core list.
Funding Supported by the Fund of the Key Laboratory of Microsystem Technology (6142804200408).
Keywords voice activity detection; microphone array; covariance matrix; low signal-to-noise ratio
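
The two steps named in the abstract (a per-frame maximum eigenvalue of the multi-channel covariance matrix, followed by a double-threshold decision) can be sketched in Python with NumPy. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not give the frame length, hop size, whether the covariance is computed in the time or frequency domain, or how the two thresholds are chosen. Here the covariance is a time-domain sample covariance, the function names `cmme_per_frame` and `double_threshold` are hypothetical, the thresholds are set as multiples of a noise floor estimated from the first few frames, and the test signal is synthetic.

```python
import numpy as np

def cmme_per_frame(x, frame_len=512, hop=256):
    """Largest eigenvalue of each frame's channel covariance matrix.

    x has shape (channels, samples). frame_len and hop are assumed
    values; the abstract does not specify them.
    """
    n_ch, n_samp = x.shape
    n_frames = 1 + (n_samp - frame_len) // hop
    feats = np.empty(n_frames)
    for i in range(n_frames):
        frame = x[:, i * hop : i * hop + frame_len]
        # Sample covariance across channels (signals assumed zero-mean).
        cov = frame @ frame.T / frame_len
        # eigvalsh returns eigenvalues in ascending order for symmetric input.
        feats[i] = np.linalg.eigvalsh(cov)[-1]
    return feats

def double_threshold(feats, low, high):
    """Assumed hysteresis rule: a run of frames above `low` is marked as
    speech only if at least one frame in the run also exceeds `high`."""
    speech = np.zeros(len(feats), dtype=bool)
    above_low = feats > low
    i, n = 0, len(feats)
    while i < n:
        if above_low[i]:
            j = i
            while j < n and above_low[j]:
                j += 1
            if feats[i:j].max() > high:
                speech[i:j] = True
            i = j
        else:
            i += 1
    return speech

# Synthetic 4-channel example: independent noise on every channel plus a
# tone that reaches all channels with zero delay (an idealised source).
rng = np.random.default_rng(0)
fs, n_ch = 16000, 4
noise = 0.1 * rng.standard_normal((n_ch, fs))
source = np.zeros(fs)
source[6000:10000] = np.sin(2 * np.pi * 440 * np.arange(4000) / fs)
x = noise + source  # the source is broadcast to all channels

feats = cmme_per_frame(x)
floor = feats[:5].mean()  # noise floor from leading frames (assumption)
mask = double_threshold(feats, low=3 * floor, high=10 * floor)
```

Because the source is coherent across channels while the noise is not, the source energy concentrates in the largest eigenvalue, which is why this feature separates active frames from noise-only frames in the sketch. How well this carries over to real recordings depends on choices the abstract leaves open.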
