话者识别中结合模型和能量的语音激活检测算法被引量：1

Combination of Model and Energy Based VAD Algorithm in Speaker Recognition System

下载PDF

导出

摘要语音激活检测是检测语音起始终止端点的一种算法,合适地选择语音来进行说话人模型的注册和测试对话者识别系统的性能有很大影响.本文将基于能量的语音激活检测算法与基于模型的算法相结合来检测语音,在N IST2006核心测试数据集上,采用本文算法的系统相对于传统基于能量的方法性能最多有19%的提升. Voice activity detection（VAD） is an algorithm to detect the voice endpoint.It can affect the performance of speaker recognition system greatly.In this paper,we combine the energy-based VAD method and the model-based VAD method to detect the voice endpoint.On the NIST 2006 SRE corpus,the proposed VAD algorithm can obtain 19% EER reduction over the traditional energy-based system at most.

作者章钊郭武

机构地区中国科学技术大学电子工程与信息科学系科大讯飞语音实验室

出处《小型微型计算机系统》 CSCD 北大核心 2010年第9期1914-1917,共4页 Journal of Chinese Computer Systems

基金国家自然科学基金项目(60970161)资助

关键词语音激活检测说话人识别支持向量机扰属性投影 voice activity detection speaker recognition support vector machine nuance attribute projection

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献10

1Li Qi, Zheng Jing-song, Augustine Tsai, et al. Robust endpoint detection and energy normalization for real-time speech and speaker recognition[J]. IEEE Trans Speech and Audio, 2002, 10(3): 146-157.
2Sahar E Bou-Ghazale, Khaled Assaleh. A robust cndpoint detection of speech for noisy environments with application to automatic speech recognition[ C]. Proc IEEE ICASSP 02, 2002,3808-3811.
3ITU-T Recommendation G. 729 -Annex B:a silence compression scheme for G. 729 optimized for terminals conforming to recommendation V. 70[Z]. 1996.
4Huang Liang-sheng, Chung-ho Yung. A novel approach to robust speech endpoint detection in car environments [C ]. Proc. ICASSP, Istanbul, 2000: 1751-1754.
5Jongseo Solm, Nam Soo Kim, Wonyong Sung. A statistical model-based voice activity detection[ C]. Proc. IEEE,1999, 6, ( 1 ) : 1-3.
6Lori F Lamel, Lawrence R Rabiner, Aaron E Rosenberg,et al. An improved endpoint detector for isolated word recognition [ C ]. Proc. IEEE, 1981,777-785.
7Douglas A Reynolds, Thomas F Quaffed , Robert B Dunn. Speaker verification using adapted gaussian mixture models [ A ]. Digital Signal Processing 10[ M]. Academic Press, 2000.
8Hermansky H, Morgan N, Bayya A, et al. RASTA-PLP speech analysis [ R ]. In ICSI Technical Report TR-914)69, Berkeley, California.
9Campbell W M, Sturim D E, Reynolds D A. Support vector machines using GMM supervectors for speaker vedfication[ C]. IEEE Signal Processing Letters, 2006,308 -311.
10Alex Solomonoff, Carl Quillen, William M Campbell. Channel compensation for SVM speaker recognition [ C ]. Proc. ICASSP 05, 2005,629-632.

同被引文献10

1张仁志,崔慧娟.基于短时能量的语音端点检测算法研究[J].电声技术,2005,29(7):52-54. 被引量：45
2Lamel L, Rabiner L, Rosenberg A, et al. An improved endpoint detector for isolated word recognition [ J ]. IEEE Transactions on Acoustics Speech & Signal Processing, 1981,29(4) :777- 785.
3Wu J, Zhang X L. An efficient voice activity detection algo- rithm by combining statistical model and energy detection [ J ]. Journal on Advances in Signal Processing,2011 (2):150- 154.
4Zhang X L, Wu J. Denoising deep neural networks based voice activity detection [ C ]//Proc of international conference on a- coustics, speech, and signal processing. [ s. l. ] : [ s. n. ], 1988 : 853-857.
5Reddy A M, Raj B. Soft mask methods for single- channel speaker separation[ J]. IEEE Transactions on Audio Speech & Language Processing ,2007,15 (6) : 1766-1776.
6雷建军,杨震,刘刚,郭军.基于复高斯混合模型的鲁棒VAD算法[J].天津大学学报,2009,42(4):353-356. 被引量：2
7朱杰,韦晓东.噪声环境中基于HMM模型的语音信号端点检测方法[J].上海交通大学学报,1998,32(10):14-16. 被引量：12
8周明忠,吉立新.基于平均幅度和加权过零率的VAD算法及其FPGA实现[J].信息工程大学学报,2010,11(6):713-718. 被引量：3
9黎林,朱军.基于小波分析与神经网络的语音端点检测研究[J].电子测量与仪器学报,2013,27(6):528-534. 被引量：26
10孙战先,储飞黄,王江.一种自适应语音端点检测算法[J].计算机工程与应用,2014,50(1):206-210. 被引量：6

引证文献1

1腾潇琦,冯祥,张翼飞.一种自适应建模的VAD方法[J].计算机技术与发展,2016,26(9):26-29. 被引量：1

二级引证文献1

1林琴,涂铮铮,王庆伟,郭玉堂.一种基于近邻传播聚类的语音端点检测方法[J].安徽大学学报（自然科学版）,2019,43(3):27-32. 被引量：3

1李光源,崔慧娟,唐昆.一种基于噪声估计的语音激活检测算法[J].信息技术,2011,35(10):5-8. 被引量：1
2张金榜,尹冬梅.基于统计模型的语音激活检测算法改进[J].微型机与应用,2015,34(12):14-16. 被引量：1
3柳燕,鲍长春.基于竞争网络的语音激活检测研究[J].信号处理,2006,22(1):57-60. 被引量：2
4陈建涛,陈维娜.基于文本无关的话者识别技术综述[J].电脑知识与技术,2016,0(1):189-191. 被引量：1
5王陈春.一种短波语音激活检测算法[J].中国电子商情（科技创新）,2013(16):5-5.
6周燕,刘韬.基于小波神经网络的话者识别系统研究[J].烟台职业学院学报,2008,14(2):57-61.
7王蕾.噪声环境下话者识别系统的特征提取[J].电脑知识与技术,2008(8):784-785.
8徐治.三门限多级判决语音激活检测算法的研究[J].电子技术（上海）,2015,42(5):33-35. 被引量：1
9Word XP使用技巧三则[J].科技展望（幻想大王）,2005(03X):14-14.
10黄啸,浦小祥.SVM核函数的研究及其在语音激活检测中的应用[J].苏州大学学报（工科版）,2008,28(3):56-59. 被引量：3

小型微型计算机系统

2010年第9期

浏览历史

内容加载中请稍等...

话者识别中结合模型和能量的语音激活检测算法被引量：1

参考文献10

同被引文献10

引证文献1

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

话者识别中结合模型和能量的语音激活检测算法 被引量：1

参考文献10

同被引文献10

引证文献1

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

话者识别中结合模型和能量的语音激活检测算法被引量：1