
Combination of Model and Energy Based VAD Algorithm in Speaker Recognition System
Original title: 话者识别中结合模型和能量的语音激活检测算法 (cited by: 1)
Abstract: Voice activity detection (VAD) is an algorithm that detects the start and end points of speech. Selecting appropriate speech segments for speaker model enrollment and testing has a large effect on the performance of a speaker recognition system. This paper combines an energy-based VAD method with a model-based VAD method to detect speech. On the NIST 2006 SRE core test set, a system using the proposed algorithm achieves up to a 19% EER reduction over a traditional energy-based system.
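The abstract does not say how the two detectors are fused, so the following is only a minimal sketch of one possible combination, not the authors' method. It assumes non-overlapping 160-sample frames, an energy threshold placed a fixed margin below the loudest frame, per-frame log-likelihood ratios (speech model vs. non-speech model) supplied by the caller, and a simple AND rule. All function names, parameters, and default values are hypothetical.

```python
import math


def frame_log_energies(samples, frame_len=160):
    """Return the log energy (dB) of each non-overlapping frame.

    frame_len=160 is an assumed value, not one taken from the paper.
    """
    energies = []
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        power = sum(x * x for x in frame) / frame_len
        # The small floor avoids log10(0) on digitally silent frames.
        energies.append(10.0 * math.log10(power + 1e-10))
    return energies


def energy_vad(log_energies, margin_db=30.0):
    """Mark a frame as speech if it lies within margin_db of the loudest frame."""
    if not log_energies:
        return []
    threshold = max(log_energies) - margin_db
    return [e > threshold for e in log_energies]


def model_vad(llr_scores, llr_threshold=0.0):
    """Mark a frame as speech if its log-likelihood ratio exceeds the threshold.

    llr_scores are assumed to come from some speech/non-speech model
    that is outside the scope of this sketch.
    """
    return [s > llr_threshold for s in llr_scores]


def combined_vad(log_energies, llr_scores, margin_db=30.0, llr_threshold=0.0):
    """Assumed fusion rule: keep a frame only if both detectors call it speech."""
    if len(log_energies) != len(llr_scores):
        raise ValueError("energy and model scores must cover the same frames")
    by_energy = energy_vad(log_energies, margin_db)
    by_model = model_vad(llr_scores, llr_threshold)
    return [a and b for a, b in zip(by_energy, by_model)]


# Three frames: silence, a loud segment, silence.
signal = [0.0] * 160 + [0.5] * 160 + [0.0] * 160
energies = frame_log_energies(signal)
decisions = combined_vad(energies, llr_scores=[1.0, 2.0, -1.0])
```

In this toy input the model score alone would accept the first frame, but the energy detector rejects it, so only the middle frame is kept. An AND rule trades recall for purity of the selected speech; other fusion rules, such as a weighted score, are equally plausible readings of the abstract.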
Authors: ZHANG Zhao (章钊), GUO Wu (郭武)
Source: Journal of Chinese Computer Systems (《小型微型计算机系统》), CSCD, PKU Core, 2010, No. 9, pp. 1914-1917 (4 pages)
Funding: Supported by the National Natural Science Foundation of China (60970161)
Keywords: voice activity detection; speaker recognition; support vector machine; nuisance attribute projection
