期刊文献+

声码器半解码参数用于说话人身份确认 被引量:2

Research on Speaker Verification With Half-Decoded Parameters of Vocoder
下载PDF
导出
摘要 面向通信领域广泛使用的线性预测声码器,设计了一种不经过“解码-特征提取”过程,而直接由传输码流截取说话人特征的方法,并针对宽带自适应多码率声码器(AMRWB)建立了与文本无关的话者确认系统.系统采用基于概率统计模型的GMMUBM结构,以LPC倒谱作为主要的话者特征矢量,并加入基音衍生参数以提高确认性能.实验表明,该系统在运算速度提高一个数量级的情况下,达到了与基于重建语音的话者确认系统相接近的性能,且对码率失配具有良好的鲁棒性. A feature extraction method is designed for linear predict vocoders widely used in the communication field. In this method, feature vectors are extracted not from the decoded waveform, but from the bit stream of transmission directly. Specifically for Wideband Adaptive Multi Rate vocoder (AMR-WB), we implemented a text-independent speaker verification system. Which employs the probability-statistics-based GMM-UBM framework as speaker model and takes LPC cepstrum and pitch derived parameters as feature vectors. Experiments indicate that the half-decoded based system, which runs ten times faster than the decoded-based system, is capable of similar performance to the latter, and shows robustness for code rate mismatch of AMR-WB.
出处 《中国科学技术大学学报》 CAS CSCD 北大核心 2005年第4期523-529,共7页 JUSTC
基金 国家自然科学基金(6027039) 安徽省自然科学基金(01042205)资助项目.
关键词 话者确认 半解码参数 基音频率 GMM-UBM AMR-WB编码 speaker verification half-decoded parameter pitch GMM-UBM AMR-WB codec
  • 相关文献

参考文献9

  • 1Besacier L, Mayorga P, Bonastre J F, et al.Overview of compression and packet loss effects in speech biometrics[J]. IEEE Proc.Image and Signal Processing, 2003,150.. 372-376.
  • 2Dtmn R B, Quatieri T F, Reynolds D A, etal. Speaker recognition from coded speech and the effects of score normalization, signals[J]. Systems and Computers, 2001, 2 : 1562-1 567.
  • 3Bessette B, Salami R, Lefebvre R, et al.The adaptive multirate wideband speech codec (AMR-WB) [J]. IEEE Trans. Speechand Audio Processing, 2002, 10:8.
  • 4AMR Wideband Speech Codec; Transcoding functions[S], 3GPP TS 26. 190. 510, 2001.
  • 5AMR Wideband Speech Codec; Frame Structure[S], 3GPP TS 26. 201. 500, 2001.
  • 6Sonmez K, Shriberg E, Heek L, et al.Modeling dynamic prosodic variation for speaker verification[J]. Proe. Intk Conf. on Spoken Language Processing, 1998,7 : 3 189-3 192.
  • 7Reynolds D A, Quatieri T F, Dunn R B.Speaker verification using adapted gaussian mixture models[J]. Digital Signal Processing, 2000,10:19-41.
  • 8Reynolds D A, Rose R C. Robust text-independent speaker identification using Gaussian mixture speaker models [J]. IEEE Trans.Speech and Audio Processing, 1995,3: 72-83.
  • 9Hermansky H, Morgan N. RASTA processing of speech[J]. IEEE Trans. Speech and Audio Processing, 1994, 2: 578-589.

同被引文献15

  • 1PETRACCA M,SERVETTI A,DEMARTIN J C. Performance analysis of compressed-domain automatic speaker recognition as a function of speech coding technique and bit rate [C]//Pmceedings of International Conference on Multimedia and Expo (ICME).Toronto : [s.n.], 2006 : 1393- 1396.
  • 2DUNN R B, QUATIERI T F, RENOLDS D A, et al. Speaker recognition from coded speech and the effects of score normalization [C]//Proceedings of the 35th Asilomar Conference on Signals, Systems and Computers. Pacific Grove: [s.n.],2001: 1562-1567.
  • 3ITU. ITU-T Recommendation G.729-1996 Coding of speech at 8 kbit/s using conjugate-structure algebraic-codeexcited linear-prediction (CS-ACELP)[S]. Helsinki: WTSC Resolution, 1996.
  • 4ITU-T Recommendation G.723.1-1996. Dual rate speech coder for multimedia communications transmitting at 5.3 and 6.3 kbit/s [S]. Helsinki: WTSC Resolution, 1996.
  • 5唐晖.VoIP说话人识别技术研究[D].郑州:解放军信息工程大学,2008.
  • 6CAMPELL W M. Generalized linear discriminant sequence kernels for speaker recognition [C]//Proceedings of ICASSP. Orlando: [s.n.], 2002 ( 1 ) : 161-164.
  • 7CAMPELL W M, CAMPBELL J P, REYNOLDS D A, et al. Torres-Carrasquillo, support vector machines for speaker and language recognition [J]. Computer Speech and Language, 2005(8) :213-232.
  • 8Frédéric Bimbot.A tutorial on text-independent speaker verification[J].Eurasip journal on applied signal processing,2004,(4):430-451.
  • 9Besacier L,Mayorga P.Overview of compression and packet loss effectsin speech biometrics[C]//Proc.Vision,Image and Signal Processing.2003:372-376.
  • 10Grassi S,Besacier L.Influence of gsm speech coding on the performance of text-independent speaker recognition[C]//Proc.EUSIPCO'00.2000:437-440.

引证文献2

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部