期刊文献+

区分性训练在声纹密码中的新应用

Novel Application of Discriminative Training in Vocal Password
下载PDF
导出
摘要 在声纹密码任务中由于数据稀疏的问题难以实现区分性训练,本文以一种表征距离度量的特征矢量为基础提出新的声纹密码区分性系统框架,对正反例样本的新特征矢量实现了基于最小分类错误准则的区分性训练,将声纹密码从确认问题转化为二类分类问题。在自由说话风格的60人数据集上,声纹密码区分性系统与混合高斯模型-通用背景模型(Gaussian mixture model-universal background model,GMM-UBM)系统融合后等错误率为4.48%,相对GMM-UBM,动态时间规划(Dynamic time warping,DTW)基线系统性能分别提升了17.95%和59.68%。 Due to data sparsity, discriminative training has not been successfully applied to the system of vocal password up to now. Therefor, a novel vocal password framework based on a specific pre-processing strategy is proposed. The new feature is used to represent the distance measure and the problem caused by data sparsity can be solved to some extent. As a consequence, the vocal password is actually transferred from verification to binary classification and the discriminative training of two class models is sueeessfully accomplished on the minimum classification error criteria. After fusing the discriminative system with Gaussian mixture mod- el-universal background model(GMM-UBM) system, the equal error rate (EER) performance decreases to 4.48%, relatively 17.95% and 59.68% lower than the GMM-UBM and the dynamic time warping(DTW) system respectively on the corpus including 60 speakers. The experiment results show that the new application of discriminative training in the vocal password system is feasible and effective.
出处 《数据采集与处理》 CSCD 北大核心 2012年第4期404-409,共6页 Journal of Data Acquisition and Processing
基金 安徽省科技攻关(09120201003)资助项目
关键词 声纹密码 说话人确认 区分性训练 GMM—UBM vocal password speaker verification discriminative training GMM-UBM
  • 相关文献

参考文献13

  • 1Li Q, Juang B H, Zhou Q, et al. Automatic verbal information verification for user authentication [J]. IEEE Trans on Speech and Audio Processing, 2000, 8(5) : 585-596.
  • 2Ramasubramanian V, Das A, Kumar V P. Text-de- pendent speaker-recognition using one-pass dynamic programming algorithm [C]//Proceedings of ICAS- SP' 06. Toulouse, France.- IEEE, 2006.. 901-904.
  • 3Subramanyal A, Zheng Z, Surendran A C, et al. A generative framework using ensemble methods for text-dependent speaker verification[C]//Proceedings of ICASSP' 07. Dallas, Texas, USA: IEEE, 2007: 225-228.
  • 4Li S Z, Zhang D, Ma C, et al. Learning to boost GMM based speaker verification [C]//Proceedings of EuroSpeech 2003. Geneva, Switzerland: ISCA, 2003: 1677-1680.
  • 5Campbell W M, Sturim D E, Reynolds D A, et al. SVM based speaker verification using a GMM super- vector kernel and nap variability compensation [C]// Proceedings of ICASSP' 06. Toulouse, France: IEEE, 2006: 97-100.
  • 6Kenny P, Boulianne G, Ouellet P, et al. Speaker and session variability in GMM based speaker verifi- cation[J]. IEEE Trans on Audio, Speech and Lan- guage Processing, 2007, 15(4):1448-1460.
  • 7Kenny P, Boulianne G, Dumouchel P. Joint factor analysis versus eigen channels in speaker recognition[J]. IEEE Trans on Audio, Speech and Language Processing, 2007, 15 (4): 1435-1447.
  • 8Bilmes J A. A gentle tutorial of the EM algorithm and its application to parameter estimation for Gaus- sian mixture and hidden Markov models[R]. Tech- nical Report ICSI-TR-97-021, University of Califor- nia Berkeley, USA: 1997.
  • 9Povey D, Woodland P C. Minimum phone error and I-smoothing for improved discriminative training [C]//Proceedings of ICASSPr 02. Orlando, FL, USA: IEEE, 2002.- 105-108.
  • 10Juang B H, Hou W, Lee C H. Minimum classifica- tion error rate methods for speech recognition [J]. IEEE Trans on Speech and Audio Processing, 1997, 5(3): 257-265.

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部