期刊文献+

Locality Preserving Discriminant Projection for Speaker Verification 被引量:1

Locality Preserving Discriminant Projection for Speaker Verification
下载PDF
导出
摘要 In this paper, a manifold subspace learning algorithm based on locality preserving discriminant projection (LPDP) is used for speaker verification. LPDP can overcome the deficiency of the total variability factor analysis and locality preserving projection (LPP). LPDP can effectively use the speaker label information of speech data. Through optimization, LPDP can maintain the inherent manifold local structure of the speech data samples of the same speaker by reducing the distance between them. At the same time, LPDP can enhance the discriminability of the embedding space by expanding the distance between the speech data samples of different speakers. The proposed method is compared with LPP and total variability factor analysis on the NIST SRE 2010 telephone-telephone core condition. The experimental results indicate that the proposed LPDP can overcome the deficiency of LPP and total variability factor analysis and can further improve the system performance. In this paper, a manifold subspace learning algorithm based on locality preserving discriminant projection (LPDP) is used for speaker verification. LPDP can overcome the deficiency of the total variability factor analysis and locality preserving projection (LPP). LPDP can effectively use the speaker label information of speech data. Through optimization, LPDP can maintain the inherent manifold local structure of the speech data samples of the same speaker by reducing the distance between them. At the same time, LPDP can enhance the discriminability of the embedding space by expanding the distance between the speech data samples of different speakers. The proposed method is compared with LPP and total variability factor analysis on the NIST SRE 2010 telephone-telephone core condition. The experimental results indicate that the proposed LPDP can overcome the deficiency of LPP and total variability factor analysis and can further improve the system performance.
作者 Chunyan Liang Wei Cao Shuxin Cao Chunyan Liang;Wei Cao;Shuxin Cao(College of Computer Science and Technology, Shandong University of Technology, Zibo, China)
出处 《Journal of Computer and Communications》 2020年第11期14-22,共9页 电脑和通信(英文)
关键词 Speaker Verification Locality Preserving Discriminant Projection Locality Preserving Projection Manifold Learning Total Variability Factor Analysis Speaker Verification Locality Preserving Discriminant Projection Locality Preserving Projection Manifold Learning Total Variability Factor Analysis
  • 相关文献

参考文献3

二级参考文献12

  • 1Campbell W M, Sturim D E, and Reynolds D A, et al.. SVM based speaker verification using a GMM supervector kernel and NAP variability compensation [C]. Proc ICASSP 2006, Toulouse, France. 2006, Vol. 1: 97-100.
  • 2Solomonoff A, Campbell W M, and Boardman I. Advances in channel compensation for SVM speaker recognition [C]. Proc. ICASSP 2005, Philadelphia, USA. 2005, Vol. 1: 629-632.
  • 3Reynolds D A, Quatieri T F, and Dunn R, B. Speaker verification using adapted Gaussian mixture models [J]. Digital Signal Processing, 2000, 10(3): 19-41.
  • 4Kenny P, Boulianne G, Ouellet P, and Dumouchel P. Speaker and session variability in GMM-based speaker verification [J]. IEEE Trans. on Audio, Speech and Language Processing, 2007, 15(4): 1448-1460.
  • 5Vogt R, Baker B, and Sridharan S. Modeling session variability in text-independent speaker verification [C]. Proc. Interspeech2005, Lisbon, Portugal. 2005: 3117-3120.
  • 6Kenny P, Mihoubi M, and Dumouchel P. New MAP estimators for speaker recognition [C]. Proc. Eurospeech 2003, Geneva, Switzerland, 2005: 2964-2967.
  • 7Kenny P, Boulianne G, and Dumouchel P. Eigenvoice modeling with sparse training data [J]. IEEE Trans. on Speech and Audio, 2005, 13(3): 345-354.
  • 8Collobert R. SVMTorch: A support vector machine for large-scale regression and classification problems[EB/OL]. Available at: http://bengio.abracadoudou.com/projects/ SVMTorch.htm].
  • 9NIST, The NIST Year 2006 speaker recognition evaluation plan[EB/OL]. Available at: http://www.nist.gov/speech /tests/spk/2006/sre-06_ evalplan-v9.pdf.
  • 10Matejka P, Burget L, and Schwarz P, et al.. STBU system for the NIST 2006 speaker recognition evaluation. Proc. ICASSP 2007, Hawaii, USA. 2007, Vol. 4: 221-224.

共引文献10

同被引文献4

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部