Locality Preserving Discriminant Projection for Speaker Verification 被引量：1

Locality Preserving Discriminant Projection for Speaker Verification

下载PDF

导出

摘要 In this paper, a manifold subspace learning algorithm based on locality preserving discriminant projection (LPDP) is used for speaker verification. LPDP can overcome the deficiency of the total variability factor analysis and locality preserving projection (LPP). LPDP can effectively use the speaker label information of speech data. Through optimization, LPDP can maintain the inherent manifold local structure of the speech data samples of the same speaker by reducing the distance between them. At the same time, LPDP can enhance the discriminability of the embedding space by expanding the distance between the speech data samples of different speakers. The proposed method is compared with LPP and total variability factor analysis on the NIST SRE 2010 telephone-telephone core condition. The experimental results indicate that the proposed LPDP can overcome the deficiency of LPP and total variability factor analysis and can further improve the system performance. In this paper, a manifold subspace learning algorithm based on locality preserving discriminant projection (LPDP) is used for speaker verification. LPDP can overcome the deficiency of the total variability factor analysis and locality preserving projection (LPP). LPDP can effectively use the speaker label information of speech data. Through optimization, LPDP can maintain the inherent manifold local structure of the speech data samples of the same speaker by reducing the distance between them. At the same time, LPDP can enhance the discriminability of the embedding space by expanding the distance between the speech data samples of different speakers. The proposed method is compared with LPP and total variability factor analysis on the NIST SRE 2010 telephone-telephone core condition. The experimental results indicate that the proposed LPDP can overcome the deficiency of LPP and total variability factor analysis and can further improve the system performance.

作者 Chunyan Liang Wei Cao Shuxin Cao Chunyan Liang;Wei Cao;Shuxin Cao(College of Computer Science and Technology, Shandong University of Technology, Zibo, China)

机构地区 College of Computer Science and Technology

出处《Journal of Computer and Communications》 2020年第11期14-22,共9页 电脑和通信（英文）

关键词 Speaker Verification Locality Preserving Discriminant Projection Locality Preserving Projection Manifold Learning Total Variability Factor Analysis Speaker Verification Locality Preserving Discriminant Projection Locality Preserving Projection Manifold Learning Total Variability Factor Analysis

分类号 O17 [理学—基础数学]

引文网络
相关文献

参考文献3

1CHEN Chen,HAN Jiqing.Partial Least Squares Based Total Variability Space Modeling for I-Vector Speaker Verification[J].Chinese Journal of Electronics,2018,27(6):1229-1233. 被引量：4
2郭武,戴礼荣,王仁华.采用因子分析和支持向量机的说话人确认系统[J].电子与信息学报,2009,31(2):302-305. 被引量：5
3梁春燕,袁文浩,李艳玲,夏斌,孙文珠.基于判别邻域嵌入算法的说话人识别[J].电子与信息学报,2019,41(7):1774-1778. 被引量：4

二级参考文献12

1Campbell W M, Sturim D E, and Reynolds D A, et al.. SVM based speaker verification using a GMM supervector kernel and NAP variability compensation [C]. Proc ICASSP 2006, Toulouse, France. 2006, Vol. 1: 97-100.
2Solomonoff A, Campbell W M, and Boardman I. Advances in channel compensation for SVM speaker recognition [C]. Proc. ICASSP 2005, Philadelphia, USA. 2005, Vol. 1: 629-632.
3Reynolds D A, Quatieri T F, and Dunn R, B. Speaker verification using adapted Gaussian mixture models [J]. Digital Signal Processing, 2000, 10(3): 19-41.
4Kenny P, Boulianne G, Ouellet P, and Dumouchel P. Speaker and session variability in GMM-based speaker verification [J]. IEEE Trans. on Audio, Speech and Language Processing, 2007, 15(4): 1448-1460.
5Vogt R, Baker B, and Sridharan S. Modeling session variability in text-independent speaker verification [C]. Proc. Interspeech2005, Lisbon, Portugal. 2005: 3117-3120.
6Kenny P, Mihoubi M, and Dumouchel P. New MAP estimators for speaker recognition [C]. Proc. Eurospeech 2003, Geneva, Switzerland, 2005: 2964-2967.
7Kenny P, Boulianne G, and Dumouchel P. Eigenvoice modeling with sparse training data [J]. IEEE Trans. on Speech and Audio, 2005, 13(3): 345-354.
8Collobert R. SVMTorch: A support vector machine for large-scale regression and classification problems[EB/OL]. Available at: http://bengio.abracadoudou.com/projects/ SVMTorch.htm].
9NIST, The NIST Year 2006 speaker recognition evaluation plan[EB/OL]. Available at: http://www.nist.gov/speech /tests/spk/2006/sre-06_ evalplan-v9.pdf.
10Matejka P, Burget L, and Schwarz P, et al.. STBU system for the NIST 2006 speaker recognition evaluation. Proc. ICASSP 2007, Hawaii, USA. 2007, Vol. 4: 221-224.

共引文献10

1朱秉诚,吴乐南,王伟.基于叩齿声音的身份确认方法[J].模式识别与人工智能,2013,26(2):182-188.
2柳欣,李鹤洋,钟必能,杜吉祥.结合有监督联合一致性自编码器的跨音视频说话人标注[J].电子与信息学报,2018,40(7):1635-1642. 被引量：2
3梁春燕,袁文浩,李艳玲,夏斌,孙文珠.基于判别邻域嵌入算法的说话人识别[J].电子与信息学报,2019,41(7):1774-1778. 被引量：4
4梁春燕,曹伟.基于邻域保持嵌入算法的语种识别[J].陕西师范大学学报（自然科学版）,2020,48(2):38-42. 被引量：3
5吕志超,王好忠,白一奇.流形学习在浅海水声通信中的应用[J].电子与信息学报,2021,43(3):767-772. 被引量：1
6杨治学,黄浩,胡英,吾守尔·斯拉木.基于深度神经网络的说话人年龄分类研究[J].现代电子技术,2021,44(10):120-124.
7罗春梅.基于改进MFCC与RCNN的说话人识别算法[J].数学的实践与认识,2021,51(17):102-110. 被引量：6
8陈志高,李鹏,肖润秋,黎塔,王文超.文本无关说话人识别的一种多尺度特征提取方法[J].电子与信息学报,2021,43(11):3266-3271. 被引量：4
9肜娅峰,陈晨,陈德运,何勇军.基于贝叶斯主成分分析的i-vector说话人确认方法[J].电子学报,2021,49(11):2186-2194. 被引量：2
10陈晨,韩纪庆,陈德运,何勇军.文本无关说话人识别中句级特征提取方法研究综述[J].自动化学报,2022,48(3):664-688. 被引量：4

同被引文献4

1刘嘉敏,王会岩,周晓莉,罗甫林.基于改进的等距离映射算法的人脸识别[J].计算机应用,2013,33(1):76-79. 被引量：4
2李昌华,李智杰,高阳.图谱和Kuhn-Munkres算法在图匹配中的应用研究[J].计算机工程与科学,2017,39(10):1896-1900. 被引量：8
3娄雪,闫德勤,王博林,王族.一种改进的邻域保持嵌入算法[J].计算机科学,2018,45(B06):255-258. 被引量：2
4李元,黄莹莹.改进LNS和邻域保持嵌入算法的研究[J].计算机应用与软件,2021,38(2):250-257. 被引量：3

引证文献1

1徐剑豪,胡文军,胡天杰,王哲昀.基于最近邻子空间的邻域保持嵌入[J].湖州师范学院学报,2022,44(10):43-51.

1张恩豪,陈晓红.基于一致判别相关分析的低分辨率人脸识别算法[J].数据采集与处理,2020,35(6):1163-1173. 被引量：4
2高宇,刘跃娟.基于数据多样性的判别多流形降维方法的研究[J].自动化与仪器仪表,2020(4):30-34. 被引量：2
3原志明,林翔.基于子空间学习刮板输送机减速器轴承变工况故障诊断[J].煤炭科学技术,2019(S02):64-67. 被引量：3
4Soumana Oumar Traoré,Cheickna Sylla,Saleck Doumbia,Alou Samaké,Amadou Bocoum,Seydou Fané,Rokiatou Torian Sangaré,Fatoumata Keita,Ibrahima Tegueté,Youssouf Traoré,Niani Mounkoro,Mamadou Traoré,Amadou Ingré Dolo.Epidemiology, the Main Reasons and Maternal-Fetal Complications of Unassisted Childbirth in the Health District of V Bamako Commune, Mali[J].Open Journal of Obstetrics and Gynecology,2020,10(10):1381-1395.
5Ailian Chen,Leilei Zhu,Huaijuan Zang,Zhenglong Ding,Shu Zhan.Computer-aided diagnosis and decision-making system for medical data analysis: A case study on prostate MR images[J].Journal of Management Science and Engineering,2019,4(4):266-278.
6李尚锋.浅谈“SLS”最新流行音乐唱法的歌唱技巧[J].戏剧之家,2021(1):66-67. 被引量：2
7Qian Wang,YueWu,Jianxin Zhang,Hengbo Zhang,Chao Che,Lin Shan.Supervised Deep Second-Order Covariance Hashing for Image Retrieval[J].国际计算机前沿大会会议论文集,2020(1):476-487.
8Tingting Yi,Yue‑Hua Sun,Wei Liang.Nestling discrimination and feeding habits during brooding of Chestnut Thrushes[J].Avian Research,2020,11(2):150-156.
9Tuoc Phan,Yannick Sire.On Well-Posedness of 2D Dissipative Quasi-Geostrophic Equation in Critical Mixed Norm Lebesgue Spaces[J].Analysis in Theory and Applications,2020,36(2):111-127.
10Gligor Bojkov,Sasa Mitrev,Emilija Arsov.Determination on Microclimatic Conditions at Vines upon Development on Gray Mold (<i>Botrytis cinerea</i>)[J].Agricultural Sciences,2020,11(11):1007-1016.

Journal of Computer and Communications

2020年第11期

浏览历史

内容加载中请稍等...