采用因子分析和支持向量机的说话人确认系统被引量：5

Speaker Verification Based on Factor Analysis and SVM

下载PDF

导出

摘要在文本无关的说话人识别中,采用均值超向量作为特征向量的支持向量机系统性能已经超过了传统的混合高斯-通用背景模型系统,但是信道的影响在均值超向量上仍然存在。该文对因子分析算法进行修改后,可以解决均值超向量的信道问题,能够取得优于扰动属性映射的性能,更重要的是采用因子分析的系统的稳定性可以得到保证。在NIST 2006说话人测试数据库上,利用该文的方法能够取得等错误率6.0%。 In the text-independent speaker recognition system, the mean-supervector of Gaussian Mixture Models （GMM） and Support Vector Machine （SVM） system can outperform the traditional GMM and Universal Background Models （UBM） system, but the session variability is still one of the most important reasons that deteriorate the performance. In this paper, the factor analysis is tailored to solve the session variability problem of GMM mean-supervector. The proposed algorithm can outperform the Nuisance Attribute Projection （NAP） algorithm. Furthermore, the proposed system based on factor analysis is more stable than the system based on NAP. In the NIST 2006 SRE corpus, the Equal Error Rate （EER） of the proposed system can obtain 6.0%.

作者郭武戴礼荣王仁华

机构地区中国科技大学电子工程与信息科学系

出处《电子与信息学报》 EI CSCD 北大核心 2009年第2期302-305,共4页 Journal of Electronics & Information Technology

关键词说话人确认超向量联合因子分析扰动属性映射 Speaker verification Supervector Joint factor analysis Nuisance Attribute Projection（NAP）

分类号 TN912.34 [电子电信—通信与信息系统]

引文网络
相关文献

参考文献10

1Campbell W M, Sturim D E, and Reynolds D A, et al.. SVM based speaker verification using a GMM supervector kernel and NAP variability compensation [C]. Proc ICASSP 2006, Toulouse, France. 2006, Vol. 1: 97-100.
2Solomonoff A, Campbell W M, and Boardman I. Advances in channel compensation for SVM speaker recognition [C]. Proc. ICASSP 2005, Philadelphia, USA. 2005, Vol. 1: 629-632.
3Reynolds D A, Quatieri T F, and Dunn R, B. Speaker verification using adapted Gaussian mixture models [J]. Digital Signal Processing, 2000, 10(3): 19-41.
4Kenny P, Boulianne G, Ouellet P, and Dumouchel P. Speaker and session variability in GMM-based speaker verification [J]. IEEE Trans. on Audio, Speech and Language Processing, 2007, 15(4): 1448-1460.
5Vogt R, Baker B, and Sridharan S. Modeling session variability in text-independent speaker verification [C]. Proc. Interspeech2005, Lisbon, Portugal. 2005: 3117-3120.
6Kenny P, Mihoubi M, and Dumouchel P. New MAP estimators for speaker recognition [C]. Proc. Eurospeech 2003, Geneva, Switzerland, 2005: 2964-2967.
7Kenny P, Boulianne G, and Dumouchel P. Eigenvoice modeling with sparse training data [J]. IEEE Trans. on Speech and Audio, 2005, 13(3): 345-354.
8Collobert R. SVMTorch: A support vector machine for large-scale regression and classification problems[EB/OL]. Available at: http://bengio.abracadoudou.com/projects/ SVMTorch.htm].
9NIST, The NIST Year 2006 speaker recognition evaluation plan[EB/OL]. Available at: http://www.nist.gov/speech /tests/spk/2006/sre-06_ evalplan-v9.pdf.
10Matejka P, Burget L, and Schwarz P, et al.. STBU system for the NIST 2006 speaker recognition evaluation. Proc. ICASSP 2007, Hawaii, USA. 2007, Vol. 4: 221-224.

同被引文献15

1方昱春,王展.Your New Key:生物特征识别技术[J].自然杂志,2007,29(4):219-224. 被引量：3
2Jain A K, Li S Z. Handbook of Face Recognition. New York, USA: Springer-Verlag, 2005.
3Nomir O, Abdel-Mottaleb M. Human Identification from Dental X- Ray Images Based on the Shape and Appearance of the Teeth. IEEE Trans on Information Forensics and Security, 2007, 2(2) : 188-197.
4Hossan M A, Memon S, Gregory M A. A Novel Approach for MFCC Feature Extraction // Proc of the 4th International Conference on Signal Processing and Communication Systems. Gold Coast, Australia, 2010 : 1-5.
5Dan Zhiping, Zheng Sheng, Sun Shuifa, et al. Speaker Recognition Based on LS-SVM//Proc of the 3rd International Conference on In- novative Computing Information and Control. Dalian, China, 2008: 525 -528.
6Sun Hanwu. An Efficient Feature Selection Method for Speaker Recognition//Proc of the 6th International Symposium on Chinese Spoken Language Processing. Kunming, China, 2008 : 1-4.
7Li Shaomei, Guo Yunfei, Wei Hongquan. Speaker Recognition via Statistics of Acoustic Feature Distribution//Proc of the 1st International Conference on Multimedia Information Networking and Security. Wuhan, China, 2009:190-192.
8Zamalloa M, Bordel G, Rodrignez L J, et al. Feature Selection Based on Genetic Algorithms for Speaker Recognition // Proc of the Workshop on Speaker and Language Recognition. San Juan, USA, 2006:1-8.
9陈存宝,赵力.嵌入自联想神经网络的高斯混合模型说话人辨认[J].电子与信息学报,2010,32(3):528-532. 被引量：4
10梅晓丹,孙圣和.基于小波变换的静音与语音分割新算法[J].哈尔滨工业大学学报,2002,34(3):408-411. 被引量：12

引证文献5

1朱秉诚,吴乐南,王伟.基于叩齿声音的身份确认方法[J].模式识别与人工智能,2013,26(2):182-188.
2柳欣,李鹤洋,钟必能,杜吉祥.结合有监督联合一致性自编码器的跨音视频说话人标注[J].电子与信息学报,2018,40(7):1635-1642. 被引量：2
3梁春燕,袁文浩,李艳玲,夏斌,孙文珠.基于判别邻域嵌入算法的说话人识别[J].电子与信息学报,2019,41(7):1774-1778. 被引量：4
4陈志高,李鹏,肖润秋,黎塔,王文超.文本无关说话人识别的一种多尺度特征提取方法[J].电子与信息学报,2021,43(11):3266-3271. 被引量：3
5Chunyan Liang,Wei Cao,Shuxin Cao.Locality Preserving Discriminant Projection for Speaker Verification[J].Journal of Computer and Communications,2020,8(11):14-22. 被引量：1

二级引证文献10

1梁春燕,曹伟.基于邻域保持嵌入算法的语种识别[J].陕西师范大学学报（自然科学版）,2020,48(2):38-42. 被引量：3
2徐兵,石少青,陈超.基于自然语言的中文地址匹配研究[J].电子设计工程,2020,28(16):7-10. 被引量：4
3吕志超,王好忠,白一奇.流形学习在浅海水声通信中的应用[J].电子与信息学报,2021,43(3):767-772. 被引量：1
4韩圣亚,严莉,刘荫,徐浩,朱韶松.基于XML的自动化异构系统数据一致性校验方法[J].电子设计工程,2021,29(13):137-141. 被引量：2
5罗春梅.基于改进MFCC与RCNN的说话人识别算法[J].数学的实践与认识,2021,51(17):102-110. 被引量：6
6徐剑豪,胡文军,胡天杰,王哲昀.基于最近邻子空间的邻域保持嵌入[J].湖州师范学院学报,2022,44(10):43-51.
7荣玉军,方昳凡,田鹏,程家伟.基于知识蒸馏与ResNet的声纹识别[J].重庆大学学报,2023,46(1):113-124. 被引量：1
8李平,高清源,夏宇,张小勇,曹毅.基于SE-DR-Res2Block的声纹识别方法[J].工程科学学报,2023,45(11):1962-1969.
9宣茜,韩润萍,高静欣.基于Conformer的实时多场景说话人识别模型[J].计算机工程与应用,2024,60(7):147-156. 被引量：1
10Chunyan Liang,Wei Cao,Shuxin Cao.Locality Preserving Discriminant Projection for Speaker Verification[J].Journal of Computer and Communications,2020,8(11):14-22. 被引量：1

1龙艳花,郭武,戴礼荣.采用韵律特征的说话人确认系统[J].数据采集与处理,2010,25(1):76-80. 被引量：1
2李轶杰,郭武,戴礼荣.话者识别的信道补偿[J].小型微型计算机系统,2008,29(12):2344-2347. 被引量：7
3龙艳花,戴礼荣.采用M-矢量和支持向量机的说话人确认系统[J].华中科技大学学报（自然科学版）,2014,42(8):63-68. 被引量：2
4李晋,郭武,戴礼荣.联合因子分析算法中基于信号子空间的空间变换方法[J].模式识别与人工智能,2013,26(8):705-710. 被引量：2
5孙干超,王吉林.基于ARM的说话人识别系统的研究与实现[J].电子器件,2014,37(6):1151-1154. 被引量：2
6程开芳,程进,曹茜.红外探测器光谱响应测试数据库[J].实用测试技术,1998,24(5):20-22.
7宋彦,戴礼荣,王仁华.基于超向量子空间分析的自动语种识别方法[J].模式识别与人工智能,2010,23(2):165-170. 被引量：4
8郭武,李轶杰,戴礼荣,王仁华.采用非监督得分规整和因子分析的说话人确认[J].电子学报,2009,37(4):776-779. 被引量：1
9郭武,李轶杰,戴礼荣,王仁华.说话人识别中的因子分析以及空间拼接[J].自动化学报,2009,35(9):1193-1198. 被引量：14
10张建平,李明,索宏彬,杨琳,付强,颜永红.长时语音特征在说话人识别技术上的应用[J].声学学报,2010,35(2):267-269. 被引量：8

电子与信息学报

2009年第2期

浏览历史

内容加载中请稍等...

采用因子分析和支持向量机的说话人确认系统被引量：5

参考文献10

同被引文献15

引证文献5

二级引证文献10

相关作者

相关机构

相关主题

浏览历史

采用因子分析和支持向量机的说话人确认系统 被引量：5

参考文献10

同被引文献15

引证文献5

二级引证文献10

相关作者

相关机构

相关主题

浏览历史

采用因子分析和支持向量机的说话人确认系统被引量：5