基于极大似然线性回归的模型合成和特征映射进行说话人确认被引量：2

Speaker verification using speaker model synthesis and feature mapping based on maximum-likelihood linear regression

下载PDF

导出

摘要提出了基于极大似然线性回归(MLLR)调整的说话人模型合成和特征映射方法。MAP调整事后确定相应模型间线性关系,变换参数人为确定;而MLLR调整首先确定相应模型间线性关系,变换参数由训练数据确定,并且可以只调整均值向量。模型合成时,MLLR调整指定通用信道背景模型参数间的线性变换;特征映射时,MLLR调整指定Root GMM-UBM与通用信道背景模型参数间的线性变换。通过对模型参数进行分组调整,可以在训练数据和参数数目间达成平衡。实验结果表明,合适选取MLLR回归类,可以取得比相应MAP调整方法更好的识别效果。 This paper proposes new methods of speaker verification,which use speaker model synthesis（SMS） and feature mapping based on maximum-likelihood linear regression.MAP method determines a linear relationship among the corresponding models after adjustment and transformation parameters are determined artificially,while MLLR first identify a linear relationship among the corresponding models and transformation parameters are determined from the training data,also it can only adjust the mean vectors.In SMS,MLLR determines transformation parameters among different channel UBMs.In feature mapping,MLLR determines transformation parameters between Root GMM-UBM and the channel UBM.By grouping to the model parameters,it can reach a balance between the training data and the number of parameters.The experimental results show that MLLR adjustment can achieve better verification effect than MAP adjustment by selecting the appropriate classes of regression.

作者陈存宝赵力邹采荣

机构地区东南大学水声信号处理教育部重点实验室东南大学信息科学与工程学院

出处《声学学报》 EI CSCD 北大核心 2011年第1期81-87,共7页 Acta Acustica

基金国家自然科学基金(60872073 60975017 51075068) 江苏省自然科学基金(BK2008291)资助项目

关键词模型参数说话人确认特征映射线性回归极大似然合成 MLLR 线性关系 Mapping Maximum likelihood Metadata Regression analysis Speech recognition

分类号 TN912.34 [电子电信—通信与信息系统]

引文网络
相关文献

参考文献20

1Jayanna H S,,Mahadeva S R.Analysis,feature extraction, modeling and testing techniques for speaker recognition. IETE Technical Review . 2009
2Reynolds D A.Channel robust speaker verification via feature mapping. Proceedings of ICASSP . 2003
3Ferras M,Leung C C,Barras C,Gauvain J L.Constrained MLLR for speaker recognition. Proceedings of ICASSP . 2007
4ZHU Donglai,MA Bin,LI Haizhou,HUO Qiang.A generalized feature transformation approach for channel robust speaker verification. Proceedings of ICASSP . 2007
5Mak Man-Wai,Yiu Kwok-Kwong,Sun-Yuan Kung.Probabilistic feature-based transformation for channel robust speaker verification over telephone networks. Neurocomputing . 2007
6Gales M.The generation and use of regression class trees for MLLR adaptation. Technical Report,CUED/FINFENG/ TR263,Cambridge University . 1996
7Stolcke A,Ferrer L,Kajarekar S.Improvements inMLLR-transform-based speaker recognition. Proc IEEEOdyssey-The Speaker and Language Recognition Workshop . 2006
8Kajarekar, S.S,Scheffer, N,Graciarena, M.THE SRI NIST 2008 speaker recognition evaluation system. IEEE International Conference on Acoustics, Speech and Signal Processing . 2009
9A. Mandal,M. Ostendorf,A. Stolcke.Improving robustness of MLLR adaptation with speaker-clustered regression class trees. Computer Speech and Language . 2009
10Heck,L.P.,Weintraub,M.Handset-dependent background models for robust textindependent,speaker recognition. Proceedings of the International Conference on Acoustics,Speech and Signal Processing . 1997

同被引文献25

1刘海滨,吴镇扬,赵力,曾毓敏.噪声环境下基于最大后验非线性变换的隐马尔可夫模型自适应算法[J].声学学报,2004,29(5):467-471. 被引量：4
2YUYibiao,WANGShuozhong.Speaker identification based on complete feature corpus and evaluation of mutual information[J].Chinese Journal of Acoustics,2005,24(3):280-288. 被引量：1
3俞一彪,王朔中.文本无关说话人识别的全特征矢量集模型及互信息评估方法[J].声学学报,2005,30(6):536-541. 被引量：7
4赵蕤,王作英.语音识别中信道和噪音的联合补偿[J].声学学报,2006,31(5):466-470. 被引量：11
5Garreton C, Yoma N B. Telephone channel compensation in speaker verification using a polynomial approximation in the log-filter-bank energy domain. IEEE Trans. on Audio, Speech, and Language Processing, 2012; 20(1): 336-341.
6郭武.复杂信道下的说话人识别.博士学位论文,中国科学技术大学,2008.
7Lu Yong, Wu Haiyang, Wu Zhenyang. Robust speech recognition using improved vector Taylor series algorithm for embedded systems. IEEE Transactions on Consumer Electronics, 2010; 56(2): 764-769.
8Burger L, Matejka P, Schwarz Pet al. Analysis of feature extraction and channel compensation in a GMM speaker recognition system. IEEE Transactions on Audio, Speech, and Language Processing, 2007; 15(7): 1979-1986.
9Reynolds D A. Channel robust speaker verification via fea- ture mapping. In: Proc. ICASSP, 2003; 2:53-56.
10Teunen R, Shahshahani B, Heck L. A model-based trans- formational approach to robust speaker recognition. In: Proc. ICSLP, 2000; 2:495 498.

引证文献2

1吴海洋,杨飞然,周琳,吴镇扬.矢量泰勒级数特征补偿的说话人识别[J].声学学报,2013,38(1):105-112. 被引量：6
2仲伟峰,方祥,范存航,温正棋,陶建华.深浅层特征及模型融合的说话人识别[J].声学学报,2018,43(2):263-272. 被引量：13

二级引证文献19

1冉国敬,夏秀渝,张凤仪.信道失配环境下鲁棒说话人识别[J].计算机系统应用,2015,24(3):235-240. 被引量：2
2王现彬,杨洁,贾英茜,饶立婵.基于MATLAB的说话人识别系统设计与实现[J].石家庄学院学报,2016,18(3):5-8.
3酆勇,熊庆宇,石为人,曹俊华.深度非线性度量学习在说话人确认中的应用[J].声学学报,2018,43(1):112-120. 被引量：3
4谢景一,霍玉倩.新型智能清洁器的设计与改进[J].电子测试,2018,29(11):27-29. 被引量：3
5王亨佳,翁呈祥,胡乔林,刘康.短波信道下基于鲁棒语音特征参数的身份识别方法[J].空军预警学院学报,2019,33(4):281-286.
6曹毅,黄子龙,张威,刘晨,李巍.N-DenseNet的城市声音事件分类模型[J].西安电子科技大学学报,2019,46(6):9-16. 被引量：6
7张靖,俞一彪.具有环境自学习机制的鲁棒说话人识别算法[J].通信技术,2020,53(3):618-624. 被引量：2
8曾春艳,马超峰,王志锋,朱栋梁,赵楠,王娟,刘聪.深度学习框架下说话人识别研究综述[J].计算机工程与应用,2020,56(7):8-16. 被引量：9
9盛永健,黄子龙,刘晨,曹毅,张洪.基于改进卷积神经网络的燃气调压器故障识别研究[J].现代制造工程,2021(4):132-138. 被引量：3
10张兴明,杨凯.深度学习说话人识别中语音特征参数提取研究[J].现代计算机,2021,27(8):3-7. 被引量：2

1楼智美.用Noether定理确定各向同性谐振子的守恒量[J].力学与实践,2003,25(2):72-73. 被引量：5
2LU Yong WU Zhenyang.Maximum likelihood polynomial regression for robust speech recognition[J].Chinese Journal of Acoustics,2011,30(3):358-370.
3杨海,张翔,梁春燕,索宏彬,颜永红.联合因子分析和稀疏表示在稳健性说话人确认中的应用[J].声学学报,2012,37(5):548-552. 被引量：7
4马静,侯丽敏,王朔中.基于全局背景模型和竞争者模型的说话人确认系统[J].声学技术,2007,26(1):105-110. 被引量：1
5寇鑫,张大军,施英,赵松林.Generating Solutions to Discrete sine-Gordon Equation from Modified B(a|¨)cklund Transformation[J].Communications in Theoretical Physics,2011,55(4):545-550.
6崔建斌,姬安召,鲁洪江,王玉风,何姜毅,许泰.Schwarz Christoffel变换数值解法[J].山东大学学报（理学版）,2016,51(4):104-111. 被引量：5
7唐驾时,尹小波.一类强非线性振动系统的分叉[J].力学学报,1996,28(3):363-369. 被引量：24
8丘甜,华伟平,李新光,白鹏.双幂变换下参数极大似然估计的精确分布研究[J].云南民族大学学报（自然科学版）,2016,25(3):246-250.
9丘甜,白鹏,华伟平.双幂变换下线性回归模型中参数的极大似然和最小二乘估计的比较研究[J].金融经济（下半月）,2013(12):92-93. 被引量：1
10丘甜,华伟平,李新光,白鹏.双幂变换下参数的极大似然估计存在唯一性研究——以线性回归模型为例[J].武夷学院学报,2016,35(3):55-58.

声学学报

2011年第1期

浏览历史

内容加载中请稍等...

基于极大似然线性回归的模型合成和特征映射进行说话人确认被引量：2

参考文献20

同被引文献25

引证文献2

二级引证文献19

相关作者

相关机构

相关主题

浏览历史

基于极大似然线性回归的模型合成和特征映射进行说话人确认 被引量：2

参考文献20

同被引文献25

引证文献2

二级引证文献19

相关作者

相关机构

相关主题

浏览历史

基于极大似然线性回归的模型合成和特征映射进行说话人确认被引量：2