Speaker Recognition Based on Nonparametric Independent Component Analysis


Abstract: Time-domain basis functions that capture a speaker's audio characteristics are first extracted by nonparametric independent component analysis; a speech signal can then be represented as a linear combination of these basis functions. Each enrolled speaker has a distinct set of basis functions. For speech from a given speaker, only that speaker's own basis set makes the components of the coefficient vector maximally independent, that is, minimizes their mutual information. To recognize a test utterance, coefficient vectors are computed with each known speaker's basis set, and the mutual information among the components of each coefficient vector is calculated. The speaker whose basis set yields the minimum mutual information is taken as the recognition result. Experiments show that a high recognition rate can be achieved even with very little test data.
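The decision rule in the abstract can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the function names, the histogram entropy estimator (standing in for whatever nonparametric density estimate the authors use), the synthetic Laplacian sources, and the random basis matrices that take the place of basis sets learned by ICA. For frames x and a square unmixing matrix W (the inverse of a speaker's basis matrix), the coefficients are y = Wx, and the mutual information among their components equals sum_i H(y_i) - log|det W| - H(x). Since H(x) is the same for every candidate speaker, it can be dropped when comparing scores.

```python
import numpy as np

def marginal_entropy(samples, bins=100):
    """Histogram estimate of the differential entropy (in nats) of a 1-D sample."""
    counts, edges = np.histogram(samples, bins=bins)
    width = edges[1] - edges[0]
    p = counts[counts > 0] / counts.sum()
    return -np.sum(p * np.log(p)) + np.log(width)

def dependence_score(frames, unmixing):
    """Mutual information among coefficient components, up to the constant H(x).

    frames:   (n_frames, dim) array, one time-domain frame per row
    unmixing: (dim, dim) inverse of a speaker's basis matrix (hypothetical input)
    """
    coeffs = frames @ unmixing.T                     # y = W x for every frame
    _, logabsdet = np.linalg.slogdet(unmixing)
    return sum(marginal_entropy(c) for c in coeffs.T) - logabsdet

def recognize(frames, unmixing_by_speaker):
    """Return the speaker whose basis set gives the smallest score, plus all scores."""
    scores = {name: dependence_score(frames, w)
              for name, w in unmixing_by_speaker.items()}
    return min(scores, key=scores.get), scores

# Synthetic demonstration: random matrices stand in for learned basis sets.
rng = np.random.default_rng(0)
dim, n_frames = 4, 20000
bases = {name: rng.normal(size=(dim, dim)) for name in ("speaker_a", "speaker_b")}
unmixing = {name: np.linalg.inv(a) for name, a in bases.items()}

# Frames "spoken" by speaker_a: independent Laplacian coefficients mixed by that basis.
test_frames = rng.laplace(size=(n_frames, dim)) @ bases["speaker_a"].T
best, scores = recognize(test_frames, unmixing)
print(best, scores)
```

In a real system the unmixing matrices would come from running ICA on each speaker's enrollment speech, and the frames would be short windows of the waveform; the sketch only shows the scoring and selection step.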

Authors: 陈刚, 陈莘萌, 向广利

Affiliation: School of Computer Science, Wuhan University

Source: Computer Science (《计算机科学》), indexed in CSCD and the Peking University Core Journals list, 2006, No. 3, pp. 167-170 (4 pages)

Funding: National Natural Science Foundation of China (10371033); National 211 Project major program

Keywords: nonparametric independent component analysis; time-domain basis functions; coefficient vectors; mutual information

Classification code: TN911.7 [Electronics and Telecommunications: Communication and Information Systems]


