期刊文献+

GMM文本无关的说话人识别系统研究 被引量:27

Research on GMM text-independent speaker recognition
下载PDF
导出
摘要 在高斯混合模型(Gaussian Mixture Model,GMM)训练时,对传统的模型参数初始化方法(随机法、K均值聚类法)进行改进,提出分裂法与K均值聚类相结合的新方法。实验表明,采用改进的方法与传统方法相比,系统平均识别率有15.47%和7.5%的提高。研究了GMM的阶数、协方差阈值、预加重系数对系统识别率的影响。对实验结果进行详细分析,并根据实验数据,取它们各自表现最好的值,从而使构建的说话人识别系统获得一个较高的识别率。实验表明,在规定的实验条件下,系统可达到90%以上的识别率。 This paper improves the traditional method of Gaussian Mixture Mode(lGMM) parameters initialization at the time of GMM training.A new approach which combines division and K-means clustering is presented.The experiment shows that the proposed method can achieve the average recognition rate increase by 15.47% and 7.5% compared with the randomization and Kmeans clustering.At the same time,the impact of the order of GMM,covariance threshold and pre-emphasis coefficient on system recognition rate are studied.Meanwhile,the experiment results are analyzed in detail.In order to make the speaker recognition system get a higher recognition rate,their optimal values are chosen from the experiment data.The experiment shows that the system can achieve the recognition rate with above 90% under the provided experimental condition.
作者 蒋晔 唐振民
出处 《计算机工程与应用》 CSCD 北大核心 2010年第11期179-182,195,共5页 Computer Engineering and Applications
关键词 说话人识别 高斯混合模型 美尔频率倒谱系数(MFCC) 分裂法与K均值聚类结合法 speaker recognition Gaussian Mixture Moda(lGMM) Mel Frequency Cepstrum Coefficien(tMFCC) combination division and K-means clustering
  • 相关文献

参考文献7

  • 1Reynolds D A,Rose R C.Robust text-independent speaker identification using Gaussian mixture speaker models[J].IEEE Transactions on Speech and Audio Processing,1995,3(1):72-83.
  • 2Reynolds D A.Speaker identification and verification using Gaussian mixture speaker model[J].Speech Communication,1995,17:91-108.
  • 3You K H.Wang H C.Joint estimation of feature transformation parameters and Gaussian mixture model for speaker identification[J].Speech Communication,1999,28:227-241.
  • 4Jim Z C.Improvement of the K-means clustering filtering algorithm[J].Pattern Recognition,2008,41 (12):3677-3681.
  • 5岳喜才,叶大田.文本无关的说话人识别:综述[J].模式识别与人工智能,2001,14(2):194-200. 被引量:8
  • 6吴尊敬,曹志刚.Improved MFCC-Based Feature for Robust Speaker Identification[J].Tsinghua Science and Technology,2005,10(2):158-161. 被引量:7
  • 7Reynolds D A,Thomas F.Speaker verification using adapted Gaus-sian mixture models[J].Digital Signal Processing,2000,10 (1-3):19-41.

二级参考文献10

共引文献13

同被引文献200

引证文献27

二级引证文献75

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部