期刊文献+

基于稀疏表征的话者识别 被引量:2

Speaker identification based on sparse representation
下载PDF
导出
摘要 近年来,随着信号的稀疏性理论越来越受到人们的关注,稀疏表征分类器也作为一种新型的分类算法被应用到话者识别系统中。该模型的基本思想是:只要超完备字典足够大,任意待测样本都能够用超完备字典进行线性表示。基于信号的稀疏性理论,未知话者的向量系数,即稀疏解可以通过L1范数最小化获取。超完备字典则可视为语音特征向量在高斯混合模型-通用背景模型(GMM-UBM)上进行MAP自适应而得到的大型数据库。采用稀疏表征模型作为话者辨认的分类方法,基于TIMIT语料库的实验结果表明,所采用的话者辨认方法,能够大大提高说话人识别系统的性能。 The signal sparse theory has received more and more attentions in recent years. Sparse representation, a new classification method for speaker identification has been applied into the speaker identification system. The main idea based on this new approach is that an unknown test utterance can be represented as a linear combination of the training database while the training patterns are sufficient. According to the sparse theory, the coefficients of unknown test utterances corresponding to the class index of test models could be obtained by L1-norm minimization. Over-complete dictionary could be developed by adapting speech features to Gaussian Mixture Model-Universal Background Model(GMM-UBM) using Maximum-A-Posteriori(MAP) adaptation. This paper makes use of the sparse representation model for speaker identification, and the experiments conducted on TIMIT acoustic-phonetic continuous speech corpus show that the perfor-mance of the proposed method consistently outperforms the state of art speaker identification classifiers.
出处 《计算机工程与应用》 CSCD 2014年第20期215-217,243,共4页 Computer Engineering and Applications
基金 国家仪器重大专项(No.2012YQ15008703)
关键词 稀疏表征 高斯混合模型(GMM)均值超向量 超完备字典 最大后验(MAP)算法 sparse representation Gaussian Mixture Model(GMM)supervectors over-complete dictionary Maximum-A-Posteriori(MAP)algorithm
  • 相关文献

参考文献14

  • 1Higgins A,Bahler L,Porter J.Voice identification using nearest neighbor distance measure[C]//IEEE International Conference on Acoustics,Speech,and Signal Processing,1993:375-378.
  • 2Reynolds D A.Speaker identification and verification using Gaussian mixture speaker models[J].Speech Communication,1995,17(1/2):91-108.
  • 3Reynolds D A,Quatieri T F,Dunn R.Speaker verification using adapted Gaussian mixture models[J].Digital Signal Processing,2000,10(1/3):19-41.
  • 4Solomonoff A,Campbell W M,Boardman I.Advances in channel compensation for SVM speaker recognition[C]//IEEE International Conference on Acoustics,Speech,and Signal Processing,2005:629-632.
  • 5Campbell W,Sturim D,Reynolds D.Support vector machines using GMM supervectors for speaker verification[J].IEEE Signal Processing Letters,2006,13(5):309-311.
  • 6Donoho D L.Compressed sensing[J].IEEE Transactions on Information Theory,2006,52(4):1289-1305.
  • 7Candès E,Romberg J,Tao T.Stable signal recovery from incomplete and inaccurate measurements[J].Common on Pure and Applied Mathematics,2006,59(8):1207-1223.
  • 8Lewicki M S,Sejnowski T J.Learning overcomplete representations[J].Neural Computer,2000,12(2):337-365.
  • 9Naseem I,Togneri R,Bennamoun M.Sparse representation for speaker identification[C]//International Conference on Pattern Recognition,2010:4460-4463.
  • 10Mallat S,Zhang Z.Matching pursuit with time-frequency dictionaries[J].IEEE Transactions on Signal Processing,1993,41(12):3397-3415.

同被引文献15

  • 1Cand:s E. Compressive Sampling [ C ]//Proceedings of the International Congress of Mathematicians. Madrid: Amer Mathematical Society, 2006: 1433-1452.
  • 2Richard G, Baraniuk. Compressive Sensing[ J]. IEEE Signal Processing Mag:ine, 2007, 24(4) :118-124.
  • 3Wright J, YANG A Y, Ganesh A, et al. Robust Face Recognition via Sparse Representation [ J ]. IEEE Trans. On Pattern Analysis and Machine Intelligence, 2009, 31(2) : 210-227.
  • 4CHEN Y, LIU J, Lv S T. Modulation Classification based on Bispectmm and Sparse Representation in Cognitive Radio [C]//IEEE 13th International Conference on Communica- tion Technology. Piscataway: IEEE Press ,2011:25-28.
  • 5SHEN Y, LIU G H, LIU It. Classification of Power Quali- ty Disturbances based on Random Matrix Transform and Sparse Representation[ C ]//8tb World Congress on Intelli- gent Control and Automation. Pi:ataway: IEEE Press, 2010: 7-9.
  • 6ZHENG C H, ZHANG L, HUANG D S, et al. Metasam- pie-based Sparse Representation for Tumor Classification [ J ]. IEEE- ACM Trans. On Comput Biol Bioinform, 2011, 8(5): 1273-1282.
  • 7Ptucha R, Savakis A E. LGE-KSVD: Robust Sparse Representation Classification [ J ]. IEEE Trans. On Im- age Processing ,2014,23 ( 04 ) : 1737-1750.
  • 8Li Y, Yu Z, Bi N, et al. Sparse Representation for Brain Signal Processing : A Tutorial on Methods and Applications [ J ]. IEEE Signal Processing Magazine, 2014, 31 (03) :96-106.
  • 9陆慧娟,陆江江,王明怡,陆羿.基于压缩感知的癌症基因表达数据分类[J].中国计量学院学报,2012,23(1):70-74. 被引量:7
  • 10孙道达,赵健,王瑞,冯宁,胡江华.基于稀疏表示的QR码识别[J].计算机应用,2013,33(1):179-181. 被引量:10

引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部