期刊文献+

基于矢量量化方法的说话人识别技术

Speaker Recognition Technology Based on VQ
下载PDF
导出
摘要 说话人识别是一项通过语音来识别说话人身份的技术,它在保安、司法、军事、财经和信息服务等领域都具有广泛的应用前景。该文采用线性预测倒谱系数和美尔倒谱系数特征相结合,基于矢量量化聚类方法建立了一个与文本无关的、连续语音发音的说话人识别系统。只要矢量量化聚类法码本大小选择合适,该说话人识别系统就可以获得较好的识别效果。当阈值恰当选取时,该系统具备拒绝识别集外人的功能。 Speaker recognition is a kind of technology to judge the speaker's identify according to his voice. It has good prospect in many areas such as security, judicatory, and military. One speaker identification system by extracting MFCC as feature vector and using VQ in match phase is constructed. The results of the experiment indicate that, the speaker recognition model based on VQ is effective; the advantage is correct classifying, small memory need and rapid judging.
作者 张一清 李轶
出处 《杭州电子科技大学学报(自然科学版)》 2005年第4期58-61,共4页 Journal of Hangzhou Dianzi University:Natural Sciences
关键词 矢量量化 说话人识别 线性预测倒谱系数 美尔倒谱系数 vector quantization(VQ) speaker identification LPCC cepstrum MFCC cepstrum
  • 相关文献

参考文献4

二级参考文献12

  • 1周汀,闵昊,章倩苓.一种矢量量化编码的加速算法[J].电子学报,1997,25(4):95-98. 被引量:6
  • 2Huang Xuedong, Acero A, Hon H W. Spoken Language Processing.Prentice Hall,2001.
  • 3Young S, Kershaw D, Odell J, et al. The HTK Book.Microsoft Corporation &CUED,2000.
  • 4Duda R O, Hart P E, Stork D G. Pattern Classification (Second Edition). A Wiley-interscience Publication, 2001.
  • 5Wendt S, Fink G A, Kummert F. Forward Masking for Increased Robustness in Automatic Speech Recognition. in: Proc. of European Conf. on Speech Communication and Technology, Aalborg,Danemark, 2001,1:615-618.
  • 6Hermansky H. Perceptual Linear Predictive(PLP) Analysis for Speech.J Acoust Soc Am ,1990,87:1738-1752.
  • 7Chang Hsinglee,Signal Processing,1995年,43卷,323页
  • 8Chang Dabei,IEEE Trans Commun,1985年,33卷,10期,1132页
  • 9Rabiner L Juang Biing-Hwang.Fundamentals of Speech Recognition[M].北京:清华大学出版社(影印版),1999..
  • 10杨行峻 迟惠生 等.语音数字信号处理[M].北京:电子工业出版社,1995..

共引文献57

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部