Speaker Recognition Based on Dynamic-Threshold Vector Quantization (cited by: 4)

Published English title: Vector quantization based on the dynamic threshold of speaker recognition
Abstract: In the LBG algorithm used by speaker recognition systems based on vector quantization (VQ), the threshold applied when the codebook is split is one of the important factors affecting how the initial codebook is generated. The threshold used in the traditional approach is hard to determine, and a large number of experiments are needed to arrive at an empirical value. This paper proposes generating the threshold dynamically and randomly within a certain range to improve the initial codebook formation strategy, and combines this with differential cepstral parameters to build the speaker recognition model. Experimental results show that, while the recognition rate improves to some extent, the training time and the recognition time both improve markedly.
Source: 《计算机应用》 (Journal of Computer Applications), CSCD, Peking University Core Journal, 2009, No. 1, pp. 146-148 (3 pages)
Funding: Natural Science Foundation of Chongqing (CSTC2007BB6118); China Postdoctoral Science Foundation (20080430750)
Keywords: speaker recognition; vector quantization (VQ); LBG algorithm; dynamic threshold
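
The idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes that the "splitting threshold" is the perturbation factor ε of standard LBG codeword splitting, and that "dynamic and random" means drawing ε uniformly from a fixed interval at each splitting stage. The interval `eps_range`, the function names, and the stopping rule are all hypothetical choices made for illustration; the abstract gives no concrete values. Feature extraction (cepstral and differential cepstral parameters) is left out, and the inputs are plain lists of feature vectors.

```python
import random


def sq_dist(a, b):
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def centroid(vectors):
    """Component-wise mean of a non-empty list of vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def nearest(codebook, v):
    """Index of the codeword closest to v."""
    return min(range(len(codebook)), key=lambda i: sq_dist(codebook[i], v))


def lbg_dynamic_threshold(vectors, size, eps_range=(0.01, 0.05),
                          tol=1e-4, max_iter=50, rng=None):
    """LBG training where the splitting factor eps is redrawn at random
    for every splitting stage (assumed reading of the abstract).
    `size` should be a power of two; eps_range is a hypothetical interval.
    Multiplicative splitting assumes codewords are not the zero vector."""
    rng = rng or random.Random()
    codebook = [centroid(vectors)]
    while len(codebook) < size:
        eps = rng.uniform(*eps_range)  # dynamic, random threshold
        # Split every codeword y into y*(1+eps) and y*(1-eps).
        codebook = [[c * (1 + s * eps) for c in cw]
                    for cw in codebook for s in (1, -1)]
        prev = float("inf")
        for _ in range(max_iter):
            # Assign each training vector to its nearest codeword.
            cells = [[] for _ in codebook]
            total = 0.0
            for v in vectors:
                i = nearest(codebook, v)
                cells[i].append(v)
                total += sq_dist(codebook[i], v)
            # Move codewords to cell centroids; keep empty cells as-is.
            codebook = [centroid(c) if c else cw
                        for c, cw in zip(cells, codebook)]
            mean = total / len(vectors)
            # Stop when the relative drop in distortion is small.
            if prev - mean <= tol * max(mean, 1e-12):
                break
            prev = mean
    return codebook


def avg_distortion(codebook, vectors):
    """Mean squared distance from each vector to its nearest codeword."""
    return sum(sq_dist(codebook[nearest(codebook, v)], v)
               for v in vectors) / len(vectors)


def identify(models, vectors):
    """Return the speaker whose codebook gives the lowest distortion.
    `models` maps a speaker label to a trained codebook."""
    return min(models, key=lambda name: avg_distortion(models[name], vectors))
```

In use, one codebook would be trained per enrolled speaker with `lbg_dynamic_threshold`, and a test utterance would be assigned with `identify` to the speaker whose codebook yields the smallest average distortion. Passing a seeded `random.Random` as `rng` makes a training run reproducible.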