Speaker Recognition Based on Dynamic-Threshold Vector Quantization (cited by: 4)

Vector quantization based on the dynamic threshold of speaker recognition

Abstract: In the LBG algorithm used by speaker recognition systems based on vector quantization (VQ), the threshold applied when the codebook is split is one of the key factors affecting initial codebook generation. The threshold used in the conventional approach is hard to determine, and obtaining an empirical value requires a large number of experiments. This paper proposes generating the threshold dynamically and randomly within a given range to improve the initial codebook formation strategy, and combines this with differential cepstral parameters to build the speaker recognition model. Experimental results show that, alongside a modest improvement in recognition rate, both training time and recognition time improve noticeably.
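The abstract gives no formulas, so the following is only a minimal sketch of the idea, not the authors' implementation. It assumes that the "threshold" is the perturbation factor used when each codeword is split in two, and that it is redrawn uniformly from a range at every split. The names `lbg_codebook`, `avg_distortion`, `eps_range`, and `tol`, and the range values themselves, are hypothetical. Each row of `features` is assumed to be one frame of cepstral coefficients, with the differential coefficients already appended.

```python
import numpy as np

def lbg_codebook(features, size, eps_range=(0.01, 0.05), tol=1e-4, max_iter=50, seed=0):
    """Train a VQ codebook by LBG splitting; `size` should be a power of two."""
    rng = np.random.default_rng(seed)
    # Start from a single codeword: the centroid of all training frames.
    codebook = features.mean(axis=0, keepdims=True)
    while codebook.shape[0] < size:
        # Assumption: the split perturbation is drawn at random for each split
        # instead of being a hand-tuned constant.
        eps = rng.uniform(*eps_range)
        codebook = np.vstack([codebook * (1 + eps), codebook * (1 - eps)])
        prev = np.inf
        for _ in range(max_iter):
            # Squared Euclidean distance from every frame to every codeword.
            d = ((features[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
            nearest = d.argmin(axis=1)
            distortion = d[np.arange(len(features)), nearest].mean()
            for k in range(codebook.shape[0]):
                members = features[nearest == k]
                if len(members):  # an empty cell keeps its old codeword
                    codebook[k] = members.mean(axis=0)
            # Stop refining once the relative drop in distortion is small.
            if prev - distortion <= tol * distortion:
                break
            prev = distortion
    return codebook

def avg_distortion(features, codebook):
    """Mean squared distance from each frame to its nearest codeword."""
    d = ((features[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
    return d.min(axis=1).mean()
```

Under these assumptions, one codebook is trained per enrolled speaker, and a test utterance is attributed to the speaker whose codebook yields the lowest `avg_distortion`.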

Authors: 亢明 (Kang Ming), 汪成亮 (Wang Chengliang), 陈娟娟 (Chen Juanjuan)

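Affiliations: College of Computer Science, Chongqing University; College of Electrical Engineering, Chongqing University; College of Physics and Information Technology, Chongqing Normal University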

Source: 《计算机应用》 (Journal of Computer Applications), indexed in CSCD and the Peking University Core Journals list, 2009, No. 1, pp. 146-148 (3 pages)

Funding: Natural Science Foundation of Chongqing (CSTC2007BB6118); China Postdoctoral Science Foundation (20080430750)

Keywords: speaker recognition; vector quantization (VQ); LBG algorithm; dynamic threshold

Classification: TP391.42 [Automation and Computer Technology: Computer Application Technology]

