基于复杂性的说话人识别技术探讨被引量：5

Studies on Speaker Recognition Based on Complexity

下载PDF

导出

摘要目的用非线性方法中的复杂性特征来分析非平稳的语音信号。方法采用 5 1位被试者讲同一段话的语音数据 ,语音样本包括两类 :同一被试者在不同时刻讲同一段话及不同被试者讲同一段话 ,分别从复杂性特征曲线和复杂性特征曲线重构的嵌入空间分析处理。结果同一被试者不同时刻讲同一段话的复杂性曲线间的差异 ,明显地低于不同被试者讲同一段话的复杂性曲线间的差异。另外 ,从复杂性特征曲线重构的嵌入空间可以看出 ,同一说话人的复杂性曲线离散点相对集中于在此嵌入空间中同一区域 ,而与其他人的特征离散点存在统计学上的明显差异。结论复杂性分析方法能够用于语音特征分析。 Objective To analyze the speech characters with computation complexity. Method Voices of 51 testees spoke the same paragraph were recorded and the same sentence of voice waveform was intercepted as source data. There were two kinds of sample voices: same testee speaking the same sentence at different time and different testee speaking the same sentence. Result The computation complexity curves of the different testee were obviously distinguishing, while those of the same testee were almost the same. In a 2D embedded space the computation complexity features of individual testee differs with others even if testees speak the same sentences. Conclusion This complexity features might be applied to speaker recognition system and using complexity method to analyze speech signals has wide application prospect.

作者李敏丹张宏童勤业

机构地区浙江大学生物医学工程系

出处《航天医学与医学工程》 CAS CSCD 北大核心 2003年第3期215-219,共5页 Space Medicine & Medical Engineering

关键词非线性方法语音识别复杂性二维嵌入空间 non linear methods voice identification complexity 2 D embedded space

分类号 R857.3 [医药卫生—航空、航天与航海医学]

引文网络
相关文献

参考文献4

1杨澄宇,赵文,杨鉴.基于高斯混合模型的说话人确认系统[J].计算机应用,2001,21(4):7-8. 被引量：5
2岳喜才,伍晓宇,郑崇勋.用神经阵列网络进行文本无关的说话人识别[J].声学学报,2000,25(3):230-234. 被引量：14
3赵铮,侯伯亨.基于小波变换说话人识别技术的研究[J].西安电子科技大学学报,2000,27(4):437-441. 被引量：16
4叶明,顾利民.LPC倒谱参数的说话人特征分析[J].南京航空航天大学学报,1994,26(6):797-804. 被引量：7

二级参考文献10

1程俊,张璞,戴善荣,易克初.小波变换用于信号突变的检测[J].通信学报,1995,16(3):96-104. 被引量：36
2牟晓隆,胡起秀,吴文虎.与文本无关的复合策略说话人辨识系统[J].清华大学学报（自然科学版）,1997,37(3):16-19. 被引量：6
3程正兴（译），小波分析导论，1995年
4杨行峻，语音信号数据处理，1995年
5Farrell K R,Mammone R J,Assaleh K T.Speaker recognition using neural networks and conventional classifiers[].IEEE Transactions on Speech and Audio Processing.1994
6Reynolds D A,Rose R.Robust text-independent speakeridentification using Gaussian mixture speaker models[].IEEE Transactions on Speech and Audio Processing.1995
7O’Shaughnessy D.Speaker recognition[].IEEE ASSP Magazine.1986
8Atal B.Effectiveness of linear predictive characteristics ofthe speech wave for automatic speaker identification andverification[].The Journal of The Acoustical Society of America.1974
9Hush D R,Home B G.Progress in supervised neural networks[].IEEE Signal Processing Magazine.1993
10李蕴华.将倒谱参数与基音信息有效结合进行说话人辨认[J].信号处理,2000,16(1):85-89. 被引量：7

共引文献38

1岳喜才,叶大田,管桦.多分类问题的RBF二叉神经树网络方法[J].空军工程大学学报（自然科学版）,2000,1(1):34-39. 被引量：1
2俞一彪,王朔中.基于互信息匹配模型的说话人识别[J].声学学报,2004,29(5):462-466. 被引量：8
3武妍,金明曦,王洪波.基于KL-小波包分析的文本无关的说话人识别[J].计算机工程与应用,2005,41(4):26-28. 被引量：6
4勾轶,刘晓丽,陈长征.基于小波与神经网络的说话人身份识别[J].沈阳工业大学学报,2005,27(1):87-90.
5张海英,赵军,潘永湘.一种基于小波变换的混叠语音基音提取方法[J].西安理工大学学报,2005,21(2):200-203.
6YUYibiao,WANGShuozhong.Speaker identification based on complete feature corpus and evaluation of mutual information[J].Chinese Journal of Acoustics,2005,24(3):280-288. 被引量：1
7芮贤义,俞一彪.基于小波变换的鲁棒型特征提取及说话人识别[J].电路与系统学报,2005,10(5):129-132. 被引量：7
8邱政权,尹俊勋.针对说话人识别进行的加权小波去噪方法[J].电声技术,2005,29(9):53-56. 被引量：3
9俞一彪,王朔中.文本无关说话人识别的全特征矢量集模型及互信息评估方法[J].声学学报,2005,30(6):536-541. 被引量：7
10王永琦,邓琛,杨洋.语音增强用于抗噪声的汉语说话人识别[J].微电子学与计算机,2006,23(2):166-168. 被引量：4

<12 3 4 >

同被引文献47

1祁亨年.支持向量机及其应用研究综述[J].计算机工程,2004,30(10):6-9. 被引量：186
2叶明,顾利民.LPC倒谱参数的说话人特征分析[J].南京航空航天大学学报,1994,26(6):797-804. 被引量：7
3张忠建,陈式刚.圆映象的符号动力学[J].物理学报,1989,38(1):1-8. 被引量：5
4吴淑珍,吴阿华.说话人识别的参量研究和语音库建设[J].北京大学学报（自然科学版）,1995,31(3):316-322. 被引量：4
5黄勇,郑春颖,宋忠虎.多类支持向量机算法综述[J].计算技术与自动化,2005,24(4):61-63. 被引量：33
6Arnold W M Smeulders,Marcel Wordng,Simone Santini et al.Content-Based Image Retrieval at the End of the Early Years[J].IEEE Trans on Pattern Analysis and Machine Intelligence,2000;22(12): 1349-1380.
7James Z Wang,Jia Li,Gio Wiederhold.SIMPLIcity:Semantics-Sensitire Integrated Matching for Picture Libraries[J].IEEE Trans on Pattern Analysis and Machine Intelligence,2001 ;23(9):947-963.
8R Zhao,W I Grosky.Negotiating the semantic gap:from feature maps to semantic landscapes[J].Pattern Recognition, 2002 ; 35 (3) : 593-600.
9B S Manjunath,Jens-Rainer Ohm,Vinod V Vasudevan et al.Color and Texture Deseriptors[J].IEEE Trans on Circuits and Systems for Video Technology ,2001 ; 11 (6) :703-715.
10M K Mandal,T Aboulnasr,S Panchanathan.Fast Wavelet Histogram Techniques for Image Indexing[J].Computer Vision and Image Understanding, 1999 ;75(1/2) :99-110.