期刊文献+

基于复杂性的说话人识别技术探讨 被引量:5

Studies on Speaker Recognition Based on Complexity
下载PDF
导出
摘要 目的用非线性方法中的复杂性特征来分析非平稳的语音信号。方法采用 5 1位被试者讲同一段话的语音数据 ,语音样本包括两类 :同一被试者在不同时刻讲同一段话及不同被试者讲同一段话 ,分别从复杂性特征曲线和复杂性特征曲线重构的嵌入空间分析处理。结果同一被试者不同时刻讲同一段话的复杂性曲线间的差异 ,明显地低于不同被试者讲同一段话的复杂性曲线间的差异。另外 ,从复杂性特征曲线重构的嵌入空间可以看出 ,同一说话人的复杂性曲线离散点相对集中于在此嵌入空间中同一区域 ,而与其他人的特征离散点存在统计学上的明显差异。结论复杂性分析方法能够用于语音特征分析 。 Objective To analyze the speech characters with computation complexity. Method Voices of 51 testees spoke the same paragraph were recorded and the same sentence of voice waveform was intercepted as source data. There were two kinds of sample voices: same testee speaking the same sentence at different time and different testee speaking the same sentence. Result The computation complexity curves of the different testee were obviously distinguishing, while those of the same testee were almost the same. In a 2D embedded space the computation complexity features of individual testee differs with others even if testees speak the same sentences. Conclusion This complexity features might be applied to speaker recognition system and using complexity method to analyze speech signals has wide application prospect.
出处 《航天医学与医学工程》 CAS CSCD 北大核心 2003年第3期215-219,共5页 Space Medicine & Medical Engineering
关键词 非线性方法 语音识别 复杂性 二维嵌入空间 non linear methods voice identification complexity 2 D embedded space
  • 引文网络
  • 相关文献

参考文献4

二级参考文献10

  • 1程俊,张璞,戴善荣,易克初.小波变换用于信号突变的检测[J].通信学报,1995,16(3):96-104. 被引量:36
  • 2牟晓隆,胡起秀,吴文虎.与文本无关的复合策略说话人辨识系统[J].清华大学学报(自然科学版),1997,37(3):16-19. 被引量:6
  • 3程正兴(译),小波分析导论,1995年
  • 4杨行峻,语音信号数据处理,1995年
  • 5Farrell K R,Mammone R J,Assaleh K T.Speaker recognition using neural networks and conventional classifiers[].IEEE Transactions on Speech and Audio Processing.1994
  • 6Reynolds D A,Rose R.Robust text-independent speakeridentification using Gaussian mixture speaker models[].IEEE Transactions on Speech and Audio Processing.1995
  • 7O’Shaughnessy D.Speaker recognition[].IEEE ASSP Magazine.1986
  • 8Atal B.Effectiveness of linear predictive characteristics ofthe speech wave for automatic speaker identification andverification[].The Journal of The Acoustical Society of America.1974
  • 9Hush D R,Home B G.Progress in supervised neural networks[].IEEE Signal Processing Magazine.1993
  • 10李蕴华.将倒谱参数与基音信息有效结合进行说话人辨认[J].信号处理,2000,16(1):85-89. 被引量:7

共引文献38

同被引文献47

  • 1祁亨年.支持向量机及其应用研究综述[J].计算机工程,2004,30(10):6-9. 被引量:186
  • 2叶明,顾利民.LPC倒谱参数的说话人特征分析[J].南京航空航天大学学报,1994,26(6):797-804. 被引量:7
  • 3张忠建,陈式刚.圆映象的符号动力学[J].物理学报,1989,38(1):1-8. 被引量:5
  • 4吴淑珍,吴阿华.说话人识别的参量研究和语音库建设[J].北京大学学报(自然科学版),1995,31(3):316-322. 被引量:4
  • 5黄勇,郑春颖,宋忠虎.多类支持向量机算法综述[J].计算技术与自动化,2005,24(4):61-63. 被引量:33
  • 6Arnold W M Smeulders,Marcel Wordng,Simone Santini et al.Content-Based Image Retrieval at the End of the Early Years[J].IEEE Trans on Pattern Analysis and Machine Intelligence,2000;22(12): 1349-1380.
  • 7James Z Wang,Jia Li,Gio Wiederhold.SIMPLIcity:Semantics-Sensitire Integrated Matching for Picture Libraries[J].IEEE Trans on Pattern Analysis and Machine Intelligence,2001 ;23(9):947-963.
  • 8R Zhao,W I Grosky.Negotiating the semantic gap:from feature maps to semantic landscapes[J].Pattern Recognition, 2002 ; 35 (3) : 593-600.
  • 9B S Manjunath,Jens-Rainer Ohm,Vinod V Vasudevan et al.Color and Texture Deseriptors[J].IEEE Trans on Circuits and Systems for Video Technology ,2001 ; 11 (6) :703-715.
  • 10M K Mandal,T Aboulnasr,S Panchanathan.Fast Wavelet Histogram Techniques for Image Indexing[J].Computer Vision and Image Understanding, 1999 ;75(1/2) :99-110.

引证文献5

二级引证文献17

相关主题

;
使用帮助 返回顶部