
Speaker Recognition Based on Discrete Wavelet Transform and RBF Neural Networks (cited by: 4)
Abstract: To improve the performance of speaker recognition systems, a new speaker recognition method is proposed that combines the discrete wavelet transform (DWT) with an RBF neural network. The wavelet transform is integrated into Mel-frequency cepstral coefficient (MFCC) extraction: the discrete cosine transform used in MFCC is replaced by the DWT, and the amplitudes of the transformed spectrum are taken as the feature parameters. An RBF neural network, which offers better approximation ability, classification ability, and learning speed, replaces the commonly used BP network, and the selection of the initial RBF network weights is optimized with a method that depends on the input samples. Experiments with different speech lengths and signal-to-noise ratios (SNRs) show that both the recognition rate and the robustness of the system are improved.
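Source: Journal of Xi'an University of Technology (《西安理工大学学报》), indexed in CAS and the Peking University Core Journals list, 2011, No. 3, pp. 368-372 (5 pages)
Funding: Industrialization Fund of the Shaanxi Provincial Department of Education (05JC13)
Keywords: speaker recognition; MFWC; RBF neural network; initial weight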
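
The pipeline described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the paper's implementation: the abstract does not name the wavelet, the decomposition depth, the filter count, or the weight-initialization rule, so the Haar wavelet, two levels, 16 mel filters, sample-derived centres, and the width heuristic below are all assumptions, and the names `mfwc`, `haar_dwt`, and `RBFNetwork` are hypothetical.

```python
import numpy as np

def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_filterbank(n_filters, n_fft, sample_rate):
    """Triangular mel filters, shape (n_filters, n_fft // 2 + 1)."""
    mels = np.linspace(hz_to_mel(0.0), hz_to_mel(sample_rate / 2.0), n_filters + 2)
    bins = np.floor((n_fft + 1) * mel_to_hz(mels) / sample_rate).astype(int)
    fbank = np.zeros((n_filters, n_fft // 2 + 1))
    for i in range(1, n_filters + 1):
        left, centre, right = bins[i - 1], bins[i], bins[i + 1]
        for k in range(left, centre):        # rising slope
            fbank[i - 1, k] = (k - left) / max(centre - left, 1)
        for k in range(centre, right):       # falling slope
            fbank[i - 1, k] = (right - k) / max(right - centre, 1)
    return fbank

def haar_dwt(x, levels):
    """Orthonormal multi-level Haar DWT (assumed wavelet; the abstract names none).
    Input length must be divisible by 2**levels.
    Returns [approximation, coarsest detail, ..., finest detail] concatenated."""
    approx = np.asarray(x, dtype=float)
    details = []
    for _ in range(levels):
        even, odd = approx[0::2], approx[1::2]
        details.append((even - odd) / np.sqrt(2.0))
        approx = (even + odd) / np.sqrt(2.0)
    return np.concatenate([approx] + details[::-1])

def mfwc(frame, sample_rate, n_filters=16, n_fft=512, levels=2):
    """MFCC-style features with the DCT step replaced by a DWT."""
    windowed = frame * np.hamming(len(frame))
    power = np.abs(np.fft.rfft(windowed, n_fft)) ** 2
    log_energy = np.log(mel_filterbank(n_filters, n_fft, sample_rate) @ power + 1e-10)
    # The amplitudes of the transformed spectrum serve as the feature vector.
    return np.abs(haar_dwt(log_energy, levels))

class RBFNetwork:
    """Gaussian RBF classifier whose centres are taken from the training samples
    (an assumed stand-in for the paper's sample-dependent initialization)."""

    def __init__(self, n_centres):
        self.n_centres = n_centres

    def _hidden(self, X):
        d2 = ((X[:, None, :] - self.centres[None, :, :]) ** 2).sum(axis=2)
        return np.exp(-d2 / (2.0 * self.width ** 2))

    def fit(self, X, labels, n_classes):
        X = np.asarray(X, dtype=float)
        # Centres: evenly spaced picks from the training set.
        idx = np.linspace(0, len(X) - 1, self.n_centres).astype(int)
        self.centres = X[idx]
        # Common width heuristic: max centre distance / sqrt(2 * number of centres).
        d = np.sqrt(((self.centres[:, None] - self.centres[None]) ** 2).sum(axis=2))
        self.width = d.max() / np.sqrt(2.0 * self.n_centres) + 1e-12
        # Output weights by linear least squares on one-hot targets.
        targets = np.eye(n_classes)[np.asarray(labels)]
        self.weights, *_ = np.linalg.lstsq(self._hidden(X), targets, rcond=None)
        return self

    def predict(self, X):
        return (self._hidden(np.asarray(X, dtype=float)) @ self.weights).argmax(axis=1)
```

In use, each speech frame would be passed through `mfwc(frame, sample_rate)` and the resulting vectors, labelled by speaker, fed to `RBFNetwork(n_centres).fit(features, labels, n_speakers)`. Solving the output layer by least squares rather than gradient descent is one reason RBF networks typically train faster than BP networks, which matches the learning-speed advantage the abstract claims.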