期刊文献+

基于Radon和平移不变性小波变换的鸟类声音识别 被引量:7

Bird sounds recognition based on Radon and translation invariant discrete wavelet transform
下载PDF
导出
摘要 针对低信噪比(SNR)环境下鸟叫声识别率不够高的问题,提出一种对声谱图进行Radon变换(RT)和平移不变性离散小波变换(TIDWT)的抗噪型鸟类声音识别技术。首先,使用改进的多频带谱减法对鸟叫声进行降噪处理;其次,利用短时能量检测降噪后的鸟叫声的静音段,并去除静音段;接着,将去除静音段的声音信号转化为声谱图,并对声谱图进行RT和TIDWT,提取特征值;最后,采用支持向量机(SVM)分类器对提取的特征值进行分类识别。实验结果表明,该方法在信噪比为10 dB及以下仍可以达到较好的识别效果。 To improve the accuracy of bird sounds recognition in low Signal-to-Noise Ratio (SNR) environment, a new bird sounds recognition technology based on Radon Transform (RT) and Translation Invariant Discrete Wavelet Transform (TIDWT) from spectrogram after the noise reduction was proposed. First, an improved multi-band spectral subtraction method was presented to reduce the background noise. Second, short-time energy was used to detect silence of clean bird sound, and the silence was removed. Then, the bird sound was translated into spectrogram, RT and TIDWT were used to extract features. Finally, classification was achieved by Support Vector Machine (SVM) classifier. The experimental results show that the method can achieve better recognition effect even the SNR belows 10 dB.
作者 周晓敏 李应
出处 《计算机应用》 CSCD 北大核心 2014年第5期1391-1396,1417,共7页 journal of Computer Applications
基金 国家自然科学基金资助项目(61075022)
关键词 鸟类声音识别 多频带谱减法 短时能量 RADON变换 平移不变性离散小波变换 特征提取 bird sounds recognition multi-band spectral subtraction method short-time energy Radon Transform (RT) Translation Invariant Discrete Wavelet Transform (TIDWT) feature extraction
  • 相关文献

参考文献16

  • 1NEAL L,BRIGGS F,RAICH R,et al.Time-frequency segmentation of bird song in noisy acoustic environments[C]//Proceedings of the 2011 IEEE International Conference on Acoustics,Speech and Signal Processing.Piscataway:IEEE Press,2011:2012-2015.
  • 2CHOU C H,LIU P H.Bird species recognition by wavelet transformation of a section of birdsong[C]//Proceedings of the 2009 Symposia and Workshops on Ubiquitous,Autonomic and Trusted Computing.Piscataway:IEEE Press,2009:189-193.
  • 3BRIGGS F,RAICH R,FERN X Z.Audio classification of bird species:a statistical manifold approach[C]//Proceedings of the 2009 9th IEEE International Conference on Data Mining.Piscataway:IEEE Press,2009:51-60.
  • 4LEE C H,HAN C C,CHUANG C C.Automatic classification of bird species from their sounds using two-dimensional cepstral coefficients[J].IEEE Transactions on Audio,Speech,and Language Processing,2008,16(8):1541-1550.
  • 5WIELGAT R,SWIETOJANSKI P,POTEMPA T,et al.On using prefiltration in HMM-based bird species recognition[C]//Proceedings of the 2012 International Conference on Signals and Electronic Systems.Piscataway:IEEE Press,2012:1-5.
  • 6JUANG C F,CHEN T M.Birdsong recognition using predictionbased recurrent neural fuzzy networks[J].Neurocomputing,2007,71(1/2/3):121-130.
  • 7黄苏雨,梁声灼,黄苏园.语音增强方法综述[J].计算机与现代化,2007(3):16-20. 被引量:15
  • 8KAMATH S,LOIZOU P.A multi-band spectral subtraction method for enhancing speech corrupted by colored noise[C]//Proceedings of the 2002 International Conference on Acoustics,Speech,and Signal Processing.Piscataway:IEEE Press,2002,4:4164.
  • 9COHEN I,BERDUGO B.Noise estimation by minima controlled recursive averaging for robust speech enhancement[J].IEEE Signal Processing Letters,2002,9(1):12-15.
  • 10李富强,万红,黄俊杰.基于MATLAB的语谱图显示与分析[J].微计算机信息,2005,21(10X):172-174. 被引量:27

二级参考文献41

  • 1BOLL S F.Suppression of acoustics noise in speech using spectral subtraction[J].IEEE Transactions on Acoustics,Speech,and Signal Procession,1979,27(2):113-120.
  • 2BEROUTI M,SCHWARTZ R,MAKHOUL J.Enhancement of speech corrupted by acoustic noise[C] //Proceeding of IEEE1979 ICASSP.[S.l.] :IEEE Press,1979,4:208-211.
  • 3EPHRAIM Y,MALAH D.Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator[J].IEEE Trans.on Speech Signal Process.1984,ASSP-32(6):1109-1121.
  • 4EPHRAIM Y,VAN-TREES H L.A signal suhspace approach for speech enhancement[J].IEEE Transactions on Speech and Audio Processing,1995,3(4):251-266.
  • 5MEDIMA C A,ALCAIM A.Using neural networks wavelet denoising of speech for threshold selection[J].Electronics Letters,2003,37(25):1869-1871.
  • 6MARTIN R.Noise power spectral density estimation based on optimal smoothing and minimum statics[J].IEEE Trans.on Speech Audio Process,2001,9(5):504-512.
  • 7COHEN I.Noise estimation by minima controlled recursive averaging for robust speech enhancement[J].IEEE Trans.on Signal Process Letters,2002,9(1):12-15.
  • 8COHEN I.Noise spectrum estimation in adverse environments:improved minima controlled recursive averageing[J].IEEE Trans.on Speech Audio Process,2003,11(5).466-475.
  • 9RANGACHARI S,LOIZOU C.Noise-estimation algorithm for highly non-stationary environments[J].Speech Communication,2006,48:220-231.
  • 10COHEN I.Speech enhancement using super-Gaussian speech models and noncausal a priori SNR estimation[J].Speech Communication,2005,47:336-350.

共引文献60

同被引文献125

引证文献7

二级引证文献57

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部