期刊文献+

基于APR-SVM的音频分类方法

Audio Classification Based on APR-SVM
下载PDF
导出
摘要 音频分类在多媒体应用中十分广泛,主要有时域分析和频域分析方法。文中提出了一种基于自适应间距比(APR)算法和支持向量机(SVM)算法的音频分类方法,先用APR算法区分语音与非语音;对于非语音,再通过SVM进行音频分类。APR算法是比较PR参数和阈值来区分语音和非语音,它和信噪比密切相关;而将非语音分成四组:音乐,汽车,会议,雨声,提取特征因子。实验结果表明:文中设计的分类器的精度达到93.75%以上,能很好地把各类型音频分开。 Audio classification is widely applied in multimedia applications, which mainly has time domain analysis and frequency domain analysis methods. In this paper,an audio classification method based on APR algorithm and SVM algorithm is proposed,first use the APR algorithm to distinguish between voice and non voice,for non-voice take audio classification by SVM. APR algorithm is to compare the PR parameters and thresholds to distinguish between voice and non voice,is closely related to SNR ,and non-voiee is divided into four groups:music,cars,meeting, rain, extract the feature factor. The experimental results show that:the accuracy of the classifier designed in this paper is to reach over 93.75% ,good separation of various types of audio.
出处 《计算机技术与发展》 2012年第10期59-61,65,共4页 Computer Technology and Development
基金 上海市科技计划重点项目(08240510800)
关键词 音频分类 特征提取 支持向量机 自适应间距比 信噪比 audio classification feature extraction SVM adaptive pitch ratio SNR
  • 相关文献

参考文献13

  • 1Lin C C, Chen S H,Truong T K. Audio Classification and Cat- egorization Based on Wavelets and Support Vector Machine [ J ]. IEEE Transaction on Speech and Audio Processing, 2005,13(5) :644-651.
  • 2Tran H D, Li Haizhou. Jump Function Kolmogorov for Audio Classification in Noise-Mismatch Conditions[ J ]. IEEE trans- actions on signal processing,2009,57(8 ) :2908-2918.
  • 3Wu Chung-Hsien, Hsieh Cilia-Hsin. Multiple change point audio segmentation and classification using an MDL based Gausslan model[ J ]. IEEE Transactions on Audio, Speech and Language Processing,2006,14 (2) :647-657.
  • 4Ghaemmaghami S. Audio segmentation and classification bas- ed on a selective analysis scheme [ C ]//IEEE Multimedia Modeling Conference. [ s. 1. ] : [ s. n. ] ,2004:42-48.
  • 5Ghoraani B, Krishnan S. Time-frequency Matrix Feature Ex- traction and Classification qff Environmental Audio Signals [ J ]. IEEE Transactions on Audio, Speech, and Language Pro- cesslng,2011,19(7 ) :2197-2209.
  • 6Kiranyaz S, Qureshi A F, Gabbouj M. A generic audio classifi- cation and segmentation approach for multimedia indexing and retrieval [ J ]. IEEE TransactJ/ons on Audio, Speech, and Lan- guage Processing,2006,14(3 ) : 1062-1081.
  • 7白亮,老松杨,陈剑赟,吴玲达.音频自动分类中的特征分析和抽取[J].小型微型计算机系统,2005,26(11):2029-2034. 被引量:13
  • 8史东承,韩玲艳,于明会.基于HMM/SVM的音频自动分类[J].长春工业大学学报,2008,29(2):178-182. 被引量:9
  • 9Briggs F, Raich R, Fern X Z. Audio Classification of Bird Spe- cies : A Statistical Manifold Approach [ C ]//IEEE International Conference on Data Mining. [s. 1. ]:Is. n. ] ,2009:51-60.
  • 10Zhang J X, Brooks S. Audio classification based on adaptive partitioning[ C]//IEEE International Conference on Multime- dia and Expo.. [ s. 1. ] : [ s. n. ] ,2009:490-493.

二级参考文献11

  • 1白亮,老松杨,陈剑赟,吴玲达.音频自动分类中的特征分析和抽取[J].小型微型计算机系统,2005,26(11):2029-2034. 被引量:13
  • 2Feiten B, Frank R, Ungvary T. Organization of sounds with neural nets[C]. In: Proceedings of the 1991 international computer music conference, international computer music association. San Francisco:[s.n. ],1991:441-444.
  • 3Foote J T. Content Based retrieval of music and audio[J]. Multimedia Storage and Archiving Systems Ⅱ, 1997,32 (29) : 138-147.
  • 4Li Dongge, Ishwar K. Classification of general audio data for content-based retrieval[J]. Pattern Recognition Letters, 2001,22:533-544.
  • 5Li S Z, Guo Guo-dong. Content--Based audio classification and retrieval using SVM learning[C]. In.. Proceedings of the 1st IEEE pacific-Rim conference on multimedia. [S.l. ] :[s. n. ] ,2000.
  • 6Rabiner L, Juang B H. Fundamentals of speech recognltion[M].[S. l.]: Prentice-Hall International, Inc. ,1993.
  • 7Vapnik V. The nature of statistical learning theory[M].New York:Springer-Verlag, 1995.
  • 8Zhang Tong. Audio content analysis for online audiovisual data segmentation and classification [J].IEEE Trans. On Speech and Audio Processing,2001,96(4):440-457.
  • 9Lu Jiang L, Zhang H J. Content analysis for audio classification and segmentation[J].IEEE Transaction on Speech and Audio Processing, 2002,10 (7) :504-516.
  • 10卢坚,陈毅松,孙正兴,张福炎.语音/音乐自动分类中的特征分析[J].计算机辅助设计与图形学学报,2002,14(3):233-237. 被引量:26

共引文献20

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部