基于APR-SVM的音频分类方法

Audio Classification Based on APR-SVM

下载PDF

导出

摘要音频分类在多媒体应用中十分广泛,主要有时域分析和频域分析方法。文中提出了一种基于自适应间距比(APR)算法和支持向量机(SVM)算法的音频分类方法,先用APR算法区分语音与非语音;对于非语音,再通过SVM进行音频分类。APR算法是比较PR参数和阈值来区分语音和非语音,它和信噪比密切相关;而将非语音分成四组:音乐,汽车,会议,雨声,提取特征因子。实验结果表明:文中设计的分类器的精度达到93.75%以上,能很好地把各类型音频分开。 Audio classification is widely applied in multimedia applications, which mainly has time domain analysis and frequency domain analysis methods. In this paper,an audio classification method based on APR algorithm and SVM algorithm is proposed,first use the APR algorithm to distinguish between voice and non voice,for non-voice take audio classification by SVM. APR algorithm is to compare the PR parameters and thresholds to distinguish between voice and non voice,is closely related to SNR ,and non-voiee is divided into four groups：music,cars,meeting, rain, extract the feature factor. The experimental results show that：the accuracy of the classifier designed in this paper is to reach over 93.75% ,good separation of various types of audio.

作者王晓峰蒋先涛

机构地区上海海事大学信息工程学院

出处《计算机技术与发展》 2012年第10期59-61,65,共4页 Computer Technology and Development

基金上海市科技计划重点项目(08240510800)

关键词音频分类特征提取支持向量机自适应间距比信噪比 audio classification feature extraction SVM adaptive pitch ratio SNR

分类号 TP315 [自动化与计算机技术—计算机软件与理论]

引文网络
相关文献

参考文献13

1Lin C C, Chen S H,Truong T K. Audio Classification and Cat- egorization Based on Wavelets and Support Vector Machine [ J ]. IEEE Transaction on Speech and Audio Processing, 2005,13(5) :644-651.
2Tran H D, Li Haizhou. Jump Function Kolmogorov for Audio Classification in Noise-Mismatch Conditions[ J ]. IEEE trans- actions on signal processing,2009,57(8 ) :2908-2918.
3Wu Chung-Hsien, Hsieh Cilia-Hsin. Multiple change point audio segmentation and classification using an MDL based Gausslan model[ J ]. IEEE Transactions on Audio, Speech and Language Processing,2006,14 (2) :647-657.
4Ghaemmaghami S. Audio segmentation and classification bas- ed on a selective analysis scheme [ C ]//IEEE Multimedia Modeling Conference. [ s. 1. ] : [ s. n. ] ,2004:42-48.
5Ghoraani B, Krishnan S. Time-frequency Matrix Feature Ex- traction and Classification qff Environmental Audio Signals [ J ]. IEEE Transactions on Audio, Speech, and Language Pro- cesslng,2011,19(7 ) :2197-2209.
6Kiranyaz S, Qureshi A F, Gabbouj M. A generic audio classifi- cation and segmentation approach for multimedia indexing and retrieval [ J ]. IEEE TransactJ/ons on Audio, Speech, and Lan- guage Processing,2006,14(3 ) : 1062-1081.
7白亮,老松杨,陈剑赟,吴玲达.音频自动分类中的特征分析和抽取[J].小型微型计算机系统,2005,26(11):2029-2034. 被引量：13
8史东承,韩玲艳,于明会.基于HMM/SVM的音频自动分类[J].长春工业大学学报,2008,29(2):178-182. 被引量：9
9Briggs F, Raich R, Fern X Z. Audio Classification of Bird Spe- cies : A Statistical Manifold Approach [ C ]//IEEE International Conference on Data Mining. [s. 1. ]:Is. n. ] ,2009:51-60.
10Zhang J X, Brooks S. Audio classification based on adaptive partitioning[ C]//IEEE International Conference on Multime- dia and Expo.. [ s. 1. ] : [ s. n. ] ,2009:490-493.

二级参考文献11

1白亮,老松杨,陈剑赟,吴玲达.音频自动分类中的特征分析和抽取[J].小型微型计算机系统,2005,26(11):2029-2034. 被引量：13
2Feiten B, Frank R, Ungvary T. Organization of sounds with neural nets[C]. In: Proceedings of the 1991 international computer music conference, international computer music association. San Francisco:[s.n. ],1991:441-444.
3Foote J T. Content Based retrieval of music and audio[J]. Multimedia Storage and Archiving Systems Ⅱ, 1997,32 (29) : 138-147.
4Li Dongge, Ishwar K. Classification of general audio data for content-based retrieval[J]. Pattern Recognition Letters, 2001,22:533-544.
5Li S Z, Guo Guo-dong. Content--Based audio classification and retrieval using SVM learning[C]. In.. Proceedings of the 1st IEEE pacific-Rim conference on multimedia. [S.l. ] :[s. n. ] ,2000.
6Rabiner L, Juang B H. Fundamentals of speech recognltion[M].[S. l.]: Prentice-Hall International, Inc. ,1993.
7Vapnik V. The nature of statistical learning theory[M].New York:Springer-Verlag, 1995.
8Zhang Tong. Audio content analysis for online audiovisual data segmentation and classification [J].IEEE Trans. On Speech and Audio Processing,2001,96(4):440-457.
9Lu Jiang L, Zhang H J. Content analysis for audio classification and segmentation[J].IEEE Transaction on Speech and Audio Processing, 2002,10 (7) :504-516.
10卢坚,陈毅松,孙正兴,张福炎.语音/音乐自动分类中的特征分析[J].计算机辅助设计与图形学学报,2002,14(3):233-237. 被引量：26

共引文献20

1于俊清,崔玉强,何云峰.足球比赛中的音频信息提取与自动分类[J].华中科技大学学报（自然科学版）,2007,35(10):35-38. 被引量：1
2杨圣云,袁德辉,赖国明.基于串核的音乐风格聚类[J].计算机工程与设计,2008,29(3):687-689.
3朱映映,明仲,周景洲.一种面向基于内容视频检索的音频场景分割方法[J].小型微型计算机系统,2008,29(3):557-562.
4史东承,韩玲艳,于明会.基于HMM/SVM的音频自动分类[J].长春工业大学学报,2008,29(2):178-182. 被引量：9
5杨圣云,赖国明,袁德辉.基于串核的音乐分类研究[J].计算机工程与应用,2008,44(16):243-245. 被引量：1
6张小梅,杨鼎才.基于支持向量机模型的环境音分类研究[J].电子测量技术,2008,31(9):121-123. 被引量：4
7李志忠,滕光辉.基于改进MFCC的家禽发声特征提取方法[J].农业工程学报,2008,24(11):202-205. 被引量：24
8张新彩,张德同,耿国华,王小凤,吴江.基于PCA和CHMM的音频自动分类[J].计算机应用研究,2009,26(4):1257-1259. 被引量：4
9史东承,刘玮,梁超.语音通信质量评价方法的研究[J].长春工业大学学报,2009,30(2):206-209. 被引量：1
10梁超.一种基于Gammatone滤波的语音质量评价算法[J].长春工业大学学报,2010,31(4):432-436. 被引量：1

1金贞华.《音》系列之“回响”[J].陶瓷研究,2015,30(5):40-41.
2赵芳,方贤文,方欢.Petri网动态切片的最小变化域分析方法[J].计算机科学与探索,2016,10(4):516-523. 被引量：1
3且听雨声[J].微型计算机,2016,0(23):11-11.
4邓健.基于稳定域的网络控制系统分析[J].企业科技与发展,2011(9):12-14.
5coolmouse.“好色”电脑特训攻略·桌面气象台[J].计算机应用文摘,2003(13):58-59.
6王德祥.留得残荷听雨声[J].电脑界（电脑高手）,2000(8):48-51.
7微博上在传什么？[J].计算机应用文摘,2013(16):87-87.
8赵凌云.走进台北的冬季[J].歌曲,2014,0(5):93-93.
9马秀奇,李国良.剪辑视频玩转会声会影的声音效果[J].电脑迷,2010(9):58-59. 被引量：1
10吴辉.巧妙添声:让“哑巴开口说话”[J].发明与创新（大科技）,2010(2):20-20.

计算机技术与发展

2012年第10期

浏览历史

内容加载中请稍等...

基于APR-SVM的音频分类方法

参考文献13

二级参考文献11

共引文献20

相关作者

相关机构

相关主题

浏览历史