期刊文献+

基于双重数据增强策略的音频分类方法 被引量:3

Audio classification using double data augmentation strategy
下载PDF
导出
摘要 卷积神经网络模型作为音频特征提取器具有较好的应用效果,但该类模型的训练过程对数据量要求比较高。针对这一问题,本文提出一种基于双重数据增强策略的音频分类方法。首先采用传统音频数据增强方法(旋转、调音、变调、加噪),并将增强后的数据转化为语谱图,再采用随机均值替换法进行谱图增强。在此基础上训练Inception_Resnet_V2神经网络模型作为音频特征提取器,最后训练随机森林模型作为分类器完成音频分类任务。实验结果表明,与已有方法相比,采用双重数据增强策略可明显提升音频分类精度,并且训练出的特征提取模型具有较强的泛化能力。 Convolutional neural network(CNN)model has been proved to be very effective in extracting audio features,but its training process requires a large amount of data.To solve this problem,an audio classification method using double data augmentation strategy is proposed.Firstly,the traditional audio data expansion approaches(rotation,tuning,frequency modulation and adding noise)are adopted and then the expanded data are transformed into spectrogram.The next step is to enhance the spectrogram by random mean replacement.On the basis of doubly enhanced data,Inception_Resnet_V2 neural network model is trained as audio feature extractor.Finally,the random forest model is trained as classifier to complete the audio classification.The experimental results show that,compared with the existing methods,the double data augmentation strategy can significantly improve the accuracy of audio classification,and the trained model for feature extraction has strong generalization ability.
作者 周迅 张晓龙 Zhou Xun;Zhang Xiaolong(College of Computer Science and Technology,Wuhan University of Science and Technology,Wuhan 430065,China;Big Data Science and Engineering Research Institute,Wuhan University of Science and Technology,Wuhan 430065,China;Hubei Province Key Laboratory of Intelligent Information Processing and Real-time Industrial System,Wuhan University of Science and Technology,Wuhan 430065,China)
出处 《武汉科技大学学报》 CAS 北大核心 2020年第2期155-160,共6页 Journal of Wuhan University of Science and Technology
基金 国家自然科学基金资助项目(U1803262,61702381).
关键词 音频分类 双重数据增强 卷积神经网络 特征提取 随机森林 语谱图 audio classification double data augmentation convolutional neural network feature extraction random forest spectrogram
  • 相关文献

参考文献4

二级参考文献115

共引文献49

同被引文献29

引证文献3

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部