期刊文献+

D-2-DenseNet噪音鲁棒的城市音频分类模型

Noise Robust Urban Audio Classification Based on 2-Order Dense Convolutional Network Using Dual Features
原文传递
导出
摘要 为了提高噪音环境下城市音频分类系统的鲁棒性,提出了一种双特征2阶密集卷积神经网络(D-2-DenseNet)噪音鲁棒的城市音频分类模型.首先介绍了噪音添加和噪音鲁棒处理,阐述了一种双特征互补偿的算法;然后结合2阶密集卷积神经网络与自适应机制提出了一种噪音鲁棒音频分类模型:双特征2阶密集卷积神经网络.模型采用双特征互补偿自适应算法,可在特征提取与模型训练中更有针对性地提取有效音频信息,降低噪音干扰,以提高噪音鲁棒性.最后,基于Dcase2016数据集开展噪音环境下城市音频分类测试.实验结果表明,模型分类准确率分别可达77.12%、75.52%,与基线模型相比,平均分类准确率分别提高了8.51%和10.38%,验证了模型良好的噪音鲁棒性. A noise robust urban sound event classification model based on 2-order dense convolutional network using dual features (D-2-DenseNet) is proposed,which aims at the problems of insufficient robustness of current models. Firstly,the brief introduction of the method of noise adding and robust processing is presented. Moreover,a dual feature mutual compensation algorithm and 2-order dense convolutional network is presented. Meanwhile,a noise robust urban sound event classification model based on 2-DenseNet using dual features,i. e. D-2-DenseNet is proposed. Theoretically,D-2-DenseNet combines the advantages of feature compensation and 2-order dense convolutional neural network. The dual feature mutual compensation adaptive algorithm can effectively extract audio information and reduce noise interference to improve noise robustness. Finally,in order to validate advantages of the D-2-DenseNet,this new model is exploited in the urban sound event classification based on Dcase2016 datasets. Under conditions of channel noise and environmental noise,the experiment shows that the accuracy of the network is respectively 77. 12% and 75. 52%,which has added 8. 51% and 10. 38% compared with baseline. The noise robustness of D-2-DenseNet are also effectively verified.
作者 曹毅 黄子龙 盛永健 刘晨 费鸿博 CAO Yi;HUANG Zi-long;SHENG Yong-jian;LIU Chen;FEI Hong-bo(School of Mechanical Engineering,Jiangnan University,Wuxi 214122,China)
出处 《北京邮电大学学报》 EI CAS CSCD 北大核心 2021年第1期86-91,共6页 Journal of Beijing University of Posts and Telecommunications
基金 国家自然科学基金项目(51375209) 江苏省“六大人才高峰”计划项目(ZBZZ-012) 高等学校学科创新引智计划项目(B18027) 江苏省研究生创新计划项目(JNKY20_1928)。
关键词 城市音频分类 噪音鲁棒性 双特征互补偿 2阶密集卷积神经网络 双特征2阶密集卷积神经网络 sound event classification noise robust dual features mutual compensation 2-order dense convolutional network 2-order dense convolutional network using dual features
  • 相关文献

参考文献6

二级参考文献44

  • 1赵金山,狄增如,王大辉.北京市公共汽车交通网络几何性质的实证研究[J].复杂系统与复杂性科学,2005,2(2):45-48. 被引量:45
  • 2陆化普,石冶.Complexity of Public Transport Networks[J].Tsinghua Science and Technology,2007,12(2):204-213. 被引量:13
  • 3李英,周伟,郭世进.上海公共交通网络复杂性分析[J].系统工程,2007,25(1):38-41. 被引量:65
  • 4Gong Y. Speech recognition in noisy environments: A sur- vey. Speech Communication, 1995; 16:261--291.
  • 5Huang X, Hon H W. Spoken Language Processing: A Guide to Theory, Algorithm and System Development. Prentice Hall PTR, 2001.
  • 6Moreno P. Speech recognition in noisy environments. Ph.D. thesis, Carnegie Mellon University, 1996.
  • 7Gales M J F. The generation and use of regression class trees for MLLR adaptation. Cambridge University, Tech. Rep. CUED/FINFENG/TR263, 1996.
  • 8Varga A, Moore R. Hidden Markov model decomposition of speech and noise. ICASSP, 1990; 2:845--848.
  • 9Ghitza O. Temporal non-plaze information in the auditory- nerve firing patterns as a front-end for speech recognition in a noisy environment. Journal of Phonetics, 1988; 16: 109--123.
  • 10Gajic B, Paliwal K K. Robust speech recognition in noisy environments based on subband spectral centroid his- tograms. IEEE Trans. Audio, Speech, and Language Pro- cessing, 2006; 14:600----608.

共引文献240

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部