用于声音分类的Deep LightGBM算法

Sound classification using Deep LightGBM algorithm

下载PDF

导出

摘要在医学诊断、场景分析、语音识别、生态环境分析等方面语音分类都有着广泛的应用价值。传统的语音分类器采用的是神经网络。但是在精确度,模型设置,参数调整和资料的预处理等方面,有较大的缺陷。在这一基础上,文章提出了一种以“深度森林”为基础的改进方法——LightGBM的深度学习模型(Deep LightGBM模型)。它能够在保证模型简洁的前提下,提高分类精度和泛化能力。该算法有效降低了参数依赖性。在UrbanSound8K这一数据集中,采用向量方法进行语音特征的提取,其分类精确度达95.84%。将卷积神经网络(Convolutional Neural Network, CNN)抽取的特征和向量法获取的特征进行融合,并利用新的模型进行训练,其准确率可达97.67%。实验证明,此算法采用的特征提取方式与Deep LightGBM配合获得的模型参数调整容易,精度高,不会产生过度拟合,并且泛化能力好。 Applications for sound classification include voice identification, scene analysis, medical diagnosis,ecological environment study, and more. Neural networks, which are mostly used in traditional sound classification methods, have clear limitations in accuracy, model setting, parameter modification, and data pre-processing. Based on this, a Deep LightGBM model is developed, which is an upgraded LightGBM Deep learning model that successfully increases classification accuracy and generalization capacity while maintaining the model’s simplicity and lowering the degree of parameter dependence of the method. The suggested model achieves the accuracy of 95.84% on the UrbanSound8K dataset when sound features are extracted by using the vector approach. Accuracy of 97.67% is attained by combining the vector features with the CNN-extracted features before training the new model. The experimental findings demonstrate that the Deep LightGBM model and the implemented sound feature extraction approach have high accuracy, no over-fitting, and good generalization performance.

作者李行健汤心溢张瑞 LI Xingjian;TANG Xinyi;ZHANG Rui(Key Laboratory of Infrared System Detection and Imaging Technology,Shanghai Institute of Technical Physics,Chinese Academy of Sciences,Shanghai 200083,China;School of Information Science and Technology,Shanghaitech University,Shanghai 201210,China;University of Chinese Academy of Sciences,Beijing 100049,China)

机构地区中国科学院上海技术物理研究所红外探测与成像技术重点实验室上海科技大学信息科学与技术学院中国科学院大学

出处《声学技术》 CSCD 北大核心 2022年第6期871-877,共7页 Technical Acoustics

关键词声音分类 LightGBM算法深度森林特征融合特征提取 sound classification LightGBM algorithm Deep forest feature fusion feature extraction

分类号 TP391.42 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

1邓让社.高中数学解题中向量法的运用[J].数理天地（高中版）,2023(3):4-5. 被引量：2
2胡明辉.例谈向量法在几何问题中的运用[J].中学数学教学参考,2022(36):47-48.
3陈始舟.5G技术在智慧高速公路中的应用场景分析[J].通讯世界,2022,29(9):15-17.
4梁静,文奕.知识图谱在医学辅助诊断中的应用研究[J].医学信息学杂志,2022,43(11):34-40. 被引量：4
5妥斯根.弥尔五法在传统医学诊断中的应用研究[J].包头医学院学报,2023,39(1):61-66.
6庄维嘉,谭文安,林瑞钦,郝宵.GA-LightGBM模型及其在车辆保险需求预测中应用[J].上海第二工业大学学报,2022,39(4):339-346. 被引量：1
7白宁,范利波.面向算力网络的航天智慧云服务架构与场景分析[J].无线互联科技,2022,19(24):37-42.
8苗超,杨旭.智慧广电光纤入户线路改造场景分析[J].广播电视网络,2023,30(2):88-90. 被引量：1
9张丹丹.论放射医学技术在临床工作中的意义[J].中国科技期刊数据库医药,2023(1):68-71.
10刘鹏.基于神经网络的图像分类模型的设计与实现[J].无线互联科技,2022,19(22):53-55.

声学技术

2022年第6期

浏览历史

内容加载中请稍等...

用于声音分类的Deep LightGBM算法

相关作者

相关机构

相关主题

浏览历史