An Efficient Approach for Segmentation, Feature Extraction and Classification of Audio Signals

An Efficient Approach for Segmentation, Feature Extraction and Classification of Audio Signals

下载PDF

导出

摘要 Due to the presence of non-stationarities and discontinuities in the audio signal, segmentation and classification of audio signal is a really challenging task. Automatic music classification and annotation is still considered as a challenging task due to the difficulty of extracting and selecting the optimal audio features. Hence, this paper proposes an efficient approach for segmentation, feature extraction and classification of audio signals. Enhanced Mel Frequency Cepstral Coefficient (EMFCC)-Enhanced Power Normalized Cepstral Coefficients (EPNCC) based feature extraction is applied for the extraction of features from the audio signal. Then, multi-level classification is done to classify the audio signal as a musical or non-musical signal. The proposed approach achieves better performance in terms of precision, Normalized Mutual Information (NMI), F-score and entropy. The PNN classifier shows high False Rejection Rate (FRR), False Acceptance Rate (FAR), Genuine Acceptance rate (GAR), sensitivity, specificity and accuracy with respect to the number of classes. Due to the presence of non-stationarities and discontinuities in the audio signal, segmentation and classification of audio signal is a really challenging task. Automatic music classification and annotation is still considered as a challenging task due to the difficulty of extracting and selecting the optimal audio features. Hence, this paper proposes an efficient approach for segmentation, feature extraction and classification of audio signals. Enhanced Mel Frequency Cepstral Coefficient (EMFCC)-Enhanced Power Normalized Cepstral Coefficients (EPNCC) based feature extraction is applied for the extraction of features from the audio signal. Then, multi-level classification is done to classify the audio signal as a musical or non-musical signal. The proposed approach achieves better performance in terms of precision, Normalized Mutual Information (NMI), F-score and entropy. The PNN classifier shows high False Rejection Rate (FRR), False Acceptance Rate (FAR), Genuine Acceptance rate (GAR), sensitivity, specificity and accuracy with respect to the number of classes.

作者 Muthumari Arumugam Mala Kaliappan Muthumari Arumugam;Mala Kaliappan(Department of Computer Science and Engineering, University College of Engineering, Ramanathapuram, India;Department of Computer Science and Engineering, Mepco Schlenk Engineering College, Sivakasi, India)

机构地区 Department of Computer Science and Engineering Department of Computer Science and Engineering

出处《Circuits and Systems》 2016年第4期255-279,共25页 电路与系统（英文）

关键词 Audio Signal Enhanced Mel Frequency Cepstral Coefficient (EMFCC) Enhanced Power Normalized Cepstral Coefficients (EPNCC) Probabilistic Neural Network (PNN) Classifier Audio Signal Enhanced Mel Frequency Cepstral Coefficient (EMFCC) Enhanced Power Normalized Cepstral Coefficients (EPNCC) Probabilistic Neural Network (PNN) Classifier

分类号 TN9 [电子电信—信息与通信工程]

引文网络
相关文献

1曹梦婷,谷玉海,王红军,徐小力.基于GRU与迁移学习的滚动轴承故障诊断[J].现代制造工程,2022(1):143-147. 被引量：4
2郭绍陶,苑玮琦.基于双高斯纹理滤波模板和极值点韦伯对比度的圆柱锂电池凹坑缺陷检测[J].电子学报,2022,50(3):637-642. 被引量：3
3逄英,高军伟.基于ICEEMDAN能量矩和MFOA-PNN的轴承故障诊断[J].现代制造工程,2022(3):122-126. 被引量：5
4Yongqiang Bao,Qi Shao,Xuxu Zhang,Jiahui Jiang,Yue Xie,Tingting Liu,Weiye Xu.A Novel System for Recognizing Recording Devices from Recorded Speech Signals[J].Computers, Materials & Continua,2020(12):2557-2570.
5成兴保,程永强,张博.基于AIA和PNN的路基健康监测[J].电子设计工程,2022,30(3):163-168. 被引量：1
6Yuxue XU,Yun WANG,Tianhong YAN,Yuchen HE,Jun WANG,De GU,Haiping DU,Weihua LI.Quality-related locally weighted soft sensing for non-[J].Frontiers of Information Technology & Electronic Engineering,2021,22(9):1234-1246.
7张一弓,易茜,李剑,李聪波,尹爱军,易树平.鼠标行为HHT变换的工业互联网用户身份认证[J].物联网学报,2022,6(2):77-87. 被引量：2
8甘海林,雷震春,杨印根.孪生Bi-LSTM模型在语音欺骗检测中的研究[J].小型微型计算机系统,2022,43(6):1265-1271. 被引量：2
9崔潇,夏秀渝.基于MRACC特征的鲁棒说话人识别研究[J].智能计算机与应用,2021,11(10):61-66.
10邵睿,彭硕,查文文,陈成鹏,辜丽川,焦俊.基于BiLSTM的生猪音频识别[J].合肥学院学报（综合版）,2022,39(2):113-119. 被引量：2

Circuits and Systems

2016年第4期

浏览历史

内容加载中请稍等...

An Efficient Approach for Segmentation, Feature Extraction and Classification of Audio Signals

相关作者

相关机构

相关主题

浏览历史