期刊文献+

An Efficient Approach for Segmentation, Feature Extraction and Classification of Audio Signals

An Efficient Approach for Segmentation, Feature Extraction and Classification of Audio Signals
下载PDF
导出
摘要 Due to the presence of non-stationarities and discontinuities in the audio signal, segmentation and classification of audio signal is a really challenging task. Automatic music classification and annotation is still considered as a challenging task due to the difficulty of extracting and selecting the optimal audio features. Hence, this paper proposes an efficient approach for segmentation, feature extraction and classification of audio signals. Enhanced Mel Frequency Cepstral Coefficient (EMFCC)-Enhanced Power Normalized Cepstral Coefficients (EPNCC) based feature extraction is applied for the extraction of features from the audio signal. Then, multi-level classification is done to classify the audio signal as a musical or non-musical signal. The proposed approach achieves better performance in terms of precision, Normalized Mutual Information (NMI), F-score and entropy. The PNN classifier shows high False Rejection Rate (FRR), False Acceptance Rate (FAR), Genuine Acceptance rate (GAR), sensitivity, specificity and accuracy with respect to the number of classes. Due to the presence of non-stationarities and discontinuities in the audio signal, segmentation and classification of audio signal is a really challenging task. Automatic music classification and annotation is still considered as a challenging task due to the difficulty of extracting and selecting the optimal audio features. Hence, this paper proposes an efficient approach for segmentation, feature extraction and classification of audio signals. Enhanced Mel Frequency Cepstral Coefficient (EMFCC)-Enhanced Power Normalized Cepstral Coefficients (EPNCC) based feature extraction is applied for the extraction of features from the audio signal. Then, multi-level classification is done to classify the audio signal as a musical or non-musical signal. The proposed approach achieves better performance in terms of precision, Normalized Mutual Information (NMI), F-score and entropy. The PNN classifier shows high False Rejection Rate (FRR), False Acceptance Rate (FAR), Genuine Acceptance rate (GAR), sensitivity, specificity and accuracy with respect to the number of classes.
作者 Muthumari Arumugam Mala Kaliappan Muthumari Arumugam;Mala Kaliappan(Department of Computer Science and Engineering, University College of Engineering, Ramanathapuram, India;Department of Computer Science and Engineering, Mepco Schlenk Engineering College, Sivakasi, India)
出处 《Circuits and Systems》 2016年第4期255-279,共25页 电路与系统(英文)
关键词 Audio Signal Enhanced Mel Frequency Cepstral Coefficient (EMFCC) Enhanced Power Normalized Cepstral Coefficients (EPNCC) Probabilistic Neural Network (PNN) Classifier Audio Signal Enhanced Mel Frequency Cepstral Coefficient (EMFCC) Enhanced Power Normalized Cepstral Coefficients (EPNCC) Probabilistic Neural Network (PNN) Classifier
  • 相关文献

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部