期刊文献+

基于神经网络的5种音频分类

5 types Audio Classification Based on Nueron Networks
下载PDF
导出
摘要 本文将音频信号的MFCC参数作为特征向量,并使用前馈型神经网络对语音、音乐、语音+音乐、环境音响、静音5类音频进行分类,取得了平均92%的正确率。 In this paper,MFCC is abstracted as feature vectors of the audio signals,and the Nueron Networks is chosen to classify five audio documents:speech, music, speech+music, background and silence. The experimental results show that Nueron Networks is excellent for classification of audio documents,and the average classification accuracy is up to 92%.
作者 刘乔辉
出处 《中国西部科技》 2008年第9期16-17,15,共3页 Science and Technology of West China
关键词 神经网络 音频 分类 Neural Networks Audio Categories
  • 相关文献

参考文献2

二级参考文献9

  • 1[1]Hao Jiang, Tony Lin, Hongjiang Zhang. Video segmentation with the support of audio segmentation and classification[C]. In: Proceedings of ICME'2000-IEEE International Conference on Multimedia and Expo, New York, 2000,3:1507~1510
  • 2[2]Tong Zhang, C-C Jay Kuo. Heuristic approach for generic audio data segmentation and annotation[C]. In: Proceedings of the 7 th ACM International Conference on Multimedia, Orlando, 1999. 67~76
  • 3[3]Savitha Srinivasan, Dragutin Petkovic, Dulce Ponceleon. Towards robust features for classifying audio in the cudeVideo system[C]. In: Proceedings of the 7th ACM International Conference on Multimedia, Orlando, 1999. 393~400
  • 4[4]Guojun Lu, Templar Hankinson. A technique towards automatic audio classification and retrieval[C]. In: Proceedings of the 4th IEEE International Conference on Signal Processing, ICSP 1998, Beijing, 1998,2:1142~1145
  • 5[5]L Rabiner, B H Juang. Fundamentals of Speech Recognition[M]. New Jersey: Prentice-Hall International, 1993
  • 6[6]Rivarol Vergin, Douglas O'Shaughnessay. Generalized mel-frequency cepstral coefficients for large-vocabulary speaker-independent continuous speech recognition[J]. IEEE Transactions on Speech and Audio Processing, 1999, 7(5):525~53
  • 7[7]J T Foote. Content-based retrieval of music and audio[C]. C-C J Kuo, et al. editor. In: Proceedings of SPIE, Multimedia Storage and Archiving Systems II, 1997, 32(29):138~147
  • 8[8]Stan Z Li. Content-based classification and retrieval of audio using the nearest feature line method[J]. IEEE Transactions on Speech and Audio Processing, 2000, 8(5):619~625
  • 9[9]Stan Z Li, GuoDong Guo. Content-based audio classification and retrieval using SVM learning[C]. Invited paper. The Special Session on Multimedia Information Indexing and Retrieval. In: Proceedings of the First IEEE Pacific-Rim Conference on Multimedia, University of Sydney, Australia, 2000. 13~15

共引文献25

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部