Automatic Audio Classification Based on EMGD_HMM (cited by: 3)
Abstract: Automatic audio classification is one of the important means of structuring audio and extracting the semantics of audio content, and it is an active research topic in content-based audio retrieval. After examining the characteristics of audio data, and to address the shortcoming that the left-right density hidden Markov model (left-right DHMM) cannot adequately reflect the way states recur in audio, the paper proposes a classifier based on the ergodic mixed Gaussian density hidden Markov model (EMGD_HMM) and applies it to the classification of speech, music, and their mixture. The experimental results show that EMGD_HMM achieves better classification accuracy than left-right DHMM.
Authors: WANG Chao (王超), WU Yafeng (吴亚锋)
Source: Audio Engineering (《电声技术》), 2007, No. 11, pp. 52-54, 60 (4 pages in total)
Keywords: automatic audio classification; left-right DHMM; EMGD_HMM; Mel frequency cepstrum coefficient
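The abstract's contrast between the left-right and ergodic topologies can be illustrated with a minimal sketch of the two transition structures. It is not the authors' implementation: the function names, the number of states, and the probability values (a 0.6 self-transition, uniform ergodic rows) are assumptions chosen only for illustration, and the Gaussian mixture emission densities are omitted. The point it shows is that a left-right model cannot move back to an earlier state, whereas an ergodic model can revisit any state.

```python
def left_right_transitions(n_states, stay=0.6):
    """Left-right topology: a state may only repeat or advance to the next one.

    The value of `stay` is an illustrative assumption, not taken from the paper.
    """
    A = [[0.0] * n_states for _ in range(n_states)]
    for i in range(n_states - 1):
        A[i][i] = stay
        A[i][i + 1] = 1.0 - stay
    A[-1][-1] = 1.0  # the last state can only repeat
    return A


def ergodic_transitions(n_states):
    """Ergodic topology: every state can move to every state in one step.

    Uniform rows are an illustrative assumption; a trained model would
    estimate these probabilities from data.
    """
    p = 1.0 / n_states
    return [[p] * n_states for _ in range(n_states)]


def reachable(A, src, dst):
    """Return True if dst can be reached from src in one or more steps."""
    n = len(A)
    seen, frontier = set(), {src}
    while frontier:
        # States with a nonzero transition from the frontier, not yet visited.
        nxt = {j for i in frontier for j in range(n) if A[i][j] > 0.0} - seen
        seen |= nxt
        frontier = nxt
    return dst in seen


if __name__ == "__main__":
    A_lr = left_right_transitions(3)
    A_er = ergodic_transitions(3)
    # Can the model go from the last state back to the first one?
    print("left-right:", reachable(A_lr, 2, 0))
    print("ergodic:   ", reachable(A_er, 2, 0))
```

In this sketch the left-right matrix is upper bidiagonal, so a sound pattern that reappears later in a clip cannot be mapped back onto an earlier state, which is the limitation the abstract attributes to left-right DHMM.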
