Recognition of Multiple Audio Clip Classes Based on FBM and Ada Boosting (cited by: 8)
Abstract: A novel method for audio feature extraction and recognition based on fractional Brownian motion (FBM) is presented. The FBM model is used to compute the fractal dimension of each audio clip, and this dimension serves as the clip's audio fractal feature. Because audio fractal features follow a Gaussian distribution, the Ada Boosting algorithm is used for feature reduction. Two classifiers, an Ada-weighted Gaussian classifier and a support vector machine, are then each applied to classify audio using the reduced features, and a multi-class classification model is built on top of the two-class classifiers. Experimental data show that the audio fractal feature achieves better performance than other audio features for music and speech classification.
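The abstract does not spell out how the fractal dimension is computed. As an illustration only, the sketch below uses one common FBM-based estimator: for fractional Brownian motion the mean squared increment at lag k scales as k^(2H), and the graph of a one-dimensional trace has dimension D = 2 - H. The function names `fbm_fractal_dimension` and `random_walk`, the `max_lag` parameter, and the synthetic test signals are assumptions made for this example, not details taken from the paper.

```python
import math
import random


def fbm_fractal_dimension(signal, max_lag=16):
    """Estimate the fractal dimension of a 1-D signal under an FBM assumption.

    For FBM, E[(x[t+k] - x[t])^2] is proportional to k^(2H), so the slope of
    log(mean squared increment) against log(k) is 2H. The graph of a 1-D FBM
    trace then has dimension D = 2 - H.
    """
    log_lags, log_vars = [], []
    for lag in range(1, max_lag + 1):
        diffs = [signal[i + lag] - signal[i] for i in range(len(signal) - lag)]
        mean_sq = sum(d * d for d in diffs) / len(diffs)
        if mean_sq > 0:  # skip degenerate lags so log() is defined
            log_lags.append(math.log(lag))
            log_vars.append(math.log(mean_sq))

    # Ordinary least-squares slope of log_vars against log_lags.
    n = len(log_lags)
    mean_x = sum(log_lags) / n
    mean_y = sum(log_vars) / n
    sxx = sum((x - mean_x) ** 2 for x in log_lags)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(log_lags, log_vars))
    hurst = 0.5 * sxy / sxx
    return 2.0 - hurst


def random_walk(n, seed=0):
    """Hypothetical test signal: ordinary Brownian motion (H = 0.5)."""
    rng = random.Random(seed)
    x, out = 0.0, []
    for _ in range(n):
        x += rng.gauss(0.0, 1.0)
        out.append(x)
    return out


if __name__ == "__main__":
    walk = random_walk(20000)
    rng = random.Random(1)
    noise = [rng.gauss(0.0, 1.0) for _ in range(20000)]
    print("random walk D:", round(fbm_fractal_dimension(walk), 3))
    print("white noise D:", round(fbm_fractal_dimension(noise), 3))
```

On a synthetic random walk the estimate should land near 1.5, and on white noise near 2. In a pipeline like the one the abstract describes, such a value would be computed per audio clip (or per frame) before feature reduction and classification.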
Source: Journal of Computer Research and Development (《计算机研究与发展》), indexed in EI, CSCD, and the Peking University Core Journals list; 2003, No. 7, pp. 941-949 (9 pages).
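Funding: National Natural Science Foundation of China (60272031); Key Project of the Natural Science Foundation of Zhejiang Province (ZD0212); Key Scientific Research Project of the Zhejiang Provincial Science and Technology Plan (2003C21010).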
Keywords: fractional Brownian motion (FBM); audio fractal dimension; audio fractal feature; feature reduction
