Speech/Music Classification Model Based on Beat Spectrum
Abstract: Speech/music classification is an important research direction in speech signal processing. Earlier methods distinguish speech from music by extracting feature parameters such as short-time energy and short-time amplitude, and overlook the fact that music has a beat. To address this, a speech/music classification model based on the beat spectrum is proposed. For the two signal classes, speech and music, the model first preprocesses the signal to be classified and computes its Mel-frequency cepstral coefficients (MFCCs). It then computes the similarity matrix of the MFCCs and the autocorrelation of that similarity matrix, which yields the beat spectrum of the signal. Finally, the signal class is decided by comparison against a threshold. Experimental results show that, compared with traditional classification models, the proposed model raises classification accuracy to 98%.
Authors: ZHENG Qing-jie (郑清杰); LONG Hua (龙华); SHAO Yu-bin (邵玉斌); DU Qing-zhi (杜庆治) (Kunming University of Science and Technology, Kunming, Yunnan 650000, China)
Affiliation: Kunming University of Science and Technology
Source: Communications Technology (《通信技术》), 2020, No. 11, pp. 2675-2679 (5 pages)
Keywords: speech and music; classification; autocorrelation; threshold; classifier
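
The pipeline summarized in the abstract (MFCCs, similarity matrix, autocorrelation, beat spectrum, threshold) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not give the similarity measure, the exact autocorrelation form, or the decision statistic, so cosine similarity, a row-wise autocorrelation, and a peak-to-lag-zero ratio are assumed here. The function names, `min_lag`, and the threshold value 0.5 are hypothetical. Preprocessing and MFCC extraction are taken as already done, so the input is an array of shape (frames, coefficients).

```python
import numpy as np


def similarity_matrix(features):
    """Cosine similarity between every pair of frames.

    features: array of shape (n_frames, n_coeffs), e.g. one MFCC vector per frame.
    Cosine similarity is an assumption; the abstract does not name the measure.
    """
    norms = np.linalg.norm(features, axis=1, keepdims=True)
    unit = features / np.maximum(norms, 1e-12)  # guard against all-zero frames
    return unit @ unit.T


def beat_spectrum(sim, max_lag):
    """Row-wise autocorrelation of the similarity matrix, averaged over all rows."""
    n = sim.shape[0]
    spectrum = np.empty(max_lag)
    for lag in range(max_lag):
        # Correlate each row with a copy of itself shifted by `lag` columns.
        products = sim[:, : n - lag] * sim[:, lag:]
        spectrum[lag] = products.sum() / (n * (n - lag))
    return spectrum


def beat_strength(spectrum, min_lag=1):
    """Hypothetical decision statistic: largest value at lag >= min_lag,
    relative to the lag-0 value. Close to 1 for strongly periodic input."""
    return float(spectrum[min_lag:].max() / spectrum[0])


def classify(features, threshold=0.5, max_lag=None):
    """Label a signal 'music' if its beat strength reaches the (assumed) threshold."""
    n = features.shape[0]
    if max_lag is None:
        max_lag = n // 2
    sim = similarity_matrix(features)
    score = beat_strength(beat_spectrum(sim, max_lag))
    return "music" if score >= threshold else "speech"


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Synthetic stand-ins for MFCC frames: a 20-frame pattern repeated 10 times
    # (beat-like) versus 200 unrelated random frames (no beat).
    periodic = np.tile(rng.standard_normal((20, 13)), (10, 1))
    aperiodic = rng.standard_normal((200, 13))
    print(classify(periodic), classify(aperiodic))
```

The synthetic inputs only show that a repeating frame pattern produces a strong peak in the beat spectrum at the repetition lag, while unrelated frames do not. On real audio, neighbouring frames are similar to each other, so `min_lag` and the threshold would have to be tuned on labeled data.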