期刊文献+

应用无监督最大互信息算法分类鸟类叫声

Application of Unsupervised Maximum Mutual Information Algorithm to Classify Bird Songs
下载PDF
导出
摘要 在建立鸟类叫声的分类模型时,由于自然界中具有准确标签的鸟类叫声数据较少,因此需要解决小样本下的模型训练问题。本文研究应用最大互信息的无监督网络来对鸟类叫声进行分类。通过同时提取梅尔图谱的高层语义特征和浅层特征并计算互信息,减少噪声特征的提取。训练时使用对抗样本,利用先验约束网络拉大不同类别之间的距离,降低模型对数据的依赖。实验证明,与现有无监督方法相比,利用最大互信息方法的无监督学习能够在鸟类叫声分类任务上取得最好的效果。 When establishing a classification model for bird calls,due to the limited number of accurately labeled bird call data in nature,it is necessary to solve the problem of model training in small samples.This article studies the application of unsupervised networks with maximum mutual information to classify bird calls.By simultaneously extracting high-level semantic features and shallow features of the Mel graph and calculating mutual information,the extraction of noise features is reduced.During training,adversarial samples are used,and a prior constraint network is used to widen the distance between different categories and reduce the model's dependence on data.Experiments have shown that compared to existing unsupervised methods,unsupervised learning using the maximum mutual information method can achieve the best results in bird call classification tasks.
作者 潘婕 PAN Jie(School of Electronics and Information Engineering,Ningbo Polytechnic,Ningbo,China,315800)
出处 《福建电脑》 2024年第2期67-69,共3页 Journal of Fujian Computer
基金 浙江省教育厅项目(No.Y202147455) 宁波职业技术学院校级课题(NZ24030Q)资助。
关键词 最大互信息 无监督学习 梅尔图谱 鸟类叫声分类 Maximum Mutual Information Unsupervised Learning Mel Spectrogram Bird-Sounds Classification
  • 相关文献

参考文献1

二级参考文献6

  • 1Benjamin J Shannon,,Kuldip K.Paliwal."Feature extraction from higher-lag autocorrelation coefficients for robust speech recognition"[].Speech Communication.
  • 2MALLAT S."A theory for multiresolution decomposition:the wavelet representation"[].IEEE TransAcoustics Speech and Signal processing.1988
  • 3Ching-Tang HSIEH,Regular Member,WANG You-Chuang."A robust speaker identification system based on wavelet transform"[].IEICE Transactions on Information and Systems.2001
  • 4H.N.Nounou,,M.N.Nounou."Multiscale fuzzy Kalman filtering"[].Engineering Applications of Artificial Intelligence.2006
  • 5Fu,Qiang."Research on parameter representation and objective quali- ty assessment of speech"[]..2000
  • 6DAUBECHIES."Orthonormal bases of compactly supported wavelets"[].Communications of the ACM.1988

共引文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部