期刊文献+

基于压缩感知和音频指纹的固定音频检索方法 被引量:2

Specific Audio Retrieval Method Based on Compressed Sensing and Audio Fingerprint
下载PDF
导出
摘要 针对现有音频检索中样本音频特征库数据量较大且检索速率慢问题,本文提出一种基于压缩感知和音频指纹降维的固定音频检索方法.在音频检索的训练阶段,首先,对样本音频信号进行稀疏化处理,并通过压缩感知算法对稀疏化后的音频数据进行压缩;其次,提取压缩信号的音频指纹;再次,引入音频指纹离散基尼系数通过计算音频指纹各维度的离散基尼系数对指纹实施降维,最终得到检索特征库.在音频检索阶段用和训练阶段相同的算法提取待检音频的特征与音频特征库数据匹配得出检索结论.实验结果表明,所提音频检索方法在确保较好的检索准确率的基础上,大幅度减小了样本音频数据库的存储量,提高了音频的检索速率. In order to solve the problem of large amount of data and slow retrieval speed in the existing audio retrieval,a fixed audio retrieval method is proposed in this study based on compressed sensing and audio fingerprint dimensionality reduction.In the training stage of audio retrieval,the sample audio signal is sparse processed,and the sparse audio data is compressed by the compression sensing algorithm,then the audio fingerprint is extracted,and then the audio fingerprint discrete Gini coefficient is introduced to reduce the dimension of the fingerprint by calculating the discrete Gini coefficient of each dimension of the audio fingerprint.In the recognition stage of audio retrieval,we use the same algorithm as in the training stage to process the audio to be tested and match with the sample audio fingerprint.The experimental results show that the proposed audio retrieval method greatly reduces the storage of the sample audio database and improves the audio retrieval speed on the basis of ensuring a better retrieval accuracy.
作者 赵文兵 贾懋珅 王琪 ZHAO Wen-Bing;JIA Mao-Shen;WANG Qi(Faculty of Information Technology,Beijing University of Technology,Beijing 100124,China)
出处 《计算机系统应用》 2020年第8期165-172,共8页 Computer Systems & Applications
基金 国家自然科学基金(61971015)。
关键词 音频检索 压缩感知 离散基尼系数 音频指纹 audio retrieval compressed sensing discrete Gini coefficient audio fingerprinting
  • 相关文献

参考文献8

二级参考文献119

  • 1王守觉,潘晓霞,徐春燕,陈旭,安冬,曹文明.一种基于高维空间覆盖动态搜索方法的非特定人连续数字语音识别的研究[J].电子学报,2005,33(10):1790-1793. 被引量:7
  • 2Wang Y, Liu Z, Huang JC. Multimedia content analysis-using both audio and visual clues. IEEE Signal Processing Magazine, 2000, 17(6): 12-36
  • 3Foote J. An overview of audio information retrieval. Multimedia Systems, 1999, 7(1):2-10
  • 4Hansen JHL, Huang R, Zhou B, et al. Speechfind.. Advances in spoken document retrieval for a national gallery of the spoken word. IEEE Transactions on Speech and Audio Processing, 2005, 13(5): 712-730
  • 5Kashino K, KurozumiT, Murase H. A quick search method for audio and video signals based on histogram pruning. IEEE Transactions on Multimedia, 2003, 5(3) : 348-357
  • 6Kim KM, Kim SY, Jeon JK, et al. Quick audio retrieval using multiple feature vectors. IEEE Transactions on Consumer Electronics, 2006, 52(1): 200-205
  • 7Zhang WQ, Liu J. Two-stage method for specific audio retrieval. IEEE International Conference on Acoustics, Speech, and Signa Processing(ICASSP), Hawaii, 2007. New Jersey: IEEE Press 2007, Ⅳ 85-88
  • 8Wang SJ, Liu YY. An algorithm for removing facial makeup disturbances based on high dimensional imaginal geometry. Chinese Journal of Electronics, 2006, 15(4A): 789-792
  • 9Haykin S著,宋铁成,等译.通信系统.北京:电子工业出版社,2003,56-58
  • 10Kay SM著,罗鹏飞,等译.统计信号处理基础估计与检测理论.北京:电子工业出版社,2003,409-411

共引文献73

同被引文献10

引证文献2

二级引证文献4

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部