期刊文献+

基于压缩感知的固定音频检索方法 被引量:2

A Specific Audio Retrival Method Based on Compressive Sensing
下载PDF
导出
摘要 固定音频检索是指在待检索音频或音频库中检测和定位与给定样例音频同源的音频片段,它是音频检索中的基本问题之一。针对现有的音频检索方法在特征提取和匹配检索中数据存储空间大、算法复杂和特征信息容易丢失等问题,提出了一种基于压缩感知理论的固定音频检索方法,对具有稀疏性的信号进行采样压缩获得精简的观测值,提高算法的处理效率。首先对经过预处理的音频信号进行感知压缩,得到少量的压缩观测值,对其提取音频特征;然后采用粗检和精检匹配相结合的分层检索方法计算音频在观测值上对应特征的相似度,实现样例音频的精确检索。实验结果表明,该方法在固定音频检索速度和查全率、查准率等指标上具有较优的性能。 Specific audio retrieval which is one of the basic problems of audio retrieval is a means to detect and locate the fixed audio in the given audio or audio library. The existing audio retrieval methods would consume a large data storage space and have a high algorithm complexity,as well as the feature information was easy to be lost,this paper proposes a specific audio retrieval method based on compressive sensing,refers to the characteristics of compressive sensing( CS) theory in which the sparsely original signal can be compressed sampling to get a small amount of measured value for transmission and processing. First,a small amount of compressed measurements of the original audio signal are gotten by the compressed sampling,and can extract the audio features after the pretreatment. Then,by adopting the method that combines the rough retrieval and the precise retrieval,the audio similarity of the audio features can be calculated. It realizes the accurate retrieval of the specific audio. The experimental results show that this method has a better performance on retrieval speed as well as audio recall and precision,which means that it can retrieval specific audio efficiently and precisely.
出处 《实验室研究与探索》 CAS 北大核心 2015年第6期50-54,共5页 Research and Exploration In Laboratory
关键词 音频检索 压缩感知 观测值 分层检索 audio retrieval compressive sensing measured value hierarchical retrieval
  • 相关文献

参考文献14

  • 1姜洪臣,任晓磊,赵耀宏,徐波.基于音频语谱图像识别的广告检索[J].清华大学学报(自然科学版),2011,51(9):1249-1252. 被引量:9
  • 2李晨,周明全.音频检索技术研究[J].计算机技术与发展,2008,18(8):215-218. 被引量:7
  • 3Zhang W Q,Liu J.Two-stage method for specific audio retrieval[C]∥IEEE International Conference on Acoustics,Speech and Signal Processing(ICASSP),2007,4:85-88.
  • 4Smith G,Murase H,Kashino K.Quick audio retrieval using active search[C]∥IEEE International Conference on Acoustics,Speech and Signal Processing(ICASS P),1998,6:3777-3780.
  • 5Kashino K,Kurozumi T,Murase H.A quick search method for audio and video signals based on histogram pruning[J].IEEE Transactions on Multi Media,2003,5(3):348-357.
  • 6张卫强,刘加,陈恩庆.一种基于仿生模式识别思想的固定音频检索方法[J].自然科学进展,2008,18(7):808-813. 被引量:6
  • 7潘俊兰.基于特征相似度的音频检索技术研究[D].北京:清华大学,2009.
  • 8Donoho D L.Compressed Sensing[J].IEEE Transaction on Information Theory,2006,52(4):1289-1306.
  • 9石光明,刘丹华,高大化,刘哲,林杰,王良君.压缩感知理论及其研究进展[J].电子学报,2009,37(5):1070-1081. 被引量:707
  • 10梁瑞宇,奚吉,张学武.压缩感知理论在语音信号处理中的应用[J].声学技术,2010,29(4):280-282.

二级参考文献111

  • 1王守觉,潘晓霞,徐春燕,陈旭,安冬,曹文明.一种基于高维空间覆盖动态搜索方法的非特定人连续数字语音识别的研究[J].电子学报,2005,33(10):1790-1793. 被引量:7
  • 2张春梅,尹忠科,肖明霞.基于冗余字典的信号超完备表示与稀疏分解[J].科学通报,2006,51(6):628-633. 被引量:70
  • 3Wang Y, Liu Z, Huang JC. Multimedia content analysis-using both audio and visual clues. IEEE Signal Processing Magazine, 2000, 17(6): 12-36
  • 4Foote J. An overview of audio information retrieval. Multimedia Systems, 1999, 7(1):2-10
  • 5Hansen JHL, Huang R, Zhou B, et al. Speechfind.. Advances in spoken document retrieval for a national gallery of the spoken word. IEEE Transactions on Speech and Audio Processing, 2005, 13(5): 712-730
  • 6Kashino K, KurozumiT, Murase H. A quick search method for audio and video signals based on histogram pruning. IEEE Transactions on Multimedia, 2003, 5(3) : 348-357
  • 7Kim KM, Kim SY, Jeon JK, et al. Quick audio retrieval using multiple feature vectors. IEEE Transactions on Consumer Electronics, 2006, 52(1): 200-205
  • 8Zhang WQ, Liu J. Two-stage method for specific audio retrieval. IEEE International Conference on Acoustics, Speech, and Signa Processing(ICASSP), Hawaii, 2007. New Jersey: IEEE Press 2007, Ⅳ 85-88
  • 9Wang SJ, Liu YY. An algorithm for removing facial makeup disturbances based on high dimensional imaginal geometry. Chinese Journal of Electronics, 2006, 15(4A): 789-792
  • 10Haykin S著,宋铁成,等译.通信系统.北京:电子工业出版社,2003,56-58

共引文献736

同被引文献14

引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部