期刊文献+

一种基于随机段的固定音频检索方法 被引量:4

Method of specific audio retrieval based on randomly segment
下载PDF
导出
摘要 在固定音频检索的整体检索方法中,当检索目标较长时,检索时间会变得很长。为了减小检索时间,提出了一种基于随机段的音频检索方法。把整个检索过程分成随机段检索和整体匹配两个阶段:随机段检索是从参考模板中随机选择一段(随机段)作为检索目标进行检索;整体匹配是在随机段检索出的基础上,判断潜在目标信号是否为参考模板。把这种随机检索的方法应用到计算特征距离和直方图交集方法中,结果证明该检索方法的准确率可以达到90%以上,而且平均检索时间可以降低到随机段与参考模板的比值和整体检索时间的积。 This paper proposed a specific audio retrieval method based on random segment in order to decrease the retrieval time for relatively long object in the total retrieval of audio retrieval. The whole retrieval process was composed of random segment retrieval and total matching: the first was to select a segment from template model as object to retrieve in stored signal and the second was to judge whether the potential object signal was the template model based on random segment. Then using this method in computing feature distance directly and histogram intersection retrieval, the experimental results show retrieval accuracy over 90% and average retrieval time declined to the ratio of random segment to template model multiplying total retrieval time.
出处 《计算机应用》 CSCD 北大核心 2010年第1期230-232,共3页 journal of Computer Applications
基金 国家自然科学基金资助项目(60972132 60602014)
关键词 直方图交集法 特征距离 过零率 histogram intersection feature distance Zero Crossing Rate (ZCR)
  • 相关文献

参考文献11

  • 1FOOTE J. An overview of audio information retrieval[ J]. Multimedia Systems, 1999, 7(1) : 2 - 10.
  • 2HANSEN J H L, HUANG RONGQING. SpeechFind: Advances in spoken document retrieval for a national gallery of the spoken word [J]. IEEE Transactions on Speech and Audio processing, 2005, 13 (5): 712-730.
  • 3CHECHIL G, LE E, REHN M, et al. Large-scale content-based audio retrieval from text queries[ C]//Proceedings of 1st ACM International Conference on Multimedia Information Retrieval. New York: ACM, 2008:105 - 112.
  • 4张卫强,刘加.网络音频数据检索技术[J].通信学报,2007,28(12):152-155. 被引量:10
  • 5张卫强,刘加,陈恩庆.一种基于仿生模式识别思想的固定音频检索方法[J].自然科学进展,2008,18(7):808-813. 被引量:6
  • 6SMITH G, MURASE H, KASHINO K. Quick audio retrieval using active search[C]// IEEE International Conference on Acoustics, Speech and Signal Processing. New York: IEEE, 1998, 6:3777 - 3780,.
  • 7KASHINO K, KUROZUMI T, MURASE H. A quick search method for audio and video signals based on histogram pruning[ J]. IEEE Transactions on Multimedia, 2003, 5 (3) : 384 - 357.
  • 8郑贵滨,韩纪庆,李海峰,郑铁然.基于分段的实时声频检索方法[J].声学学报,2006,31(2):101-108. 被引量:5
  • 9ZHANG W Q, LIY J. two-stage method for specific audio retrieval [ C]// IEEE International Conference on Acoustics, Speech and Signal Processing. New York: IEEE, 2007, 4:85 - 88.
  • 10YAO J C, WAN W W, YU X Q, et al. A quick specific audio retrieval algorithm based on general prediction[ C]// IEEE 2008 International Conference on Audio, Language and Image Processing. New York: IEEE, 2008, 1180 - 1184.

二级参考文献34

  • 1吕成国,韩纪庆,王承发.动态时间规正与差别子空间相结合的变异语音识别方法[J].声学学报,2005,30(3):229-234. 被引量:2
  • 2王守觉,潘晓霞,徐春燕,陈旭,安冬,曹文明.一种基于高维空间覆盖动态搜索方法的非特定人连续数字语音识别的研究[J].电子学报,2005,33(10):1790-1793. 被引量:7
  • 3Wang Y, Liu Z, Huang JC. Multimedia content analysis-using both audio and visual clues. IEEE Signal Processing Magazine, 2000, 17(6): 12-36
  • 4Foote J. An overview of audio information retrieval. Multimedia Systems, 1999, 7(1):2-10
  • 5Hansen JHL, Huang R, Zhou B, et al. Speechfind.. Advances in spoken document retrieval for a national gallery of the spoken word. IEEE Transactions on Speech and Audio Processing, 2005, 13(5): 712-730
  • 6Kashino K, KurozumiT, Murase H. A quick search method for audio and video signals based on histogram pruning. IEEE Transactions on Multimedia, 2003, 5(3) : 348-357
  • 7Kim KM, Kim SY, Jeon JK, et al. Quick audio retrieval using multiple feature vectors. IEEE Transactions on Consumer Electronics, 2006, 52(1): 200-205
  • 8Zhang WQ, Liu J. Two-stage method for specific audio retrieval. IEEE International Conference on Acoustics, Speech, and Signa Processing(ICASSP), Hawaii, 2007. New Jersey: IEEE Press 2007, Ⅳ 85-88
  • 9Wang SJ, Liu YY. An algorithm for removing facial makeup disturbances based on high dimensional imaginal geometry. Chinese Journal of Electronics, 2006, 15(4A): 789-792
  • 10Haykin S著,宋铁成,等译.通信系统.北京:电子工业出版社,2003,56-58

共引文献13

同被引文献36

  • 1史泽林,康娇,孙锐.基于BP神经网络的大视场成像畸变校正方法[J].光学精密工程,2005,13(3):348-353. 被引量:24
  • 2郑贵滨,韩纪庆,李海峰,郑铁然.基于分段的实时声频检索方法[J].声学学报,2006,31(2):101-108. 被引量:5
  • 3Wang Yao, Liu Zhu, Huang Jincheng. Multimedia Content Analysis Using Both Audio and Visual Clues[J]. IEEE Signal Processing Magazine, 2000, 17(6): 12-36.
  • 4Foote J. An Overview of Audio Information Retrieval[J]. Multi- media Systems, 1999, 7(1): 2-10.
  • 5Kashino K, Kurozumi T, Murase H. A Quick Search Method for Audio and Video Signals Based on Histogram Pruning[J]. IEEE Trans. on Multimedia, 2003, 5(3): 348-357.
  • 6Zhang Weiqiang, Liu Jia. Two-stage Method for Specific Audio Retrieval[C] //Proc. of IEEE International Conference on Acoustics, Speech, and Signal Processing. [S. l.] : IEEE Press, 2007: 85-88.
  • 7Kim K M, Kim S Y, Jeon J K, et al. Quick Audio Retrieval Using Multiple Feature Vectors[J]. IEEE Trans. on Consumer Electronics, 2006, 52(1): 200-205.
  • 8Cheng Deyuan, Gersho A, Ramamurthi B, et al. Fast Search Algorithm for Vector Quantization and Pattern Matching[C] //Proc. of IEEE International Conference on Acoustics, Speech, and Signal Processing. [S. l.] : IEEE Press, 1984: 372-375.
  • 9Huang X, Alex A, Hon H W. Spoken Language Processing[M]. New Jersey, USA: Prentice Hall, 2001.
  • 10Yao Jincao, Wan Wanggen, Yu Xiaoqing, et al. A Quick Specific Audio Retrieval Algorithm Based on General Prediction[C] //Proc. of IEEE International Conference on Audio, Language and Image Processing. New York, USA: IEEE Press, 2008: 1180-1184.

引证文献4

二级引证文献9

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部