期刊文献+

基于音/视频特征的足球视频体育事件交互式检索方法

An Approach for Football Video Event Retrieval Based on Audio/Video
下载PDF
导出
摘要 本文提出了一种交互式足球体育视频事件检索方法。在该方法中,首先从音频和视频中提取四种类型的特征,计算出它们的均值和标准差,并把这八个数据编码成一个染色体,建立与视频文件的索引。然后,利用交互式遗传算法实现足球体育视频事件的检索。首先,系统从数据库中随机地选取N个视频文件供用户观看与选择;然后,系统根据用户所选视频提取相应的染色体,并对这些染色体进行重组操作得到目标染色体;其次,把目标染色体与数据库中的所有染色体进行比较,利用欧式距离计算出它们的相似度,从中选取N个最相似的染色体对应的视频为下一代视频;最后,不断迭代上面的过程,直到得到用户想要的视频。通过对包含有400个视频事件的数据库的实验,证明该方法能够有效地检索足球视频数据库中的视频文件,准确率达到89%。 This paper proposed an interactive genetic algorithm (IGA) for football video events retrieval with multimodal features. Eight audio-visual features,i, e. , average and standard deviation of shot duration, motion activity, sound energy, and speech rate, were extracted from each video;they were encoded as chromosomes and indexed into search table. First, the proposed algorithm randomly selected N videos from the initial population of videos database, and the user selected what he (she) wanted in mind. Next, the associated chromosomes of selected videos were regarded as target chromosomes after crossover, and chromosomes in the database videos were compared based on Euclidean distances to obtain the most similar videos as solutions of the next generation. By iterating this process, a new population of videos was retrieved. This approach of retrieval shows about 89% of precision on the average over 400 videos.
出处 《信号处理》 CSCD 北大核心 2009年第7期1070-1075,共6页 Journal of Signal Processing
基金 国家863基金资助项目(No.2007AA01Z432) 国家242信息安全计划资助课题成果(No.2006A07) 国家863基金资助项目(No.2007AA01Z433)
关键词 视频检索 音视频特征 相关反馈 交互式遗传算法 Videos indexing Audio-visual features Relevance feedback Interactive Genetic Algorithm (IGA)
  • 相关文献

参考文献17

  • 1史元春,徐光祐,高原,肖鑫.中国多媒体技术研究:2006[J].中国图象图形学报,2007,12(7):1129-1151. 被引量:6
  • 2Smeaton A F.Techniques used and open challenges to the analysis,indexing and retrieval of digital video[J].Information Systems,2007,32(4):545-559.
  • 3Michael S L,Nicu S,Chabane D.Content-based multimedia information retrieval:state of the art and challenges[J].ACM Transactions on Multimedia Computing,Communications and Applications,2006,2(1):1-19.
  • 4TRECVid video retrieval evaluation(last checked July 2006),(http://www-nlpir.nist.gov/projects/t01v/t01v.tml).
  • 5Smith J R,et al.IBM multimedia analysis and retrieval system.In:IBM Research White paper.
  • 6Xiong Z Y,Xiang S Z,Qi T et al.Semantic retrieval of video-review of research on video retrieval in meetings,movies and broadcast news,and sports[J].IEEE Signal Processing Magazine,2006,23(2):18-27.
  • 7Salton G,McGill,M.Introduction to modern information retrieval[M].McGraw-Hill,New York (1983).
  • 8Saha S K,Das A K,Chanda B.Image retrieval based on indexing and relevance feedback[J].Pattern Recognition Letters,2007,28(3):357-366.
  • 9Munesawang P,Guan L.Adaptive video indexing and automatic/semi-automatic relevance feedback[J].IEEE Transaction on Circuits and Systems for Video and Technology,2005,15(8):1032-1046.
  • 10Wu C J,Zeng H C,Huang S H.Learning-based interactive video retrieval system[C].In:IEEE International conference on Multimedia and Expo (ICME2006),Toronto,Canada,2006:1785-1788.

二级参考文献23

  • 1徐光祐,贺伟晟,史元春.中国多媒体技术研究:2003[J].中国图象图形学报(A辑),2004,9(12):1397-1413. 被引量:5
  • 2徐光祐,车轶,史元春.中国多媒体技术研究:2002[J].中国图象图形学报(A辑),2003,8(12):1361-1378. 被引量:4
  • 3徐光祐,贺伟晟,史元春.中国多媒体技术研究:2004[J].中国图象图形学报,2005,10(7):805-820. 被引量:9
  • 4徐光祐,史元春,肖鑫,贺伟晟.中国多媒体技术研究:2005[J].中国图象图形学报,2006,11(7):901-918. 被引量:5
  • 5Grieder W and Kinsner W. Speech segmentation by variance fractal dimension. 1994, Conference Proceedings. 1994 Canadian Conference on Electrical and Computer Engineering, Halifax, Canada, 25-28 Sept. 1994, vol.2:481-485.
  • 6Boshoff H F V. A fast box counting algorithm for determining the fractal dimension of sampled continuous functions. 1992.COMSIG '92, Proceedings of the 1992 South African Symposium on Communications and Signal Processing, New York, NY, USA, 11 Sept. 1992: 43-48.
  • 7Jia Chuan and Xu Bo. An improved entropy-based endpoint detection algorithm. International Symposium on Chinese Spoken Language Processing (ISCSLP 2002). Taipei, Taiwan.August 23-24, 2002: 479-583.
  • 8Lingyun Gu and Zahorian S. A new robust algorithm for isolated word endpoint detection. 2002. Proceedings.(ICASSP '02). IEEE International Conference on Acoustics,Speech, and Signal Processing, Orlando, Florida, USA, 13-17May 2002, vol.4, IV-4161.
  • 9Turiel A and Pérez-Vicente C. Role of multifractal sources in the analysis of stock market time series. Physica A: Statistical Mechanics and its Applications, 2005, 355(24): 475-496.
  • 10F Ferens K and Kinsner W. Multifractal texture classification of images. WESCANEX 95. Communications, Power, and Computing. Conference Proceedings, IEEE. Winnipeg,Manitoba, Canada. May 15-16, 1995, Vol.2: 438-444.

共引文献24

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部