期刊文献+

基于多模态融合和竞争力的视频场景分割算法 被引量:1

Algorithm of Video Scene Segmentation Based on Multimodal Feature Fusion and Competition
下载PDF
导出
摘要 针对视频分割中底层特征与高层语义之间的"语义鸿沟"问题,提出了一种基于多模态融合和镜头间竞争力的场景分割算法,对视频帧的图像、文本、音频等模态进行特征提取,用欧式距离、余弦距离计算出同种模态数据的相似性,用典型相关分析法计算出不同模态数据的相关度,分别对各模态数据的相似性和相关度进行融合得到镜头之间的相似度和相关度,采用镜头间竞争力的方法分别对相似镜头和相关镜头进行场景分割并对分割出的两个场景边界集合取交集得到最终的场景边界,从而实现对视频的场景分割。实验结果表明,该方法在场景分割中具有较高的性能,查全率和查准率分别达到82.1%和86.7%。 To solve the problem of"semantic gap"between low-level features and high-level semantic in video scene seg-mentation, an algorithm of video scene segmentation was put forward based on multimodal feature fusion and competition.The im-age, text and audio features were abstracted as the low-level features of the video frame.Euclidean distance, cosine similarity distance were used to calculate the similarity of homogeneous data, and the method of canonical correlation analysis was used to calculate the heterogeneous data correlation, respectively.The shot similarity and shot relevance were obtained by similarity fu-sion and correlation fusion.Then a competition analysis of splitting and merging forces for scene segmentation was adopted.The final scene was obtained by take the intersection of two segmented scenarios border sets.Thus the video scene segmentation was realized.The results of experiments show that the video scene can be effectively separated by the proposed method, and the recall ratio, precision reached 82.1%and 86.7%respectively.
出处 《武汉理工大学学报(信息与管理工程版)》 CAS 2014年第6期759-763,共5页 Journal of Wuhan University of Technology:Information & Management Engineering
基金 湖北省自然科学基金资助项目(2009Chb008 2010CDB06603) 湖北省教育厅重点科研基金资助项目(D20101703)
关键词 竞争力 多模态融合 相似性度量 典型相关性 场景分割 competition multi-modality similarity measurement canonical correlation scene segmentation
  • 相关文献

参考文献4

二级参考文献46

共引文献44

同被引文献11

  • 1BABER J, AFZULPURKAR N, SATOH S. A frame- work for video segmentation using global and local fea- tures[ J ]. International Journal of Pattern Recognition and Artificial Intelligence, 2013, 27 (5) : 13550071 - 135500729.
  • 2YANG H, YI J, ZHAO J, et al. Extreme learning ma- chine based genetic algorithm and its application in power system economic dispatch [ J ]. Neurocomput- ing, 2013,102 ( 15 ) : 154 - 162.
  • 3NGOC T A, HIRAMATSU K, HARADA M. Optimizing the rule curves of multi - use reservoir operation using a genetic algorithm with a penalty strategy [ J ]. Paddy and Water Environment, 2014,12( 1 ) : 125 - 137.
  • 4GUPTA N, SHEKHAR R, KALRA P K. Congestion management based roulette wheel simulation for opti- mal capacity selection: probabilistie transmission ex- pansion planning[ J ]. International Journal of Electri- cal Power and Energy Systems, 2012,43 ( 1 ) : 1259 - 1266.
  • 5HWANG S F, HSU Y C, CHEN Y. A genetic algo- rithm for the optimization of fiber angles in composite laminates [ J ]. Journal of Mechanical Science and Technology, 2014,28 ( 8 ) : 3163 - 3169.
  • 6张鸿,吴飞,庄越挺,陈建勋.一种基于内容相关性的跨媒体检索方法[J].计算机学报,2008,31(5):820-826. 被引量:34
  • 7印勇,王旭军.基于主色跟踪和质心运动的视频场景分割[J].计算机应用研究,2010,27(4):1563-1565. 被引量:1
  • 8华漫.基于语义的体育视频场景分割方法[J].计算机工程,2010,36(15):206-207. 被引量:2
  • 9郭小川,刘明杰,王婧璐,董道国,万乾荣.基于频繁镜头集合的视频场景分割方法[J].计算机应用与软件,2011,28(6):116-120. 被引量:1
  • 10刘嘉琦,封化民,闫建鹏.基于多模态特征融合的新闻故事单元分割[J].计算机工程,2012,38(24):161-165. 被引量:8

引证文献1

二级引证文献4

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部