期刊文献+

基于多模态特征融合的新闻故事单元分割 被引量:8

News Story Unit Segmentation Based on Multi-modal Feature Fusion
下载PDF
导出
摘要 对新闻视频进行结构分析,提出一种基于多模态特征融合的新闻故事单元分割方法。将新闻视频分割成音频流和视频流,选择静音区间为音频候选点,将镜头边界切变点作为视频候选点,做主持人镜头和主题字幕的探测,挑选主持人镜头为候选区间,并记录主题字幕的起始位置和结束位置,利用时间轴融合音频候选点、视频候选点、主持人镜头和主题字幕,对新闻视频进行故事单元分割。实验结果表明,该方法的查全率为83.18%,查准率为83.92%。 News story unit segmentation method based on multi-modal feature fusion is proposed in this paper by analyzing news video structure.News video is divided into audio stream and video stream.Mute intervals are detected as audio candidate points,and the shot segmentations for news video are detected and shot boundary points are chosen as video candidate points,anchorperson shot and topic caption are detected.Story units are detected by fusing audio candidate points,video candidate points,anchorperson shot and topic caption based on time axis.Experimental results show that this method can get 83.18% in recall and 83.92% in precision.
出处 《计算机工程》 CAS CSCD 2012年第24期161-165,共5页 Computer Engineering
基金 国家自然科学基金资助项目(60972139) 北京市自然科学基金资助项目(4092041)
关键词 新闻视频 多模态特征 字幕 音频 故事单元分割 news video multi-modal feature caption audio story unit segmentation
  • 相关文献

参考文献9

二级参考文献64

  • 1严明,秦嘉杭.基于文本信息的数字视频检索研究[J].情报科学,2004,22(7):865-869. 被引量:10
  • 2苏新宁.视频信息索引技术研究进展[J].情报学报,2004,23(4):410-416. 被引量:7
  • 3严明,苏新宁.数字视频信息的元数据研究[J].情报学报,2004,23(5):605-610. 被引量:8
  • 4刘华咏.基于音视频特征和文字信息自动分段新闻故事[J].系统仿真学报,2004,16(11):2608-2610. 被引量:8
  • 5王策,何炎祥,王云,张春林.基于视音频特征和文本信息的新闻视频自动场景分割[J].计算机工程,2005,31(6):171-172. 被引量:1
  • 6D'ANNA L, PERCANNELLA G, SANSONE C, et al. A multi-stage approach for news video segmentation based on automatic anchorperson number detection [ C]// Proceedings of the 2007 International Conference on Mobile Ubiquitous Computing, Systems, Services and Technologies. Washington, D C: IEEE Computer Society, 2007: 229 - 234.
  • 7de SANTO M, PERCANNELLA G, SANSONE C, et al. Segmentation of news videos based on audio-video information[ J]. Pattern Analysis and Applications, 2007, 10(2): 135 -145.
  • 8XU XIN-WEN, LI GUO-HUI, YUAN JIAN. A segmentation method of news video stories based on announcer's voiceprint[ C]//Proceedings of the 7th International Conference on Machine Learning and Cybernetics. Kunming, China: IEEE, 2008:2749-2753.
  • 9卿来云,王伟强,高文.文字自动提取及其在视频索引和检索中的应用[C]//中科院第7届计算机研究生科技论坛,2002:1-9.
  • 10Tang X O,Gao X B,Liu J Z,et al.A spatial-temporal approach for video caption detection and recognition[J].IEEE Transactions on Neural Networks,2002,13(4) :961-971.

共引文献28

同被引文献57

引证文献8

二级引证文献13

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部