一种新的视频摘要可视化算法被引量：2

A Novel Approach for Abstractive Video Visualization

下载PDF

导出

摘要提出一种摘要式浏览视频文件的可视化方法,将顺序视频转化成图像形式摘要,能够帮助读者快速有效地获得视频数据的结构信息.算法通过检测视频中每帧的尺度不变特征(SIFT),应用改进的词袋模型构建特征词库并统计词频,将整段视频映射为高维词库空间的一条曲线.通过多维尺度分析(MDS)方法对该曲线降维,生成反映视频语义信息的一条三维平滑曲线.实验结果表明,该曲线很好地体现视频中各帧之间的关联性和语义转折,可辅助读者快速理解视频情节结构. We present a novel approach for abstractive video visualization, which can help users understand the semantic information from the video in a fast and effective manner. We use the scale- invariant feature transform （SIFT） algorithm to detect features of each frame, together with a modified bag of words algorithm to construct a feature vocabulary in order to compute the feature frequencies. By mapping the video sequence onto a 3D curve in a high dimensional vocabulary space with the use of the multi-dimensional scaling （MDS） algorithm, the video is abstracted and embedded into a visually recognizable curve in 3D space. This generated visualization result can vividly illustrate the evolvement of the video contents, while well protecting and preserving the semantic meaning that are encoded within the video. Experimental results indicate that this curve-based visualization technique can uncover the semantic relationship between the frames, characterize the transition of video contents, and help the users understand the semantic structure of the underlying video sequence with a quick glance.

作者彭帝超刘琳陈广宇陈海东左伍衡陈为

机构地区 CAD&CG国家重点实验(浙江大学)室邵逸夫医院影像科浙江工业大学之江学院信息工程分院

出处《计算机研究与发展》 EI CSCD 北大核心 2013年第2期371-378,共8页 Journal of Computer Research and Development

基金国家"八六三"高技术研究发展计划基金项目(2012AA120903) 国家自然科学基金项目(61003193) 浙江省科技厅公益基金项目(2011C21058)

关键词可视化视频摘要词袋低维嵌入 visualization video abstract bag of word low-dimensional embedding

分类号 TP391.41 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献16

1Karam G. Visualization using timelines[A].New York:ACM,1994.125-137.
2Pless R. Image spaces and video trajectories:Using Isomap to explore video sequences[A].Piscataway,NJ:IEEE,2003.1433-1440.
3Tenenbaum J,Silva V,Langford J. A global geometric framework for nonlinear dimensionality reduction[J].Science,2000,(5500):2319-2323.
4Cheung V,Frey B J,Jojic N. Video epitomes[A].Piscataway,NJ:IEEE,2005.42-49.
5Cheung V,Frey B J,Jojic N. Video epitomes[J].International Journal of Computer Vision,2008,(02):141-152.
6Sonka M,Hlavac V,Boyle R. Image Processing Analysis,and Machine Vision[M].Pacific Grove,MA:PWS Publishing,1999.
7Romero M,Summet J,Stasko J. Viz-A-vis:Toward visualizing video through computer vision[J].IEEE Transactions on Visualization and Computer Graphics,2008,(06):1261-1268.
8Daniel G,Chen Min. Video visualization[A].Piscataway,NJ:IEEE,2003.409-416.
9Mao Yi,Dillon J,Lebanon G. Sequential document visualization[J].IEEE Transactions on Visualization and Computer Graphics,2007,(06):1208-1215.
10Lebanon G,Mao Yi,Dillon J. The locally weighted bag of words framework for document representation[J].Journal of Machine Learning Research,2007,(10):2405-2441.

同被引文献36

1鲁敏,郁文贤,鲍虎军,匡纲要.基于机电跟踪的三维虚拟演播室系统[J].电子学报,2003,31(z1):2035-2039. 被引量：3
2程文刚,须德,蒋轶玮,即丛妍.一种新的动态视频摘要生成方法[J].电子学报,2005,33(8):1461-1466. 被引量：6
3Pritch Y,Rav-Acha A,Peleg S.Nonchronological video synopsis and indexing[J]. IEEE Trans,2008,PAMI-30(11):1971-1984.
4Y N,C X,H S,et al.Compact video synopsis via global spatiotemporal optimization[J]. IEEE Trans,2013,TVCG-19(10):1664-1676.
5SIMAKOV,D,CASPI,Y,SHECHTMAN,E,AND IRANI,M.Summarizing visual data using bidirectional similarity. Computer Vision and Pattern Recognition[C]. Alaska,USA:IEEE.2008.1-8.
6Nie Y,Sun H,Li P,et al.Object movements synopsis via part assembling and stitching[J]. IEEE Trans,2014,TVCG-20(9):1303-1315.
7Germann M,Popa T,Keiser R,et al.Novel-view synthesis of outdoor sport events using an adaptive view-dependent geometry[J]. Computer Graphics Forum,2012,31(21):325-333.
8Saxena A,Sun M,Ng A Y.Make3D:learning 3D scene structure from a single still image[J]. IEEE Trans,2009,PAMI-31(5):824-840.
9HARTLEY,R,ZISSERMAN,A.Multiple view geometry in computer vision[M]. Cambridge,UK:Cambridge University Press.2003:23-64.
10Maki A,Watanabe M,Wiles C.Geotensity:Combining motion and lighting for 3d surface reconstruction[J]. International Journal of Computer Vision,2002,48(2):75-90.

引证文献2

1徐超,聂勇伟,葛红美,周国富.基于新视角合成的视频摘要交互式浏览[J].电子学报,2015,43(11):2263-2270. 被引量：4
2侯丽微,胡珀,曹雯琳.主题关键词信息融合的中文生成式自动摘要研究[J].自动化学报,2019,45(3):530-539. 被引量：27

二级引证文献31

1冀中,樊帅飞.基于超图排序算法的视频摘要[J].电子学报,2017,45(5):1035-1043. 被引量：5
2ZHANG Lijun,LI Yue,ZHU Qiuyu,LI Mingqi.Generating Virtual Images for Multi-view Video[J].Chinese Journal of Electronics,2017,26(4):810-813.
3石磊,阮选敏,魏瑞斌,成颖.基于序列到序列模型的生成式文本摘要研究综述[J].情报学报,2019,38(10):1102-1116. 被引量：12
4张兴旺,黄晓斌,郑聪.基于视觉摘要的古代南海海图可视化人机交互模型研究——以《郑和航海图》为例[J].情报理论与实践,2019,42(10):71-76. 被引量：3
5陶兴,张向先,郭顺利,张莉曼.学术问答社区用户生成内容的W2V-MMR自动摘要方法研究[J].数据分析与知识发现,2020,4(4):109-118. 被引量：8
6叶俊民,罗达雄,陈曙.基于短文本情感增强的在线学习者成绩预测方法[J].自动化学报,2020,46(9):1927-1940. 被引量：14
7吕瑞,王涛,曾碧卿,刘相湖.TSPT:基于预训练的三阶段复合式文本摘要模型[J].计算机应用研究,2020,37(10):2917-2921. 被引量：3
8柴悦,赵彤洲,江逸琪,高佩东.基于Att-iBi-LSTM的新闻主题词提取方法研究[J].武汉工程大学学报,2020,42(5):575-580.
9谢谦,董立红,厍向阳.基于Attention-GRU的短期电价预测[J].电力系统保护与控制,2020,48(23):154-160. 被引量：43
10宁珊,严馨,徐广义,周枫,张磊.融合关键词的中文新闻文本摘要生成[J].计算机工程与科学,2020,42(12):2265-2272. 被引量：4

1姚勇.针对地形的D3D9基本实现原理[J].程序员（游戏创造）,2007(9):50-57.
2梁敏,王兆仲.摄像机平行于场景运动且晃动下的视频修复[J].计算机工程与应用,2011,47(4):176-180. 被引量：2
3刘丹.利用java语言对三次样条曲线的实现[J].赤峰学院学报（自然科学版）,2014,30(4):8-9.
4刘婷婷,闫德勤,郑宏亮.NMF和Isomap相结合的图像检索新方法[J].计算机应用研究,2011,28(6):2372-2374. 被引量：1
5张维维,王唯玮.基于决策树的入侵数据特征检测模型[J].信息技术,2009(10):107-109.
6邱立达,刘天键,林南,黄章超.基于深度学习模型的无线传感器网络数据融合算法[J].传感技术学报,2014,27(12):1704-1709. 被引量：21
7刘乐.普通路由器端口映射及视频映射的一种方法[J].计算机光盘软件与应用,2014,17(16):284-285.
8App播报[J].新电脑,2016,0(8):49-49.
9徐勤军,吴镇扬.视频序列中的行为识别研究进展[J].电子测量与仪器学报,2014,28(4):343-351. 被引量：20
10王朝晖,郑新奇.基于共词分析的智慧城市研究现状与展望[J].地域研究与开发,2014,33(4):59-63. 被引量：8

计算机研究与发展

2013年第2期

浏览历史

内容加载中请稍等...

一种新的视频摘要可视化算法被引量：2

参考文献16

同被引文献36

引证文献2

二级引证文献31

相关作者

相关机构

相关主题

浏览历史

一种新的视频摘要可视化算法 被引量：2

参考文献16

同被引文献36

引证文献2

二级引证文献31

相关作者

相关机构

相关主题

浏览历史

一种新的视频摘要可视化算法被引量：2