
Video sketch summarization, interaction and cognition analysis
Abstract Video is an important class of visual media and a major carrier for information exchange. To support efficient representation of video content and natural, convenient user interaction such as browsing and searching, this paper proposes a sketch-based summarization and interaction method for video content. First, we present a sketch-based representation of video semantics that exploits the abstract and general nature of sketches; we introduce the concept of a semantic sketch, which supports annotating video content with sketches, and present an optimized layout algorithm for sketch summaries. Building on this, we present sketch interaction techniques based on the sketch summary, together with natural gesture operations that support interactive video browsing. We further analyze, from the viewpoint of cognitive psychology, the sketch representation used for video summarization and the roles and interrelations of the cognitive units involved in sketch interaction. Finally, user-study results show that the proposed sketch summarization and sketch interaction improve user efficiency in acquiring the main content of a video and reduce users' cognitive load.
Source Scientia Sinica Informationis (《中国科学:信息科学》), CSCD, 2013, No. 8, pp. 1012-1023 (12 pages)
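Funding Supported by the National Basic Research Program of China (Grant No. 2011CB302205), the National Natural Science Foundation of China (Grant Nos. 61232013, 61173058, 61272228), the National High Technology Research and Development Program of China (Grant Nos. 2012AA02A608, 2012AA011801), the Program for New Century Excellent Talents in University (Grant No. NCET-11-0273), and the Tsinghua National Laboratory for Information Science and Technology.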
Keywords video summarization, sketch layout, gesture operations, cognitive analysis