Video summarization with a graph convolutional attention network 被引量：2

导出

摘要 Video summarization has established itself as a fundamental technique for generating compact and concise video, which alleviates managing and browsing large-scale video data. Existing methods fail to fully consider the local and global relations among frames of video, leading to a deteriorated summarization performance. To address the above problem, we propose a graph convolutional attention network(GCAN) for video summarization. GCAN consists of two parts, embedding learning and context fusion, where embedding learning includes the temporal branch and graph branch. In particular, GCAN uses dilated temporal convolution to model local cues and temporal self-attention to exploit global cues for video frames. It learns graph embedding via a multi-layer graph convolutional network to reveal the intrinsic structure of frame samples. The context fusion part combines the output streams from the temporal branch and graph branch to create the context-aware representation of frames, on which the importance scores are evaluated for selecting representative frames to generate video summary. Experiments are carried out on two benchmark databases, Sum Me and TVSum, showing that the proposed GCAN approach enjoys superior performance compared to several state-of-the-art alternatives in three evaluation settings.

作者 Ping LI Chao TANG Xianghua XU

机构地区 School of Computer Science and Technology The State Key Laboratory for Novel Software Technology

出处《Frontiers of Information Technology & Electronic Engineering》 SCIE EI CSCD 2021年第6期902-913,共12页 信息与电子工程前沿（英文版）

基金 Project supported by the National Natural Science Foundation of China (Nos. 61872122 and 61502131) the Zhejiang Provincial Natural Science Foundation of China (No. LY18F020015) the Open Pro ject Program of the State Key Lab of CAD&CG,China (No. 1802) the Zhejiang Provincial Key Research and Development Program,China (No. 2020C01067)。

关键词 Temporal learning Self-attention mechanism Graph convolutional network Context fusion Video summarization

分类号 TP391.41 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献1

1Jie-hao HUANG,Xiao-guang DI,Jun-de WU,Ai-yue CHEN.A novel convolutional neural network method for crowd counting[J].Frontiers of Information Technology & Electronic Engineering,2020,21(8):1150-1160. 被引量：3

共引文献2

1Xinyu TONG,Ziao YU,Xiaohua TIAN,Houdong GE,Xinbing WANG.Improving accuracy of automatic optical inspection with machine learning[J].Frontiers of Computer Science,2022,16(1):45-56.
2朱学岩,张新伟,才嘉伟,郑一力,顾梦梦,陈锋军.基于无人机图像和贝叶斯CSRNet模型的粘连云杉计数[J].农业工程学报,2022,38(14):43-50. 被引量：4

同被引文献13

1黄庆明,郑轶佳,蒋树强,高文.基于用户关注空间与注意力分析的视频精彩摘要与排序[J].计算机学报,2008,31(9):1612-1621. 被引量：4
2陈芬,赖茂生.自适应关键帧提取技术研究[J].情报科学,2014,32(11):139-141. 被引量：3
3彭宇新,綦金玮,黄鑫.多媒体内容理解的研究现状与展望[J].计算机研究与发展,2019,56(1):183-208. 被引量：32
4刘玉杰,唐顺静,高永标,李宗民,李华.基于标签分布学习的视频摘要算法[J].计算机辅助设计与图形学学报,2019,31(1):104-110. 被引量：8
5江泽涛,秦嘉奇,胡硕.基于多路卷积神经网络的多光谱场景识别方法[J].计算机科学,2019,46(9):265-270. 被引量：6
6刘宇琦,赵宏伟,王玉.一种基于QPSO优化的流形学习的视频人脸识别算法[J].自动化学报,2020,46(2):256-263. 被引量：15
7苏筱涵.深度学习视角下视频关键帧提取与视频检索研究[J].网络安全技术与应用,2020(5):65-66. 被引量：6
8陈科圻,朱志亮,邓小明,马翠霞,王宏安.多尺度目标检测的深度学习研究综述[J].软件学报,2021,32(4):1201-1227. 被引量：97
9赖华,高玉梦,黄于欣,余正涛,张勇丙.基于多粒度特征的文本生成评价方法[J].中文信息学报,2022,36(3):45-53. 被引量：3
10Ali JAVED,Amen ALI KHAN.Shot classification and replay detection for sports video summarization[J].Frontiers of Information Technology & Electronic Engineering,2022,23(5):790-800. 被引量：1

引证文献2

1王惠峰,张峰,张昆,王子玮,白立飞,葛建军,张德.基于内容的视频高性能处理框架设计[J].指挥信息系统与技术,2022,13(2):85-90. 被引量：1
2Yuxin HUANG,Huailing GU,Zhengtao YU,Yumeng GAO,Tong PAN,Jialong XU.Enhancing low-resource cross-lingual summarization from noisy data with fine-grained reinforcement learning[J].Frontiers of Information Technology & Electronic Engineering,2024,25(1):121-134.

二级引证文献1

1孙斐然.基于内容检索的视频处理技术相关探讨[J].数字技术与应用,2023,41(3):78-80.

1Praneeth Sadda,John Onofrey,Metehan Imamoglu,Xenophon Papademetris,Bilal Qarni,Mert Ozan Bahtiyar.Real-time computerized video enhancement for minimally invasive fetoscopic surgery[J].Laparoscopic, Endoscopic and Robotic Surgery,2018,1(2):27-32. 被引量：1
2Zhen Ye,Shihao Shi,Zhan Cao,Lin Bai,Cuiling Li,Tao Sun,Yongqiang Xi.Graph-Based Dimensionality Reduction for Hyperspectral Imagery: A Review[J].Journal of Beijing Institute of Technology,2021,30(2):91-112. 被引量：1
3Ling WANG,Xiuqing HU,Na XU,Lin CHEN.Water Vapor Retrievals from Near-infrared Channels of the Advanced Medium Resolution Spectral Imager Instrument onboard the Fengyun-3D Satellite[J].Advances in Atmospheric Sciences,2021,38(8):1351-1366. 被引量：1
4Jhonathan Quillo-Espino,Rosa María Romero-González,Ana-Marcela Herrera-Navarro.A Deep Look into Extractive Text Summarization[J].Journal of Computer and Communications,2021,9(6):24-37.
5Issa Sow,Sacamba Aimé Omer Hema,Antoine Sanon,Issoufou Ouedraogo.Influence of Cotton Crop Types on the Variation of Phonoctonus lutescensPopulation Guérin Meneville and Percheron (Heteroptera: Reduvidae), a Predator of Dysdercus voëlkeri(Schmidt 1932) (Heteroptera: Pyrrochoridae) in Burkina Faso[J].Agricultural Sciences,2021,12(6):684-699.
6GAO Rui-yuan,WANG Chang-ming,LIANG Zhu.Comparison of different sampling strategies for debris flow susceptibility mapping: A case study using the centroids of the scarp area, flowing area and accumulation area of debris flow watersheds[J].Journal of Mountain Science,2021,18(6):1476-1488. 被引量：3
7Sara Jo Breslow,Margaret Allen,Danielle Holstein,Brit Sojka,Raz Barnea,Xavier Basurto,Courtney Carothers,Susan Charnley,arah Coulthard,Nives Dolsak,Jamie Donatuto,Carlos Garcia-Quijano,Christina C.Hicks,Arielle Levine,Michael B.Masci,Karma Norman,Melissa Poe,Terre Satterfield,Kevin St.Martin,Phillip S.Levin.Evaluating indicators of human well-being for ecosystem-based management[J].Ecosystem Health and Sustainability,2017,3(12):2-19.
8Feng-Jiau Lin,Hsiao-Yun Chang,Chun-Wen Tsao,Hung-Du Lin,Yih-Tsong Ueng.Population Structures and Diets of Two Species of Pisodonophis(Ophichthidae) from the Southwest Coast of Taiwan[J].Natural Resources,2021,12(6):197-204. 被引量：1
9Skylar Choi,Yongjin Park,Immanuel H. Anaborne,Jin Sik Song,Ji Woo Han,So Hyun Jeon,Jaewoo Kim,James Kim,Jinkwon Lee,Paul S. Chung.Improvement of Renewable Bioenergy Production in Microbial Fuel Cells with Saponin Supplementation[J].Journal of Sustainable Bioenergy Systems,2021,11(2):82-93.

Frontiers of Information Technology & Electronic Engineering

2021年第6期

浏览历史

内容加载中请稍等...