Image label transfer: Short video labelling by using frame auto-encoder

Image label transfer: Short video labelling by using frame auto-encoder

导出

摘要 Short videos on the Internet have a huge amount, but most of them are unlabeled. In this paper, a rough short video labelling method based on the image classification neural network is proposed. Convolutional auto-encoder is applied to train and learn unlabeled video frames, in order to obtain feature in the specific level. With these features, the video key-frames are extracted by the feature clustering method. These key-frames which represent the video content are put into an image classification network, so that the labels of every video clip can be got. In addition, the different architectures of convolutional auto-encoder are estimated, and a better performance architecture through the experiment result is selected. In the final experiment, the video frame features from the convolutional auto-encoder are compared with those from other extraction methods, where it illustrates remarkable results by the proposed method. Short videos on the Internet have a huge amount, but most of them are unlabeled. In this paper, a rough short video labelling method based on the image classification neural network is proposed. Convolutional auto-encoder is applied to train and learn unlabeled video frames, in order to obtain feature in the specific level. With these features, the video key-frames are extracted by the feature clustering method. These key-frames which represent the video content are put into an image classification network, so that the labels of every video clip can be got. In addition, the different architectures of convolutional auto-encoder are estimated, and a better performance architecture through the experiment result is selected. In the final experiment, the video frame features from the convolutional auto-encoder are compared with those from other extraction methods, where it illustrates remarkable results by the proposed method.

作者 Lü Chaohui Huang Yiyang

机构地区 School of Information and Communication Engineering Beijing Key Laboratory of Modern Entertainment Technology Key Laboratory of Acoustic Visual Technology and Intelligent Control System

出处《The Journal of China Universities of Posts and Telecommunications》 EI CSCD 2020年第1期92-99,共8页 中国邮电高校学报（英文版）

基金 supported by the National Key R&D Program of China (2018YFB1404100) the Fundamental Research Funds for the Central Universities (CUC18A002-2).

关键词 IMAGE feature VIDEO labelling convolutional neural network auto-encoder cluster key-frame image feature video labelling convolutional neural network auto-encoder cluster key-frame

分类号 TP391.41 [自动化与计算机技术—计算机应用技术] TP183 [自动化与计算机技术—控制理论与控制工程]

引文网络
相关文献

1Hamid Hassanpour,Mehdi Sedighi,Ali Reza Manashty.Video Frame’s Background Modeling: Reviewing the Techniques[J].Journal of Signal and Information Processing,2011,2(2):72-78. 被引量：4
2Ali Aghagolzadeh,Saeed Meshgini,Mehdi Nooshyar,Mehdi Aghagolzadeh.Very Low Bit-Rate Video Coding by Combining H.264/AVC Standard and 2-D Discrete Wavelet Transform[J].Wireless Sensor Network,2010,2(4):328-336.
3JI Yufeng,LI Weixing,FENG Kai,XING Boyang,PAN Feng.Automatic video mosaicking algorithm via dynamic key-frame[J].Journal of Systems Engineering and Electronics,2020,31(2):272-278. 被引量：2
4Dengyin Zhang,Yangyang Xu,Chunling Cheng.A QoE Assessment System in Distance Education[J].Engineering（科研）,2011,3(1):90-96.
5Weiwei Wang,Yuesheng Zhu.A Fast Depth-Map Generation Algorithm based on Motion Search from 2D Video Contents[J].Journal of Software Engineering and Applications,2012,5(12):144-148. 被引量：1
6Carlos Alexandre Gouvea da Silva,Guilherme Fernandes de Souza Miguel,Joao Guilherme Sauer,Carlos Marcelo Pedroso.Evaluation of Impairment Caused by MPEG Video Frame Loss[J].Engineering（科研）,2017,9(5):493-503.
7Shi Zhiming,Huang Chengti.Quality of experience models for network video quality[J].The Journal of China Universities of Posts and Telecommunications,2019,26(4):80-88.
8Chiman Kwan,Bryan Chou,Jonathan Yang,Trac Tran.Deep Learning Based Target Tracking and Classification for Infrared Videos Using Compressive Measurements[J].Journal of Signal and Information Processing,2019,10(4):167-199. 被引量：2
9Chiman Kwan,Jude Larkin.Perceptually Lossless Compression for Mastcam Multispectral Images: A Comparative Study[J].Journal of Signal and Information Processing,2019,10(4):139-166.
10Hong Zhao,Tao Wang,Xiangyan Zeng.A Clustering Algorithm for Key Frame Extraction Based on Density Peak[J].Journal of Computer and Communications,2018,6(12):118-128.

The Journal of China Universities of Posts and Telecommunications

2020年第1期

浏览历史

内容加载中请稍等...

Image label transfer: Short video labelling by using frame auto-encoder

相关作者

相关机构

相关主题

浏览历史