Review on image and video coding via generative adversarial networks

Abstract: Image and video coding is one of the central problems in multimedia signal processing. Its goal is to represent data efficiently and compactly while minimizing coding distortion and reducing transmission and storage costs. Since the 1970s, classical image and video coding has developed into a block-based hybrid "prediction-transform-entropy coding" framework, in which each stage relies on hand-designed algorithms that are optimized separately to achieve pixel-level fidelity. At low bit rates, however, quantization discards a large amount of high-frequency information, producing blurring, blocking artifacts, and other unacceptable compression distortions. In recent years, research on image and video coding based on generative adversarial networks (GANs) has made considerable progress. Compared with classical methods, GANs can better restore high-frequency texture details at low bit rates. This paper systematically reviews the techniques and progress of GAN-based image and video coding from three aspects: end-to-end coding based entirely on neural networks, generative adversarial networks, and GAN-based image and video coding. It also analyzes and discusses future development trends of GAN-based image and video coding.

Authors: WANG Chongyu (王崇宇); MAO Qi (毛琪); JIN Libiao (金立标) (State Key Laboratory of Media Convergence and Communication, Communication University of China, Beijing 100024, China)

Affiliation: State Key Laboratory of Media Convergence and Communication, Communication University of China

Source: Journal of Communication University of China: Science and Technology (中国传媒大学学报（自然科学版）), 2022, No. 6, pp. 19-28 (10 pages)

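Funding: Special Project of the State Key Laboratory, Communication University of China (CUC22GZ035); National Natural Science Foundation of China, Young Scientists Fund (62201522).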

Keywords: generative adversarial network; image and video coding; neural networks

Classification number: TP391 [Automation and Computer Technology: Computer Application Technology]

