摘要
随着扩散模型的提出与迅速发展,依托其高度可解释的数学特性及高质量和多样性的结果,扩散模型逐渐打破对抗生成网络在图像生成和图像编辑领域的垄断地位,基于扩散模型的图像编辑逐渐成为计算机视觉领域的研究热点。本文首先介绍了图像编辑的任务定义和扩散模型的基本原理;其次重点分类依次介绍了基于扩散模型的图像编辑技术的发展历程;然后总结了图像编辑领域常用的评价指标和数据集,同时定性和定量比较了经典方法在不同数据集上的效果;最后对基于扩散模型的图像编辑现状进行总结和展望。
With the introduction and rapid development of diffusion models,these frameworks have begun to challenge the dominance of generative adversarial networks(GANs)in the realms of image generation and editing,thanks to their highly interpretable mathematical properties and the high quality and diversity of their outputs.Image editing based on diffusion models is emerging as a research hotspot in the field of computer vision.In this paper the task definition of image editing and the basic principles of diffusion models were first introduced.Then the developmental trajectory of image editing techniques based on diffusion models was categorized and detailed.Furthermore,common evaluation metrics and datasets used in the image editing domain were reviewed,and both qualitative and quantitative comparisons of classical methods across various datasets were provided.Finally,the current state and prospects of image editing based on diffusion models were summarized.
作者
毛琪
方镇
陈澜
陈浩坤
MAO Qi;FANG Zhen;CHEN Lan;CHEN Haokun(School of Information and Communication Engineering,Communication University of China,Beijing 100024,China)
出处
《中国传媒大学学报(自然科学版)》
2024年第4期38-54,共17页
Journal of Communication University of China:Science and Technology
基金
国家自然科学青年基金项目(62201522)
国家重点研发计划子课题(2022YFF0902402)。
关键词
图像编辑
计算机视觉
扩散模型
image editing
computer vision
diffusion model