An overview of image editing based on diffusion models

Abstract: With their introduction and rapid development, diffusion models have begun to challenge the dominance of generative adversarial networks (GANs) in image generation and image editing, owing to their highly interpretable mathematical properties and the high quality and diversity of their outputs. Image editing based on diffusion models has become a research hotspot in computer vision. This paper first introduces the task definition of image editing and the basic principles of diffusion models. It then categorizes and traces the development of diffusion-based image editing techniques. Next, it reviews the evaluation metrics and datasets commonly used in image editing and compares classical methods on different datasets, both qualitatively and quantitatively. Finally, it summarizes the current state of diffusion-based image editing and discusses future prospects.

Authors: MAO Qi; FANG Zhen; CHEN Lan; CHEN Haokun (School of Information and Communication Engineering, Communication University of China, Beijing 100024, China)

Affiliation: School of Information and Communication Engineering, Communication University of China

Source: Journal of Communication University of China: Science and Technology, 2024, No. 4, pp. 38-54 (17 pages)

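Funding: Young Scientists Fund of the National Natural Science Foundation of China (62201522); sub-project of the National Key R&D Program of China (2022YFF0902402).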

Keywords: image editing; computer vision; diffusion model

Classification number: TP391.4 [Automation and Computer Technology: Computer Application Technology]
