Abstract
Image inpainting has extensive applications in photo editing and object removal. Existing deep-learning inpainting models are limited by the receptive field of convolution operators, which leads to distorted structures or blurred textures in the restored results. To address this, a locally optimized generative model, LesT-GAN, was proposed. The model comprised a generator and a discriminator. The generator was built from locally enhanced sliding-window Transformer modules, which combined the translation invariance and locality advantages of depthwise convolution with the Transformer's ability to model global information; as a result, it could cover a large receptive field while refining local details. The discriminator was a relativistic average discriminator based on mask guidance and patches. By estimating the average probability that a given real image was more realistic than a generated one, it simulated pixel propagation around the boundary of the missing region, so that during training the generator could learn sharper local textures directly from real images. In comparison experiments with other state-of-the-art image inpainting methods on the Places2, CelebA-HQ, and Paris StreetView datasets, LesT-GAN improved the L1 and FID metrics by 10.8% and 41.36%, respectively. Experimental results demonstrated that LesT-GAN achieved superior restoration across multiple scenes and generalized well to images of higher resolution than those used during training.
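The discriminator objective described in the abstract follows the relativistic average (RaGAN-style) formulation: the discriminator scores real patches against the average score of generated patches, and vice versa. The following is a minimal NumPy sketch of that objective only; the function name, the patch-score inputs, and the omission of the paper's mask guidance are illustrative assumptions, not the authors' exact implementation.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def ra_discriminator_loss(c_real, c_fake):
    """Relativistic average discriminator loss (RaGAN-style sketch).

    c_real / c_fake: raw (pre-sigmoid) discriminator scores for patches of
    real and generated images. The loss pushes real patches to score above
    the *average* generated patch, and generated patches below the *average*
    real patch.
    """
    real_vs_avg_fake = sigmoid(c_real - c_fake.mean())  # P(real more realistic than avg fake)
    fake_vs_avg_real = sigmoid(c_fake - c_real.mean())  # P(fake more realistic than avg real)
    return (-np.mean(np.log(real_vs_avg_fake))
            - np.mean(np.log(1.0 - fake_vs_avg_real)))
```

An indifferent discriminator (equal scores for real and fake) yields a loss of 2·log 2 ≈ 1.386; a discriminator that cleanly separates real from generated patches drives the loss toward zero.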
Authors
YANG Hong-ju; GAO Min; ZHANG Chang-you; BO Wen; WU Wen-jia; CAO Fu-yuan (School of Computer and Information, Shanxi University, Taiyuan, Shanxi 030006, China; Key Laboratory of Computational Intelligence and Chinese Information Processing of Ministry of Education, Shanxi University, Taiyuan, Shanxi 030006, China; Institute of Software, Chinese Academy of Sciences, Beijing 100190, China)
Source
Journal of Graphics (《图学学报》)
CSCD; Peking University Core Journal (北大核心)
2023, No. 5, pp. 955-965 (11 pages)
Funding
National Natural Science Foundation of China (61976128)
Shanxi Scientific Research Project for Returned Overseas Scholars (2022-008).