Deep learning image inpainting combining semantic segmentation reconstruction and edge reconstruction (联合语义分割与边缘重建的深度学习图像修复)

Cited by: 7


Abstract

Objective: Image inpainting reconstructs the missing areas of damaged images. It is widely used in image editing, image denoising, and cultural relics preservation. Conventional inpainting methods either fill the missing pixels with patches or propagate pixels into the missing region through a diffusion mechanism. These methods work for regular textures and small defects, but because they lack an understanding of high-level image semantics, they can only handle small damaged areas with simple structure and texture; when they fill large holes, the generated images often look unrealistic and have an inconsistent semantic structure. Existing end-to-end deep learning inpainting methods overcome this limitation by learning high-level semantic information from a large number of training images. Although these methods have made significant progress, they often fail to reconstruct plausible structures: they try to restore the whole target without sufficient constraints, so the inpainted images often have blurred boundaries and distorted structures. To address this, we propose a deep learning image inpainting method jointly guided by semantic segmentation structure and edge structure.

Method: The method divides the inpainting task into three stages: 1) semantic segmentation reconstruction, 2) edge reconstruction, and 3) content restoration. First, the semantic segmentation reconstruction module reconstructs the semantic segmentation structure of the missing area. Then, the reconstructed semantic segmentation structure guides the reconstruction of the edge structure of the missing area. Finally, the reconstructed semantic segmentation structure and edge structure jointly guide the content restoration of the missing area. Three observations motivate this design. 1) Semantic segmentation represents the global structure of the image well, so reconstructing it first improves the accuracy of the reconstructed edge structure. 2) Edges contain rich structural information, so reconstructing the edge structure helps generate more of the inner details of objects. 3) Under the guidance of the reconstructed semantic segmentation structure and edge structure, content restoration produces images with clearer boundaries, more reasonable structure, and more realistic texture. The network is based on the generative adversarial network (GAN) and consists of a generator and a discriminator. The generator uses an encoder-decoder structure, and the discriminator uses a 70×70 PatchGAN structure. Each of the three stages is trained with a joint loss that pushes its output toward the real result. The semantic segmentation and edge reconstruction modules use adversarial loss and feature matching loss. The feature matching loss, which is built on the L1 loss, is similar to perceptual loss and helps the reconstructed semantic segmentation and edge structures match their ground truth. The content restoration module additionally uses perceptual loss and style loss; the style loss reduces the "checkerboard" artifacts caused by transposed convolution layers.

Result: First, we analyze the semantic segmentation reconstruction module quantitatively and qualitatively. The results show that the module can reconstruct a plausible semantic segmentation structure: the pixel accuracy reaches 99.16% when the mask is small and still reaches 92.64% for larger masks. Next, we compare the edge reconstruction results quantitatively. The accuracy and recall of the reconstructed edge structure improve further under the guidance of the semantic segmentation structure. Finally, we compare the proposed method with four popular image inpainting methods on the CelebAMask-HQ (CelebFaces Attributes Mask High Quality) face dataset and the Cityscapes urban scene dataset. At a mask ratio of 50%~60%, compared with the second-best method, our method reduces the mean absolute error (MAE) by 4.5%, increases the peak signal-to-noise ratio (PSNR) by 1.6%, and increases the structural similarity index measure (SSIM) by 1.7% on CelebAMask-HQ; on Cityscapes it reduces the MAE by 4.2%, increases the PSNR by 1.5%, and increases the SSIM by 1.9%. Our method outperforms the compared methods on all three metrics, and the generated images have clearer boundaries and are visually more plausible.

Conclusion: The proposed three-stage image inpainting method introduces the guidance of the semantic segmentation structure, which significantly improves the accuracy of edge reconstruction. In addition, the joint guidance of the semantic segmentation structure and the edge structure effectively reduces structure reconstruction errors. When the inpainting involves large missing areas, the method achieves higher inpainting quality than existing methods.
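The data flow of the three stages can be sketched as follows. This is only an illustration under stated assumptions: the function `inpaint`, the `InpaintResult` container, and the three stage callables are hypothetical names, and the exact inputs each network receives are assumed rather than taken from the paper. The sketch shows only the order of the stages and which reconstructed structure guides which later stage.

```python
from typing import Any, Callable, NamedTuple


class InpaintResult(NamedTuple):
    """Outputs of the three stages (hypothetical container)."""
    segmentation: Any  # reconstructed semantic segmentation structure
    edges: Any         # reconstructed edge structure
    image: Any         # completed image


def inpaint(
    image: Any,
    mask: Any,
    seg_stage: Callable[[Any, Any], Any],
    edge_stage: Callable[[Any, Any, Any], Any],
    content_stage: Callable[[Any, Any, Any, Any], Any],
) -> InpaintResult:
    """Run the three stages in order.

    The stage callables stand in for trained generators; their
    signatures are assumptions made for this sketch.
    """
    # Stage 1: reconstruct the semantic segmentation of the missing area.
    segmentation = seg_stage(image, mask)
    # Stage 2: reconstruct edges, guided by the segmentation from stage 1.
    edges = edge_stage(image, mask, segmentation)
    # Stage 3: fill in content, guided by both reconstructed structures.
    completed = content_stage(image, mask, segmentation, edges)
    return InpaintResult(segmentation, edges, completed)
```

Passing the stages in as callables keeps the sketch independent of any deep learning framework; in practice each callable would wrap a trained generator network.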

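The evaluation measures named in the abstract can be illustrated with a minimal sketch. It assumes images are flattened into sequences of pixel values in the range 0 to 255; the value range and any normalization actually used in the paper are not stated in the abstract. The helper names are hypothetical. SSIM is omitted because it requires windowed local statistics that do not fit a short example.

```python
import math
from typing import Sequence


def mae(pred: Sequence[float], target: Sequence[float]) -> float:
    """Mean absolute error between two equally sized pixel sequences."""
    if len(pred) != len(target) or not pred:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(p - t) for p, t in zip(pred, target)) / len(pred)


def psnr(pred: Sequence[float], target: Sequence[float],
         max_value: float = 255.0) -> float:
    """Peak signal-to-noise ratio in dB (max_value is an assumed range)."""
    if len(pred) != len(target) or not pred:
        raise ValueError("inputs must be non-empty and of equal length")
    mse = sum((p - t) ** 2 for p, t in zip(pred, target)) / len(pred)
    if mse == 0:
        return math.inf  # identical inputs
    return 10.0 * math.log10(max_value ** 2 / mse)


def pixel_accuracy(pred_labels: Sequence[int],
                   true_labels: Sequence[int]) -> float:
    """Fraction of pixels whose predicted class label is correct."""
    if len(pred_labels) != len(true_labels) or not pred_labels:
        raise ValueError("inputs must be non-empty and of equal length")
    hits = sum(1 for p, t in zip(pred_labels, true_labels) if p == t)
    return hits / len(pred_labels)


def relative_change(ours: float, baseline: float) -> float:
    """Signed change of `ours` relative to `baseline`, in percent."""
    return 100.0 * (ours - baseline) / baseline
```

A statement such as "MAE reduced by 4.5%" corresponds to `relative_change(ours, baseline)` being about -4.5 when `baseline` is the MAE of the second-best method.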
Authors: Yang Hongju (杨红菊); Li Liqin (李丽琴); Wang Ding (王鼎) (School of Computer and Information, Shanxi University, Taiyuan 030006, China; Computational Intelligence and Chinese Information Processing of Ministry of Education, Shanxi University, Taiyuan 030006, China)

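Affiliations: School of Computer and Information Technology, Shanxi University; Key Laboratory of Computational Intelligence and Chinese Information Processing of Ministry of Education, Shanxi University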

Source: Journal of Image and Graphics (《中国图象图形学报》), CSCD, Peking University Core Journal, 2022, No. 12, pp. 3553-3565 (13 pages)

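Funding: National Natural Science Foundation of China (61976128); Science and Technology Innovation Project of Higher Education Institutions of Shanxi Province (2019L0103); Shanxi Province Research Funding Project for Returned Overseas Scholars (2022-008).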

Keywords: image inpainting; generative adversarial network (GAN); semantic segmentation; edge detection; deep learning

Classification: TP391 [Automation and Computer Technology: Computer Application Technology]

