基于条件残差生成对抗网络的风景图生成

Landscape image generation based on conditional residual generative adversarial network

下载PDF

导出

摘要风景图像的语义分割图中包含天空、白云、山川、树木、河流等大量类别信息,针对语义分割图中存在的信息类别过多、不同区域间的色彩变换不明显等问题,现有方法生成的风景图像在清晰度和真实性上效果并不理想。因此提出了一种基于条件残差生成对抗网络(CRGAN)方法,用于生成清晰度更高和内容更真实的风景图像。首先,优化生成器网络的上采样和下采样结构,提升生成器对语义分割图的特征提取效果。其次,在编码器和解码器之间使用跳跃连接传递语义分割图的特征信息,防止特征信息在编码器中传递丢失,保留特征信息的完整性。最后,在网络的编码器和解码器之间添加残差模块,以便更好地提取、传输和保留语义信息。此外,方法中采用均方差(MSE)提升语义分割图和生成图像之间的相似度。实验结果表明,相较于pix2pix和cyclegan方法,CRGAN生成的图像在FID指标中分别增加了26.769和119.333,有效提升了风景图像的清晰度和真实性。同时使用公共数据集验证了CRGAN的泛用性和有效性。 The semantic segmentation map of landscape image encompasses a large number of categorical information such as the sky,white clouds,mountains,rivers,and trees.In view of the challenges presented by the numerous information categories in the semantic segmentation map and the subtle color transformations between different regions,the landscape images generated by current methods are deficient in terms of both clarity and authenticity.Consequently,a method based on conditional residual generation adversarial network(CRGAN)was proposed to generate landscape images with a higher resolution and more realistic content.Firstly,the proposed method involved the upsampling and downsampling structures of the generator network to enhance the feature extraction effect of the generator on the semantic segmentation graph.Secondly,skip connections were utilized between the encoder and decoder to transmit the feature information from the semantic segmentation graph,ensuring the integrity of such information was retained,and not lost in the encoder.Finally,a residual module was added between the encoder and decoder of the network,facilitating better extraction,transmission,and retention of semantic information.In addition,the mean square error(MSE)was employed to enhance the similarity between semantically segmented graphs and generated images.The experimental results demonstrated that compared with pix2pix and cyclegan methods,the FID index of images generated by CRGAN increased by 26.769 and 119.333,respectively.This improvement effectively enhanced the clarity and authenticity of landscape images.The universality and validity of CRGAN were also validated using a common dataset.

作者邵俊棋钱文华徐启豪 SHAO Jun-qi;QIAN Wen-hua;XU Qi-hao(Department of Computer Science Engineering,School of Information Science and Engineering,Yunnan University,Kunming Yunnan 650504,China)

机构地区云南大学信息学院计算机科学与工程系

出处《图学学报》 CSCD 北大核心 2023年第4期710-717,共8页 Journal of Graphics

基金国家自然科学基金项目(62162065) 云南省科技厅应用基础研究计划重点项目(2019FA044) 云南省中青年学术技术带头人后备人才项目(2019HB121) 云南大学研究生科研创新项目(ZC-22222502)。

关键词生成对抗网络风景图像图像生成深度学习清晰度 generative adversarial network landscape image image generation deep learning clarity

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献5

1Shi-Min HU,Dun LIANG,Guo-Ye YANG,Guo-Wei YANG,Wen-Yang ZHOU.Jittor:a novel deep learning framework with meta-operators and unified graph execution[J].Science China(Information Sciences),2020,63(12):114-134. 被引量：16
2李彬,王平,赵思逸.基于双重注意力机制的图像超分辨重建算法[J].图学学报,2021,42(2):206-215. 被引量：11
3任好盼,王文明,危德健,高彦彦,康智慧,王全玉.基于高分辨率网络的人体姿态估计方法[J].图学学报,2021,42(3):432-438. 被引量：12
4林晓,屈时操,黄伟,郑晓妹,马利庄.显著区域保留的图像风格迁移算法[J].图学学报,2021,42(2):190-197. 被引量：11
5黄凯奇,赵鑫,李乔哲,胡世宇.视觉图灵:从人机对抗看计算机视觉下一步发展[J].图学学报,2021,42(3):339-348. 被引量：6

二级参考文献8

1黄凯奇,谭铁牛.视觉认知计算模型综述[J].模式识别与人工智能,2013,26(10):951-958. 被引量：12
2黄凯奇,任伟强,谭铁牛.图像物体分类与检测算法综述[J].计算机学报,2014,37(6):1225-1240. 被引量：195
3黄凯奇,陈晓棠,康运锋,谭铁牛.智能视频监控技术综述[J].计算机学报,2015,38(6):1093-1118. 被引量：399
4李白萍,韩新怡,吴冬梅.基于卷积神经网络的实时人群密度估计[J].图学学报,2018,39(4):728-734. 被引量：7
5吴珍发,赵皇进,郑国磊.人机任务仿真中虚拟人行为建模及仿真实现[J].图学学报,2019,40(2):410-415. 被引量：8
6刘瑜兴,王淑侠,徐光耀,兰望桂,何卫平.基于Leap Motion的三维手势交互系统研究[J].图学学报,2019,40(3):556-564. 被引量：14
7陈国军,杨静,程琰,尹鹏.基于RGBD的实时头部姿态估计[J].图学学报,2019,40(4):681-688. 被引量：8
8黄凯奇,兴军亮,张俊格,倪晚成,徐博.人机对抗智能技术[J].中国科学：信息科学,2020,50(4):540-550. 被引量：28

共引文献45

1马兰村.让人瞩目的服装业“名牌战略”[J].中外服装,2000(5):24-25.
2单嘉良,梁雨欢,冯培基.基于卷积神经网络的人体姿态估计方法研究[J].中国宽带,2021(3):183-183.
3杨国烨,周文洋,刘兰,张松海.基于包围盒回归的图像构图推荐[J].计算机辅助设计与图形学学报,2021,33(5):746-754.
4陈明瑶,徐琨,李晓旋.基于风格迁移的手势分割方法[J].计算机与现代化,2021(5):20-25.
5Wen-Yang Zhou,Guo-Wei Yang,Shi-Min Hu.Jittor-GAN:A fast-training generative adversarial network model zoo based on Jittor[J].Computational Visual Media,2021,7(1):153-157. 被引量：5
6Meng-Hao Guo,Jun-Xiong Cai,Zheng-Ning Liu,Tai-Jiang Mu,Ralph R.Martin,Shi-Min Hu.PCT:Point cloud transformer[J].Computational Visual Media,2021,7(2):187-199. 被引量：111
7郭元晨,蔡韵,张松海.基于空间注意力下边缘图融合的草图图像检索[J].计算机辅助设计与图形学学报,2021,33(6):847-854. 被引量：1
8李洪安,郑峭雪,张婧,杜卓明,李占利,康宝生.结合Pix2Pix生成对抗网络的灰度图像着色方法[J].计算机辅助设计与图形学学报,2021,33(6):929-938. 被引量：10
9杜娟,胡静.基于变分自编码器的现代服饰局部中国风格迁移[J].毛纺科技,2021,49(9):72-77. 被引量：1
10马晨凯,吴毅慧,傅华奇,业宁.基于深度学习的先进陶瓷零件实时缺陷检测系统[J].南京航空航天大学学报,2021,53(5):726-734. 被引量：9

1元文浩,陈强,刘杰,黄光造.基于残差生成对抗网络的光谱数据域适应研究[J].长江信息通信,2023,36(5):29-31.
2陶昕辰,朱涛,黄玉玲,高恬曼,何博,吴迪.基于DDR GAN的低质量图像增强算法[J].激光技术,2023,47(3):322-328. 被引量：5
3胡美慧,肖万幸.基于人脸识别的电力移动终端身份信息认证方法[J].自动化与仪器仪表,2023(6):89-92. 被引量：1

图学学报

2023年第4期

浏览历史

内容加载中请稍等...

基于条件残差生成对抗网络的风景图生成

参考文献5

二级参考文献8

共引文献45

相关作者

相关机构

相关主题

浏览历史