Image Cross-Domain Translation Algorithm Based on Self-Similarity and Contrastive Learning

Abstract: Image cross-domain transformation, also known as image translation, aims to convert images from a source domain into images of a target domain. Specifically, the generated image should preserve the structure of the source-domain image (contours, pose, etc.) while taking on the style of the target-domain image (texture, color, etc.). The technique is widely used in computer vision, for example in photo editing and video special-effects production. In recent years it has advanced rapidly on the basis of deep learning, especially generative adversarial networks, and has achieved impressive results. However, the generated images still suffer from problems such as color mode collapse and failure to preserve content structure. To address these problems, we propose an image cross-domain transformation algorithm based on self-similarity and contrastive learning. The algorithm uses a pre-trained deep neural network to extract the content and style features of the images. It takes a perceptual loss and a self-similarity-based loss as the content loss, and a relaxed optimal transport loss and a moment-matching loss as the style loss, to train the proposed network. For contrastive learning, generated images and target-domain images are labeled as positive pairs, and generated images and source-domain images are labeled as negative pairs. The proposed algorithm is validated experimentally on four datasets. The results show that it preserves the content structure of the source-domain images well, reduces color mode collapse, and makes the style of the generated images more consistent with that of the guidance images.
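The abstract names the loss terms but gives no formulas. The sketch below shows one common way each term could be written, to make the roles of the terms concrete. It is an illustration under stated assumptions, not the authors' implementation: the cosine distance, the nearest-neighbor relaxation of optimal transport, the mean-and-variance moment matching, the InfoNCE-style contrastive term, the `temperature` value, and all function names are assumptions. Features are plain lists of vectors standing in for activations of a pre-trained network. The perceptual loss is omitted because it needs such a network.

```python
import math

def cosine_distance(u, v):
    """1 - cosine similarity between two feature vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / (norm_u * norm_v + 1e-8)

def distance_matrix(xs, ys):
    """Pairwise cosine distances between two sets of feature vectors."""
    return [[cosine_distance(x, y) for y in ys] for x in xs]

def self_similarity_loss(src_feats, gen_feats):
    """Content term (assumed form): the pattern of pairwise distances inside
    the generated image should match the pattern inside the source image.
    Both inputs hold features sampled at the same spatial locations."""
    d_src = distance_matrix(src_feats, src_feats)
    d_gen = distance_matrix(gen_feats, gen_feats)
    n = len(src_feats)
    total = sum(abs(d_src[i][j] - d_gen[i][j])
                for i in range(n) for j in range(n))
    return total / (n * n)

def relaxed_ot_loss(gen_feats, style_feats):
    """Style term (assumed form): relaxed optimal transport. Each feature is
    matched to its nearest neighbor in the other set, and the larger of the
    two directional averages is returned."""
    d = distance_matrix(gen_feats, style_feats)
    rows, cols = len(d), len(d[0])
    gen_to_style = sum(min(row) for row in d) / rows
    style_to_gen = sum(min(d[i][j] for i in range(rows))
                       for j in range(cols)) / cols
    return max(gen_to_style, style_to_gen)

def channel_moments(feats):
    """Per-channel mean and variance of a set of feature vectors."""
    n, c = len(feats), len(feats[0])
    mean = [sum(f[k] for f in feats) / n for k in range(c)]
    var = [sum((f[k] - mean[k]) ** 2 for f in feats) / n for k in range(c)]
    return mean, var

def moment_matching_loss(gen_feats, style_feats):
    """Style term (assumed form): match the first two per-channel moments."""
    mean_g, var_g = channel_moments(gen_feats)
    mean_s, var_s = channel_moments(style_feats)
    diff = sum(abs(a - b) for a, b in zip(mean_g, mean_s))
    diff += sum(abs(a - b) for a, b in zip(var_g, var_s))
    return diff / len(mean_g)

def contrastive_loss(gen_vec, target_vec, source_vec, temperature=0.1):
    """InfoNCE-style term (assumed form): (generated, target) is the positive
    pair and (generated, source) is the negative pair."""
    pos = math.exp((1.0 - cosine_distance(gen_vec, target_vec)) / temperature)
    neg = math.exp((1.0 - cosine_distance(gen_vec, source_vec)) / temperature)
    return -math.log(pos / (pos + neg))
```

In this sketch the self-similarity term depends only on relationships between features of the same image, so it constrains structure without forcing the generated features to equal the source features. That is one way a content loss can coexist with a style loss that pulls the feature distribution toward the target domain.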

Authors: Zhao Lei; Zhang Huiming; Xing Wei; Lin Zhijie; Lin Huaizhong; Lu Dongming; Pan Xun; Xu Duanqing

Affiliations: School of Computer Science and Technology, Zhejiang University, Hangzhou 310027; School of Information and Electronic Engineering, Zhejiang University of Science and Technology, Hangzhou 310023; School of International Studies, Zhejiang University, Hangzhou 310027

Source: Journal of Computer Research and Development (《计算机研究与发展》), 2023, No. 4, pp. 930-946 (17 pages). Indexed in EI, CSCD, and the Peking University Core Journals list.

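Funding: National Key Research and Development Program of China (2020YFC1522704); National Natural Science Foundation of China (62172365); Zhejiang Provincial Natural Science Foundation (LY21F02005, LY19F020049); Major Program of the National Social Science Fund of China (19ZDA197); Zhejiang Provincial Cultural Relics Protection Science and Technology Project (2019011); Zhejiang Provincial "Jianbing" (Pioneer) Program (2022C01222); Project of the Key Scientific Research Base for Digital Conservation of Grotto Temple Cultural Relics, National Cultural Heritage Administration; Project of the MOE Frontier Science Center for Brain Science and Brain-Machine Integration, Zhejiang University (2021008).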

Keywords: cross-domain image transformation; self-similarity; contrastive learning; color mode collapse; style transfer

Classification: TP391 [Automation and Computer Technology: Computer Application Technology]
