Abstract
With the development of generative adversarial networks (GANs), unsupervised image-to-image translation (UI2I) from a single sample has made significant progress. However, previous methods fail to capture the complex textures in an image while preserving its original content information. To address this problem, we propose SUGAN, a novel one-shot image translation framework built on a scale-variable U-Net structure (Scale-Unet). SUGAN uses Scale-Unet as its generator, employing a multi-scale structure and a progressive training scheme to continuously refine the network and learn image features from coarse to fine. Meanwhile, a scale-pixel loss is proposed to better constrain the preservation of the original content information and prevent information loss. Experiments on the public Summer↔Winter and Horse↔Zebra datasets show that, compared with SinGAN, TuiGAN, TSIT, StyTR2 and other methods, the SIFID of the images generated by our method is reduced by 30% on average. The proposed method better preserves image content information while generating detailed, realistic, high-quality images.
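The abstract names a scale-pixel loss that constrains content preservation but gives no formula. Below is a minimal PyTorch sketch of one plausible reading, assuming the loss is a pixel-wise L1 distance between the source image and the translated output accumulated over a downsampled image pyramid; the function name, the number of scales, and the uniform averaging are illustrative assumptions, not the paper's definition.

```python
import torch
import torch.nn.functional as F

def scale_pixel_loss(src: torch.Tensor, out: torch.Tensor,
                     num_scales: int = 3) -> torch.Tensor:
    """Hypothetical multi-scale pixel loss: L1 distance between the source
    image and the translated output, accumulated over a coarse-to-fine
    pyramid, to penalize loss of the original content information."""
    loss = src.new_zeros(())
    for s in range(num_scales):
        if s > 0:
            factor = 0.5 ** s  # downsample to 1/2, 1/4, ... resolution
            a = F.interpolate(src, scale_factor=factor,
                              mode='bilinear', align_corners=False)
            b = F.interpolate(out, scale_factor=factor,
                              mode='bilinear', align_corners=False)
        else:
            a, b = src, out  # full resolution
        loss = loss + F.l1_loss(a, b)
    return loss / num_scales

# Usage on dummy tensors standing in for a source batch and a generator output:
src = torch.rand(1, 3, 256, 256)
out = torch.rand(1, 3, 256, 256)
print(scale_pixel_loss(src, out))
```

Comparing at several resolutions penalizes both fine-texture drift (finest scale) and structural drift (coarsest scale), which matches the abstract's stated goal of preventing content information loss across the coarse-to-fine pipeline.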
Authors
ZHOU Peng-bo (周蓬勃), FENG Long (冯龙), KOU Yu-fan (寇宇帆)
School of Art and Media, Beijing Normal University, Beijing 100032, China; School of Information Science and Technology, Northwest University, Xi'an 710127, China
Source
Computer Technology and Development (《计算机技术与发展》)
2024, Issue 4, pp. 55-61 (7 pages)
Funding
National Natural Science Foundation of China (62271393)
Open Project of the Key Laboratory of the Ministry of Culture and Tourism, National Museum of China (CRRT2021K01)
Key Research and Development Program of Shaanxi Province (2019GY-215, 2021ZDLSF06-04)