
Controllable multi-domain semantic artwork synthesis

Abstract: We present a novel framework for the multi-domain synthesis of artworks from semantic layouts. One of the main limitations of this challenging task is the lack of publicly available segmentation datasets for art synthesis. To address this problem, we propose a dataset called ArtSem that contains 40,000 images of artwork from four different domains, with their corresponding semantic label maps. We first extracted semantic maps from landscape photography and used a conditional generative adversarial network (GAN)-based approach to generate high-quality artwork from semantic maps without requiring paired training data. Furthermore, we propose an artwork-synthesis model that uses domain-dependent variational encoders for high-quality multi-domain synthesis. The model is complemented with a simple but effective normalization method that jointly normalizes semantics and style, which we call spatially style-adaptive normalization (SSTAN). In contrast to previous methods, which take only the semantic layout as input, our model jointly learns style and semantic representations, improving the quality of the generated artistic images. Our experiments further indicate that the model learns to separate the domains in the latent space; thus, we can perform fine-grained control of the synthesized artwork by identifying the hyperplanes that separate the different domains. Moreover, by combining the proposed dataset and approach, we generate user-controllable artworks of higher quality than those of existing approaches, as corroborated by quantitative metrics and a user study.
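The abstract describes SSTAN as a normalization method that jointly conditions on semantics and style. Below is a minimal sketch of one plausible, SPADE-like reading of that idea, in which per-pixel scale and shift are predicted from the semantic map concatenated with a broadcast style code; the class name, layer sizes, and hyperparameters are illustrative assumptions, not the authors' implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SpatialStyleAdaptiveNorm(nn.Module):
    """Hypothetical SSTAN-style layer: parameter-free normalization followed by
    per-pixel scale/shift predicted jointly from the semantic map and a style code."""
    def __init__(self, num_features, num_classes, style_dim, hidden=128):
        super().__init__()
        self.norm = nn.InstanceNorm2d(num_features, affine=False)
        # Shared trunk over the concatenated semantic map and broadcast style code.
        self.shared = nn.Sequential(
            nn.Conv2d(num_classes + style_dim, hidden, kernel_size=3, padding=1),
            nn.ReLU(inplace=True),
        )
        self.to_gamma = nn.Conv2d(hidden, num_features, kernel_size=3, padding=1)
        self.to_beta = nn.Conv2d(hidden, num_features, kernel_size=3, padding=1)

    def forward(self, x, segmap, style):
        # x: (B, C, H, W) activations; segmap: (B, num_classes, h, w) one-hot labels;
        # style: (B, style_dim) latent style/domain code.
        h = self.norm(x)
        segmap = F.interpolate(segmap, size=x.shape[2:], mode="nearest")
        style_map = style[:, :, None, None].expand(-1, -1, x.size(2), x.size(3))
        ctx = self.shared(torch.cat([segmap, style_map], dim=1))
        return h * (1 + self.to_gamma(ctx)) + self.to_beta(ctx)
```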
Source: Computational Visual Media (SCIE, EI, CSCD), 2024, Issue 2, pp. 355-373 (19 pages).
Funding: Supported by the Japan Science and Technology Agency Support for Pioneering Research Initiated by the Next Generation (JST SPRING) under Grant No. JPMJSP2124.
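The abstract also notes that fine-grained control is obtained by identifying hyperplanes that separate the domains in the latent space. The following is a small, hypothetical sketch of how such a boundary could be estimated from sampled latent codes and used to steer a sample; the linear classifier, step size, and function names are illustrative assumptions rather than the paper's procedure.

```python
import numpy as np
from sklearn.svm import LinearSVC

def domain_boundary(latents_a, latents_b):
    """Fit a linear classifier between latent codes drawn from two domains and
    return the unit normal of the resulting separating hyperplane."""
    X = np.concatenate([latents_a, latents_b], axis=0)
    y = np.concatenate([np.zeros(len(latents_a)), np.ones(len(latents_b))])
    clf = LinearSVC(C=1.0, max_iter=10000).fit(X, y)
    normal = clf.coef_[0]
    return normal / np.linalg.norm(normal)

def steer_latent(z, normal, strength=1.5):
    """Shift a latent code along the hyperplane normal to move the synthesized
    artwork towards the target domain while keeping other factors fixed."""
    return z + strength * normal
```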