
Image extraction of cartoon line art based on cycle-consistent adversarial networks
Cited by: 2
Abstract
Objective: Line art drawing and coloring in animation production are time-consuming and labor-intensive, so much research has been devoted to automating the production process. Data-driven automation work is developing rapidly, but no public line art dataset is available for it. To address the difficulty of obtaining real line art images and the distortion produced by existing line art extraction methods, this paper proposes an automatic line art extraction model based on cycle-consistent adversarial networks. Method: The model is built on a cycle-consistent adversarial network structure to handle training on unpaired data. Input images at different scales and their boundary maps are fed into a mask-guided convolution unit that adaptively selects intermediate network features. To further improve the extracted line art, a boundary consistency loss function is proposed to keep the gradient changes of the generated result consistent with those of the input image. Result: On the public anime color image dataset Danbooru2018, the line art extracted by the proposed model has less noise and clearer lines than that of existing extraction methods and is closer to line art drawn by real cartoonists. Thirty users aged 20 to 25 were invited to score the line art extracted by this method and four others; across 30 groups of test samples, the proposed method's results were rated best in 84% of the samples. Conclusion: Introducing mask-guided units into a cycle-consistent adversarial network extracts line art from color images more faithfully, and the user study confirms that the proposed method outperforms the compared methods on anime line art extraction. Moreover, the model does not require a large amount of real line art training data: only about 1,000 real line art images were collected for the experiments. The model provides data support for subsequent research on animation drawing and coloring and offers a new solution for image edge extraction.

Extended abstract
Objective: With the continuous development of digital media, the demand for animation works keeps increasing. Excellent two-dimensional animation usually requires a great deal of time and effort. In the production process, the key-frame line art is drawn by the lead artist, the in-between frames are drawn by several ordinary animators, and finally all line art images are colored by the coloring staff. To improve the efficiency of two-dimensional animation production, researchers have committed to automating the production process. Data-driven deep learning technology is developing rapidly and provides a new way to improve the production efficiency of animation works. Although many data-driven automated methods have been proposed, their training datasets are very difficult to obtain, and there is no public dataset of corresponding color images and line art images. Work on automatically extracting line art from color animation images therefore provides data support for animation-production research.

Method: Early image edge extraction methods depend on hand-set parameter values, and fixed parameters cannot be applied to all images, while data-driven edge extraction methods are limited by the collection and size of their datasets. Researchers therefore usually rely on data augmentation or on images similar to line art, such as boundary images (edge information extracted from color images). This study proposes an automatic line art extraction model based on cycle-consistent adversarial networks to address the difficulty of obtaining real line art images and the distortion of existing extraction methods. First, the model uses a cycle-consistent adversarial network structure so that no paired color and line art images are required: it learns its parameters from a small number of collected real line art images and a large number of color images. Then, a mask-guided convolution unit and a mask-guided residual unit are proposed to better select the intermediate output features of the network. Specifically, input images at different scales and their corresponding boundary maps are fed into the mask-guided convolution unit to learn mask parameters for the intermediate feature layers, where the boundary map determines the line regions of the line art image and the input image provides prior information. To ensure that no information is lost during encoding, the network uses no pooling or other lossy operations; instead, the image resolution is reduced by controlling the convolution kernel size and stride, as in the sketch below.
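The paper's code is not reproduced here, so the following is only a minimal PyTorch sketch of how a mask-guided convolution unit of this kind could look, based on the abstract's description (features gated by a mask learned from the scaled input image and its boundary map, with strided convolutions instead of pooling). The module name, channel sizes, and sigmoid gating are assumptions, not the authors' implementation.

```python
# Hypothetical sketch of a mask-guided convolution unit (MGCU). Module and
# parameter names are assumptions, not the authors' released code.
import torch
import torch.nn as nn

class MaskGuidedConv(nn.Module):
    """Gate intermediate features with a mask learned from the scaled
    input image and its boundary map."""

    def __init__(self, in_ch: int, out_ch: int, stride: int = 2):
        super().__init__()
        # Resolution is reduced with strided convolutions rather than pooling,
        # matching the paper's "no lossy pooling" design choice.
        self.feat = nn.Sequential(
            nn.Conv2d(in_ch, out_ch, kernel_size=3, stride=stride, padding=1),
            nn.InstanceNorm2d(out_ch),
            nn.ReLU(inplace=True),
        )
        # Mask branch: 3 color channels + 1 boundary channel -> one gate per
        # output channel; the sigmoid keeps each gate in [0, 1].
        self.mask = nn.Sequential(
            nn.Conv2d(4, out_ch, kernel_size=3, stride=stride, padding=1),
            nn.Sigmoid(),
        )

    def forward(self, x, image, boundary):
        # The caller resizes image/boundary to the spatial size of x.
        gate = self.mask(torch.cat([image, boundary], dim=1))
        return self.feat(x) * gate

# Toy usage on one 256x256 feature map.
x = torch.randn(1, 32, 256, 256)        # intermediate features
image = torch.randn(1, 3, 256, 256)     # input image at this scale (prior)
boundary = torch.randn(1, 1, 256, 256)  # boundary map (marks line regions)
print(MaskGuidedConv(32, 64)(x, image, boundary).shape)  # [1, 64, 128, 128]
```

The gating is the key idea: the boundary map tells the unit where line regions are, and the input image supplies prior information, so the learned mask can suppress or pass each intermediate feature channel accordingly.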
Finally, this study proposes a boundary consistency loss function. Since no supervision corresponding to the input image is available, the loss is designed to measure the difference between the gradient information of the input image and that of the output image, and a regular constraint is added to keep the generated result consistent with the gradient of the input image; in short, the gradients of the input image and of the generated image are constrained to agree. A toy version of this loss is sketched below.
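Purely as an illustration: the abstract does not give the loss in closed form, so the Sobel gradients, the Gaussian weighting, and the extra smoothness term below are assumptions about one plausible shape of a gradient-consistency loss with a Gaussian regular term.

```python
# Hypothetical sketch of a boundary consistency loss: penalize disagreement
# between the input's and the output's image gradients, with a Gaussian-style
# weight emphasizing strong-gradient (boundary) regions and an extra term
# that smooths weak-gradient regions. The exact form is an assumption.
import torch
import torch.nn.functional as F

def sobel_gradients(img: torch.Tensor):
    """Sobel x/y gradients of a single-channel batch (N, 1, H, W)."""
    kx = torch.tensor([[-1., 0., 1.], [-2., 0., 2.], [-1., 0., 1.]])
    ky = kx.t().contiguous()
    kx = kx.view(1, 1, 3, 3).to(img)
    ky = ky.view(1, 1, 3, 3).to(img)
    return F.conv2d(img, kx, padding=1), F.conv2d(img, ky, padding=1)

def boundary_consistency_loss(inp, out, sigma: float = 1.0):
    gx_i, gy_i = sobel_gradients(inp)
    gx_o, gy_o = sobel_gradients(out)
    diff = (gx_i - gx_o) ** 2 + (gy_i - gy_o) ** 2
    # Gaussian regular term: weight is near 1 where the input gradient is
    # strong (boundaries stay crisp) and near 0 in flat regions.
    mag = gx_i ** 2 + gy_i ** 2
    weight = 1.0 - torch.exp(-mag / (2.0 * sigma ** 2))
    # Flat regions of the input should stay flat in the output (less noise).
    smooth = ((1.0 - weight) * (gx_o ** 2 + gy_o ** 2)).mean()
    return (weight * diff).mean() + smooth

# Toy usage: grayscale input image and generated line art.
inp = torch.rand(1, 1, 64, 64)
out = torch.rand(1, 1, 64, 64)
print(boundary_consistency_loss(inp, out))
```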
Result: On the public anime color image dataset Danbooru2018, the line art extracted by this method is compared with the results of the Canny edge detector, cycle-consistent adversarial networks (CycleGAN), holistically-nested edge detection (HED), and sketchKeras. The Canny operator extracts only the position information of image gradients. The lines extracted by CycleGAN are blurred and suffer from missing information, and lines in some regions cannot be extracted correctly. The line art extracted by HED has clear outer contours but seriously lacks internal detail. The line art extracted by sketchKeras is closer to an edge-information image and retains rich gradient-change information, which leaves its lines unclear and noisy. The results of the proposed model are not only clear and low in noise but also closer to what a human animator would draw. To assess the method's practical performance, 30 users aged 20 to 25 were invited to score the cartoon line art extracted by the five methods on 30 groups of test samples. Each user selected the best line art in each group according to whether the lines were clear, whether noise was present, and whether the result was close to a real cartoonist's line art. The statistics show that the line art extracted by the proposed method is superior to that of the other methods in image quality and authenticity. Moreover, the method can extract line art not only from color animation images but also from real color images: in the experiments, the model extracted line art from real-world color images and obtained results similar to animation line art. The model is also better at extracting black border lines, possibly because the borders of the color animation images in the training set are black.

Conclusion: This study proposes a model for extracting line art images from color animation images. It trains the network parameters on unpaired data and does not require a large number of real cartoon line art images. The proposed mask-guided convolution unit and mask-guided residual unit constrain the intermediate features of the network through the input image and its corresponding boundary map, yielding clearer lines. The proposed boundary consistency loss function introduces a Gaussian regular term that makes boundaries in regions of strong gradient change more pronounced and regions of weak gradient change smoother, reducing noise in the generated line art. Finally, the method extracts line art from the public animation color dataset Danbooru2018, provides data support for subsequent line art drawing and coloring research, and can also extract results similar to an animator's sketch from real color images.
Authors: Wang Suqin (王素琴), Zhang Jiaqi (张加其), Shi Min (石敏), Zhao Yinjun (赵银君), School of Control & Computer Engineering, North China Electric Power University, Beijing 102206, China
Source: Journal of Image and Graphics (《中国图象图形学报》), CSCD / Peking University Core Journal, 2021, No. 5, pp. 1117-1127 (11 pages)
Funding: National Natural Science Foundation of China (No. 61972379).
Keywords: cartoon line art image generation; unpaired training data; mask-guided convolution unit (MGCU); cycle-consistent adversarial network (CycleGAN); convolutional neural network (CNN)