Automatic art analysis based on adaptive multi-task learning (自适应多任务学习的自动艺术分析)

Cited by: 1

Abstract

Objective: The digitization of artworks offers a major opportunity to study art from a computer vision perspective. To better provide digital art museums with artwork classification and art retrieval functions, help people understand the meaning of artworks in depth, promote traditional culture and advance the protection of cultural heritage, this paper introduces multi-task learning into automatic art analysis and proposes an original adaptive multi-task learning method based on Bayesian theory. Method: Based on hierarchical Bayesian theory, the correlation among tasks is used to introduce a loss function model with a task-cluster constraint. Following Bayesian modeling, a multi-task loss function is constructed by maximizing the Gaussian likelihood with uncertainty, which yields an adaptive multi-task learning model. The model extends easily to any learning tasks of the same kind and, compared with other recent models, improves learning performance and achieves better analysis results. Result: The proposed method addresses the difficulty of deciding the relative weights of the individual task losses in multi-task learning by determining the loss weights automatically. To evaluate it, artwork classification and cross-modal art retrieval experiments were carried out on SemArt, a multi-modal dataset for semantic art understanding. In artwork classification, the method improves on fixed-weight multi-task learning by 4.43% on the “Timeframe” attribute, and it also outperforms existing methods that determine loss weights automatically. In cross-modal art retrieval, the improvement over the most recent knowledge-graph-based model using the “Author” attribute is 9.91%, consistent with the classification results. Conclusion: The proposed method adaptively learns the weight of each task within a multi-task learning framework and significantly improves the performance of automatic art analysis compared with currently popular methods.

Extended abstract

Objective: Multi-task learning aims to improve learning efficiency and prediction accuracy by tackling multiple tasks jointly, on the assumption that generic features shared across tasks precede task-specific features. It has been applied in a variety of computer vision applications, including object detection and tracking, object recognition, person identification and facial attribute classification. The worldwide digitization of artworks has opened art research to computer vision and further facilitated cultural heritage preservation. Automatic art analysis covers the study of artistic style, painting content and attribute-oriented analysis. Our multi-task learning approach to automatic art analysis draws on historical, social and artistic information. Existing multi-task joint learning methods learn multiple tasks through a weighted sum of losses whose weights are laborious and time-consuming to tune. Our method provides art classification and art retrieval tools for digital art museums, which helps researchers understand the meaning of artworks in depth and supports research on traditional cultural heritage. Method: We propose a multi-objective learning method based on Bayesian theory. Following the Bayesian analysis, we exploit the correlation among tasks and introduce a task-cluster (clustering) constraint on the model. We then formulate a multi-task loss function by maximizing a Gaussian likelihood with homoscedastic, task-dependent uncertainty in Bayesian modeling. Result: To address the art classification and art retrieval tasks, we use the SemArt dataset, a recent multi-modal benchmark for semantic art understanding, which is designed for cross-modal retrieval of art paintings and can be readily adapted to painting classification. The dataset contains 21384 painting images, randomly split into training, validation and test sets of 19244, 1069 and 1069 samples, respectively. First, we conduct art classification experiments on SemArt and evaluate performance by classification accuracy, i.e., the proportion of correctly predicted paintings among all paintings in the test set. The classification results show that our adaptive multi-task learning model outperforms the previous multi-task learning model, in which the weight of each task is fixed. For example, on the “Timeframe” classification task, the improvement over the previous model is about 4.43%. To compute task-specific weights, the previous model is limited by its need for two backpropagation passes. The classification results also confirm the importance of introducing weighting constraints in our model. Next, we evaluate our model on cross-modal art retrieval. Experiments follow the Text2Art Challenge evaluation, in which paintings are ranked by their similarity to a given text, and vice versa. The rankings are evaluated on the test set by median rank and recall at K, with K being 1, 5 and 10. Median rank is the value separating the higher half from the lower half of the positions at which the relevant item is ranked across all samples, whereas recall at K is the fraction of samples whose relevant image appears in the top K positions of the ranking. Compared with the most recent knowledge-graph-based model using the author attribute, the improvement is about 9.91% on average, which is consistent with the classification results. Finally, we compare our model with human evaluators. Given an artistic text containing the comment, title, author, type, school and timeframe, participants are asked to pick the most appropriate painting from a collection of 10 images. The task has two levels: at the easy level, the 10 images are randomly selected from the test set; at the difficult level, the 10 images share the same attribute category (e.g., portraits or landscapes). All participants perform the task for 100 artistic texts at each level. Performance is reported as the proportion of correct answers among all responses. The results show that the accuracy of our model is close to that of human evaluators. Conclusion: We use an adaptive multi-task learning method based on Bayesian theory to weight multiple loss functions for automatic art analysis. We conduct several experiments on a publicly available art dataset, and the results cover both art classification and art retrieval.
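The abstract describes a multi-task loss obtained by maximizing a Gaussian likelihood with homoscedastic uncertainty, so that task weights are learned rather than fixed. The following is a minimal sketch of one common form of such a loss, total = sum_i exp(-s_i) * L_i + s_i with s_i = log(sigma_i^2); this form is an assumption, since the abstract gives neither the paper's exact loss nor its task-cluster constraint, and neither is reproduced here. The four task names follow the attributes mentioned in the abstract, while the loss values and helper names are hypothetical.

```python
import math


def uncertainty_weighted_loss(task_losses, log_vars):
    """Combine per-task losses L_i using log-variances s_i = log(sigma_i^2).

    total = sum_i exp(-s_i) * L_i + s_i
    exp(-s_i) acts as the task weight; the + s_i term keeps s_i from growing
    without bound (which would otherwise drive every weight to zero).
    """
    if len(task_losses) != len(log_vars):
        raise ValueError("need one log-variance per task")
    return sum(math.exp(-s) * loss + s for loss, s in zip(task_losses, log_vars))


def log_var_gradients(task_losses, log_vars):
    """Analytic gradient of the total loss: d(total)/d(s_i) = 1 - exp(-s_i) * L_i."""
    return [1.0 - math.exp(-s) * loss for loss, s in zip(task_losses, log_vars)]


def fit_log_vars(task_losses, steps=2000, lr=0.05):
    """Plain gradient descent on s_i with the task losses held fixed.

    Illustration only: in real training the network weights and the s_i
    are updated together, so the L_i change from step to step.
    """
    log_vars = [0.0] * len(task_losses)
    for _ in range(steps):
        grads = log_var_gradients(task_losses, log_vars)
        log_vars = [s - lr * g for s, g in zip(log_vars, grads)]
    return log_vars


# Hypothetical loss values for four attribute classifiers (not from the paper).
losses = {"type": 0.4, "school": 0.9, "timeframe": 1.6, "author": 2.5}
log_vars = fit_log_vars(list(losses.values()))
weights = [math.exp(-s) for s in log_vars]

for name, loss, weight in zip(losses, losses.values(), weights):
    print(f"{name}: loss={loss:.2f} learned weight={weight:.3f}")
```

With the losses held fixed, each s_i settles at log(L_i), so the learned weight of a task is the reciprocal of its loss: tasks with larger losses are down-weighted automatically, which is the behaviour that replaces a hand-tuned weighted sum.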

Authors: Yang Bing (杨冰); Xiang Xueqin (向学勤); Kong Wanzeng (孔万增); Shi Yan (施妍); Yao Jinliang (姚金良)

Affiliations: College of Computer Science and Technology, Hangzhou Dianzi University, Hangzhou 310018, China; Key Laboratory of Brain Machine Collaborative Intelligence of Zhejiang Province, Hangzhou 310018, China; uSens Incorporated Company (杭州凌感科技有限公司), Hangzhou 310051, China; School of Media and Design, Hangzhou Dianzi University, Hangzhou 310018, China

Source: Journal of Image and Graphics (《中国图象图形学报》), CSCD, PKU Core, 2022, No. 4, pp. 1226-1237 (12 pages)

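Funding: National Natural Science Foundation of China (61633010, U1909202); Zhejiang Provincial Basic Public Welfare Research Program (LGG22F020027); Key Research and Development Program of Zhejiang Province (2020C04009); Key Laboratory of Brain Machine Collaborative Intelligence of Zhejiang Province (2020E10010).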

Keywords: automatic art analysis; adaptive multi-task learning; Bayesian theory; art classification; cross-modal art retrieval

Classification code: TP391.4 [Automation and Computer Technology - Computer Application Technology]

