Survey of Multi-Task Learning (多任务学习)

Cited by: 29
Abstract: With the development of artificial intelligence technologies such as image processing and speech recognition, many learning methods, especially those built on deep learning frameworks, have achieved excellent performance, with large gains in both accuracy and speed. The accompanying problem, however, is obvious: to obtain a stable learning effect, these methods typically require very large amounts of labeled data for sufficient training; otherwise they underfit and their learning performance declines. Therefore, as task complexity and data scale increase, higher demands are placed on the quantity and quality of manually labeled data, raising annotation cost and difficulty. At the same time, independent learning of a single task often ignores the experience available from other tasks, which leads to redundant training and wasted learning resources and also limits performance. To alleviate these problems, multi-task learning, which belongs to the category of transfer learning, has gradually attracted researchers' attention. Unlike single-task learning, which uses only the sample information of a single task, multi-task learning assumes a certain similarity between the data distributions of different tasks and, on this basis, establishes relationships between tasks through joint training and optimization. This training mode promotes information exchange between tasks and achieves mutual learning. Especially when each task's sample size is limited, every task can draw some inspiration from the others: through information transfer during the learning process, the data of other tasks can be exploited indirectly, which alleviates the dependence on large amounts of labeled data and improves the learning performance of each task. Against this background, this paper first introduces the concept of related tasks, classifies the types of related tasks according to their functions, and then describes their characteristics one by one. Next, according to the data processing mode and the way task relationships are modeled, the paper divides the current mainstream algorithms into two categories: structured multi-task learning algorithms and deep multi-task learning algorithms. Structured multi-task learning algorithms adopt linear models, make structural assumptions directly on the data, and express task relationships using the original annotated features; according to the learning object, they can be further subdivided into task-level and feature-level structures, each of which can be realized by either discriminative or generative methods. In contrast to the modeling process of structured multi-task learning, deep multi-task learning algorithms describe task relationships using deep information abstracted through multiple feature layers and share information by processing the parameters of specific network layers. Taking these two categories of algorithms as the main line, the paper then analyzes in detail the structural assumptions about task relationships in the different modeling methods, their implementation approaches, their respective advantages and disadvantages, and the connections between the methods. Finally, the paper summarizes the criteria for judging the similarity and closeness between tasks, analyzes the effectiveness and intrinsic causes of the multi-task mechanism, and explains the characteristics of multi-task information transfer from the perspectives of inductive bias and dynamic solving.
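The information-sharing mechanism the abstract attributes to deep multi-task learning, sharing the parameters of specific network layers across tasks, can be illustrated with a minimal numpy sketch of hard parameter sharing. This is not code from the surveyed paper: the two regression tasks, the network sizes, and all variable names are illustrative. Two related tasks share one hidden layer while each keeps its own linear output head, so gradients from both tasks update the shared representation.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic data: two related regression tasks driven by the same latent
# linear signal, mimicking the distribution-similarity assumption.
n, d, h = 200, 5, 8
X = rng.normal(size=(n, d))
w_true = rng.normal(size=d)
tasks = [
    X @ w_true + 0.1 * rng.normal(size=n),          # task 1
    2.0 * (X @ w_true) + 0.1 * rng.normal(size=n),  # task 2 (related target)
]

# Hard parameter sharing: one shared hidden layer (W, b), one head per task.
W = 0.1 * rng.normal(size=(d, h))
b = np.zeros(h)
heads = [0.1 * rng.normal(size=h) for _ in tasks]

lr = 0.1
for _ in range(2000):
    Z = np.tanh(X @ W + b)                 # shared representation for all tasks
    dZ = np.zeros_like(Z)
    for t, y in enumerate(tasks):
        err = Z @ heads[t] - y             # task-specific residual
        dZ += np.outer(err, heads[t])      # gradient flowing into the shared layer
        heads[t] = heads[t] - lr * Z.T @ err / n
    dA = dZ * (1.0 - Z ** 2)               # back-propagate through tanh
    W -= lr * X.T @ dA / n
    b -= lr * dA.mean(axis=0)

Z = np.tanh(X @ W + b)
mses = [float(np.mean((Z @ heads[t] - y) ** 2)) for t, y in enumerate(tasks)]
print(mses)  # per-task mean-squared error after joint training
```

Because both loss terms back-propagate into the same `W` and `b`, each task indirectly benefits from the other task's data, which is the mutual-learning effect the abstract describes; the task-specific heads preserve each task's own output mapping.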
Authors: ZHANG Yu; LIU Jian-Wei; ZUO Xin (Department of Automation, China University of Petroleum, Beijing 102249)
Source: Chinese Journal of Computers (《计算机学报》), 2020, No. 7, pp. 1340-1378 (39 pages); indexed by EI, CSCD, Peking University Core (北大核心)
Funding: Supported by the National Key Research and Development Program of China (2016YFC0303703-03) and the Annual Forward-Looking and Cultivation Project of China University of Petroleum (Beijing) (2462018QZDX02)
Keywords: multi-task learning; information transfer; similarity of tasks; Bayesian generative model of multi-task learning; discriminant approach of multi-task learning; deep multi-task learning via deep neural network