Domain Adaptation Algorithm for Few-Shot Classification Task (cited by: 2)

Abstract: In recent years, applications of artificial intelligence have been tailored to increasingly specific scenarios. Repeating data collection, model training, and model tuning for each scenario costs considerable time and effort and seriously reduces the efficiency of applying the technology. How to transfer an existing, well-trained model to other application scenarios is therefore a key problem in applying artificial intelligence. Domain adaptation studies how to transfer a source-domain model effectively to a target domain, which offers an important approach to this problem. This paper proposes a few-shot adversarial discriminative domain adaptation algorithm. Compared with unsupervised domain adaptation, it works under a stricter constraint, requiring only a small number of target-domain samples. On standard datasets of varying difficulty it outperforms the adversarial discriminative domain adaptation (ADDA) algorithm, with a maximum improvement of 16.9% on a single task, which supports the effectiveness of the proposed components.

First, two new data augmentation methods are proposed to enrich sample diversity and fill the feature space. The first augments from a global perspective, based on mixup; the second augments from a local perspective, based on cutmix. Both construct images that conform to the joint distribution of the source and target domains, which addresses the tendency of the model to overfit to the few target-domain samples under the few-shot constraint. Next, by combining a dual-domain sample pairing mechanism with ADDA, the unsupervised domain adaptation algorithm, which depends on a large number of target-domain samples, is turned into a supervised domain adaptation algorithm for the few-shot setting. Whereas unsupervised methods require a large amount of unlabeled target-domain data, the few-shot condition requires only a very small amount of labeled target-domain data, which greatly eases data collection and broadens the method's applicability. During domain adaptation, class label smoothing is introduced into the loss function to suppress overfitting, and it is combined with the maximum mean discrepancy (MMD) metric from metric learning to form a new domain adaptation loss that increases the separation of features. A new network structure for the domain classification discriminator is also proposed to stabilize and speed up training. Finally, an enhancement stage is added on top of ADDA, which uses the confusion matrix to further improve the model's classification performance. Experimental results on difficult datasets show that, using 5-shot or fewer target-domain samples, the model trained with the proposed algorithm improves classification accuracy by 26.3% to 37.2%.
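The abstract names mixup (global) and cutmix (local) as the basis of the two augmentation methods but does not give their exact form. The sketch below shows only the standard mixup and cutmix operations applied to one source/target image pair; it is an illustration under stated assumptions, not the authors' implementation. The function names, the grayscale list-of-lists image format, the fixed mixing weight `lam`, the patch coordinates, and the way labels are mixed are all assumptions introduced here.

```python
def mixup_pair(src_img, tgt_img, src_label, tgt_label, lam):
    """Global blend (standard mixup): each pixel is lam * source + (1 - lam) * target.

    Images are H x W lists of floats and labels are one-hot lists.
    Mixing the labels with the same weight is the usual mixup convention,
    assumed here rather than taken from the paper.
    """
    img = [[lam * s + (1.0 - lam) * t for s, t in zip(s_row, t_row)]
           for s_row, t_row in zip(src_img, tgt_img)]
    label = [lam * a + (1.0 - lam) * b for a, b in zip(src_label, tgt_label)]
    return img, label


def cutmix_pair(src_img, tgt_img, src_label, tgt_label, top, left, height, width):
    """Local blend (standard cutmix): paste a target patch into a copy of the source.

    Assumes 0 <= top < H and 0 <= left < W; the patch is clipped at the image border.
    """
    rows, cols = len(src_img), len(src_img[0])
    bottom, right = min(top + height, rows), min(left + width, cols)
    img = [row[:] for row in src_img]  # copy so the source image is not modified
    for r in range(top, bottom):
        img[r][left:right] = tgt_img[r][left:right]
    # Source label weight = fraction of pixels that still come from the source image.
    lam = 1.0 - (bottom - top) * (right - left) / (rows * cols)
    label = [lam * a + (1.0 - lam) * b for a, b in zip(src_label, tgt_label)]
    return img, label


# Toy usage: a 4x4 all-ones "source" image and a 4x4 all-zeros "target" image.
src = [[1.0] * 4 for _ in range(4)]
tgt = [[0.0] * 4 for _ in range(4)]
mixed, mixed_label = mixup_pair(src, tgt, [1.0, 0.0], [0.0, 1.0], lam=0.75)
cut, cut_label = cutmix_pair(src, tgt, [1.0, 0.0], [0.0, 1.0],
                             top=0, left=0, height=2, width=2)
```

In practice `lam` and the patch position are usually sampled at random for each pair, and an array library would replace the nested lists; plain lists are used here only so the sketch runs without dependencies.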

Authors: DAI Hong; HAO Xuan-Ting; SHENG Li-Jie; MIAO Qi-Guang

Affiliations: College of Computer Science and Technology, Xidian University, Xi’an 710071; Xi’an Key Laboratory of Big Data and Intelligent Vision (Xidian University), Xi’an 710071

Source: Chinese Journal of Computers (《计算机学报》), 2022, No. 5, pp. 935-950 (16 pages). Indexed in EI, CAS, CSCD, and the Peking University Core Journals list.

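Funding: National Key R&D Program of China (2018YFC0807500); National Natural Science Foundation of China (61772396, 61772392, 61902296); Research Project of the Guangxi Key Laboratory of Trusted Software (KX202061); Qingdao Science and Technology Plan, Key R&D Special Project (21-1-2-18-xx).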

Keywords: few-shot; domain adaptation; classification; deep learning; transfer learning

Classification code: TP18 [Automation and Computer Technology: Control Theory and Control Engineering]

