
A Dual Active Domain Adaptation Algorithm Based on Loss Prediction Strategy
Abstract  Deep learning has achieved remarkable results in image classification and many other applications in recent years. However, because of their deep structures and numerous parameters, most deep learning models require a large amount of labeled data during training, which makes model training expensive. To address this issue, various few-shot learning strategies have been proposed and have attracted wide attention; among them, domain adaptation and active learning are two of the most widely studied. Domain adaptation uses empirical knowledge from source domains to reduce the labeling requirement in target domains, while active learning reduces labeling cost by evaluating which unlabeled samples are most valuable to the current model, thereby avoiding redundant labeling. Although many results in both fields demonstrate their effectiveness in reducing training cost, most existing methods focus on only one of the two. To further reduce labeling cost and combine the advantages of knowledge reuse and sample evaluation, this paper proposes a Dual Active Domain Adaptation (D_AcT) algorithm. It is motivated by the observation that not all source-domain samples are useful for knowledge transfer. In D_AcT, domain adaptation is combined with a typical sample-value estimation model to filter out redundant or even counterproductive samples: the algorithm simultaneously measures the value of source- and target-domain data and selects the most valuable samples for training, which further reduces labeling cost while preserving model accuracy. Specifically, we first propose a Single Active Domain Adaptation (S_AcT) algorithm that selects target-domain samples with an active learning strategy combining Minimax Entropy (MME) and a core-set model; MME trains the feature extractor by minimizing a cross-entropy loss on source- and target-domain samples, and the core-set model is built on feature diversity. The D_AcT algorithm then adds a loss prediction module that adapts the value-estimation strategy to the source domain: it minimizes the difference between the predicted and actual loss, further improving the effectiveness of source-knowledge reuse and reducing training cost. To evaluate the proposed methods, we conduct comprehensive experiments comparing them with existing active transfer learning and semi-supervised transfer learning algorithms on four commonly used transfer learning image datasets: Office-31, the Modified National Institute of Standards and Technology database (MNIST), the Street View House Numbers dataset (SVHN), and SubDomainNet. The results show that S_AcT improves accuracy by up to 3.8% over conventional active transfer learning methods and by up to 1.6% over semi-supervised transfer learning; D_AcT reduces the required source-domain labels by more than 50% and improves accuracy by up to 4% compared with existing active transfer learning methods, demonstrating the superiority and effectiveness of the proposed methods.
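The abstract outlines two value-estimation mechanisms: an entropy-plus-core-set score for selecting unlabeled target-domain samples (S_AcT), and a loss prediction module whose output ranks source-domain samples (D_AcT). The sketch below is a minimal PyTorch illustration of how such scoring could be wired up; it assumes a generic feature extractor and classifier, and all class and function names (LossPredictor, target_value_scores, loss_prediction_objective) are hypothetical rather than taken from the paper's implementation.

```python
# Minimal, illustrative sketch of the two sample-value scores described in the
# abstract; this is an assumption-based example, not the authors' released code.
import torch
import torch.nn as nn
import torch.nn.functional as F


class LossPredictor(nn.Module):
    """Small head that predicts the per-sample training loss from features,
    so that high-predicted-loss source samples can be prioritised for labeling."""

    def __init__(self, feat_dim: int, hidden: int = 128):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(feat_dim, hidden), nn.ReLU(),
                                 nn.Linear(hidden, 1))

    def forward(self, feats: torch.Tensor) -> torch.Tensor:
        return self.net(feats).squeeze(-1)


def loss_prediction_objective(feats, logits, labels, predictor):
    """Train the predictor by minimising the gap between predicted and actual loss."""
    actual = F.cross_entropy(logits, labels, reduction="none")   # per-sample loss
    predicted = predictor(feats)
    return F.mse_loss(predicted, actual.detach())


@torch.no_grad()
def target_value_scores(unlabeled_feats, unlabeled_logits, labeled_feats):
    """Score unlabeled target samples by prediction entropy (uncertainty)
    plus core-set distance to the already-labeled pool (diversity)."""
    probs = F.softmax(unlabeled_logits, dim=1)
    entropy = -(probs * probs.clamp_min(1e-12).log()).sum(dim=1)
    coreset_dist = torch.cdist(unlabeled_feats, labeled_feats).min(dim=1).values
    return entropy + coreset_dist        # higher score = more valuable to annotate


if __name__ == "__main__":
    torch.manual_seed(0)
    # Toy tensors standing in for extracted features and classifier logits.
    tgt_feats, tgt_logits = torch.randn(100, 64), torch.randn(100, 10)
    labeled_feats = torch.randn(20, 64)
    target_query = torch.topk(
        target_value_scores(tgt_feats, tgt_logits, labeled_feats), k=10).indices

    predictor = LossPredictor(feat_dim=64)
    src_feats = torch.randn(50, 64)
    src_logits, src_labels = torch.randn(50, 10), torch.randint(0, 10, (50,))
    aux_loss = loss_prediction_objective(src_feats, src_logits, src_labels, predictor)
    with torch.no_grad():                # keep source samples with highest predicted loss
        source_query = torch.topk(predictor(src_feats), k=25).indices
```

In a full training loop the loss-prediction objective would be added to the task loss, and the two query sets would be sent for annotation in alternating rounds; the sketch only shows the scoring step.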
Authors: LIU Gui-Song (刘贵松), ZHENG Yu (郑余), XIE Xiu-Rui (解修蕊), HUANG Li (黄鹂), DING Hao-Lun (丁浩伦) (School of Computing and Artificial Intelligence, Southwestern University of Finance and Economics, Chengdu 611130; School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu 611731; Zhongshan Institute, University of Electronic Science and Technology of China, Zhongshan, Guangdong 528400)
Source: Chinese Journal of Computers (计算机学报; indexed in EI, CAS, CSCD, PKU Core), 2023, No. 3, pp. 579-593 (15 pages)
Funding: National Natural Science Foundation of China (No. 61806040); Key R&D Program of Sichuan Province (No. 2022YFG0314); Natural Science Foundation of Guangdong Province (No. 2021A1515011866); Zhongshan Science and Technology Bureau Fund Project (No. 420S36).
Keywords: few-shot learning; image classification; active learning; transfer learning; dual active domain adaptation
