Incremental Learning Based on Data Translation and Knowledge Distillation

Incremental Learning Based on Data Translation and Knowledge Distillation

下载PDF

导出

摘要 Recently, deep convolutional neural networks (DCNNs) have achieved remarkable results in image classification tasks. Despite convolutional networks’ great successes, their training process relies on a large amount of data prepared in advance, which is often challenging in real-world applications, such as streaming data and concept drift. For this reason, incremental learning (continual learning) has attracted increasing attention from scholars. However, incremental learning is associated with the challenge of catastrophic forgetting: the performance on previous tasks drastically degrades after learning a new task. In this paper, we propose a new strategy to alleviate catastrophic forgetting when neural networks are trained in continual domains. Specifically, two components are applied: data translation based on transfer learning and knowledge distillation. The former translates a portion of new data to reconstruct the partial data distribution of the old domain. The latter uses an old model as a teacher to guide a new model. The experimental results on three datasets have shown that our work can effectively alleviate catastrophic forgetting by a combination of the two methods aforementioned. Recently, deep convolutional neural networks (DCNNs) have achieved remarkable results in image classification tasks. Despite convolutional networks’ great successes, their training process relies on a large amount of data prepared in advance, which is often challenging in real-world applications, such as streaming data and concept drift. For this reason, incremental learning (continual learning) has attracted increasing attention from scholars. However, incremental learning is associated with the challenge of catastrophic forgetting: the performance on previous tasks drastically degrades after learning a new task. In this paper, we propose a new strategy to alleviate catastrophic forgetting when neural networks are trained in continual domains. Specifically, two components are applied: data translation based on transfer learning and knowledge distillation. The former translates a portion of new data to reconstruct the partial data distribution of the old domain. The latter uses an old model as a teacher to guide a new model. The experimental results on three datasets have shown that our work can effectively alleviate catastrophic forgetting by a combination of the two methods aforementioned.

作者 Tan Cheng Jielong Wang Tan Cheng;Jielong Wang(Xiamen Institute of Data Intelligence, Xiamen, China)

机构地区 Xiamen Institute of Data Intelligence

出处《International Journal of Intelligence Science》 2023年第2期33-47,共15页 智能科学国际期刊（英文）

关键词 Incremental Domain Learning Data Translation Knowledge Distillation Cat-astrophic Forgetting Incremental Domain Learning Data Translation Knowledge Distillation Cat-astrophic Forgetting

分类号 H31 [语言文字—英语]

引文网络
相关文献

1Yanzhao Zhou,Binghao Liu,Yiran Liu,Jianbin Jiao.Filter Bank Networks for Few-Shot Class-Incremental Learning[J].Computer Modeling in Engineering & Sciences,2023(10):647-668.
2任进,邵淑颖,何怡怡.基于增量学习的时变信道预测方法[J].无线电工程,2023,53(4):815-823. 被引量：1
3Zeyong Sun,Guo Ran,Zilong Jin.Intrusion Detection Method Based on Active Incremental Learning in Industrial Internet of Things Environment[J].Journal on Internet of Things,2022,4(2):99-111.
4Abdul Sattar Palli,Jafreezal Jaafar,Manzoor Ahmed Hashmani,Heitor Murilo Gomes,Aeshah Alsughayyir,Abdul Rehman Gilal.Combined Effect of Concept Drift and Class Imbalance on Model Performance During Stream Classification[J].Computers, Materials & Continua,2023(4):1827-1845.
5Changjiu Teng,Qiangmin Yu,Yujie Sun,Baofu Ding,Wenjun Chen,Zehao Zhang,Bilu Liu,Hui-Ming Cheng.Homologous gradient heterostructure-based artificial synapses for neuromorphic computation[J].InfoMat,2023,5(1):95-105. 被引量：1
6ZHANG Qingsong,SUN Linjun,YANG Guowei,LU Baoli,NING Xin,LI Weijun.TBNN: totally-binary neural network for image classification[J].Optoelectronics Letters,2023,19(2):117-122.
7Jeffrey J Leow,Soon Hock Koh,Marcus WL Chow,Wayren Loke,Rolando Salada,Seok Kwan Hong,Yuyi Yeow,Chau Hung Lee,Cher Heng Tan,Teck Wei Tan.Can we omit systematic biopsies in patients undergoing MRI fusion-targeted prostate biopsies?[J].Asian Journal of Andrology,2023,25(1):43-49.
8Swagata Boruah,Archit Dehloo,Prajul Gupta,Manas Ranjan Prusty,A.Balasundaram.Gaussian Blur Masked ResNet2.0 Architecture for Diabetic Retinopathy Detection[J].Computers, Materials & Continua,2023(4):927-942.
9Bin Ji,Hao Xu,Jie Yu,Shasha Li,JunMa,Yuke Ji,Huijun Liu.A Two-Phase Paradigm for Joint Entity-Relation Extraction[J].Computers, Materials & Continua,2023(1):1303-1318. 被引量：1
10Journal of Integrative Agriculture Instruction to Authors[J].Journal of Integrative Agriculture,2023,22(4).

International Journal of Intelligence Science

2023年第2期

浏览历史

内容加载中请稍等...

Incremental Learning Based on Data Translation and Knowledge Distillation

相关作者

相关机构

相关主题

浏览历史