期刊文献+

基于中间层频域特征蒸馏的元学习算法

Metalearning algorithm for frequency domain featuredistillation in intermediate layers
下载PDF
导出
摘要 针对目前知识蒸馏忽略特征图全局统计信息的问题,提出利用离散余弦变换(DCT)频域释义的全局统计特征知识,并对各类别的频域特征使用Logistic分类器进行二分类,提取类间差异性信息.利用元学习算法(MPL)对教师模型参数进行更新,使教师模型能动态调整所传递的频域特征.实验模型在CIFAR-10,CIFAR-100以及ImageNet 2012数据集上有0.12%~0.16%的精度提升,结果表明,频域特征与类间相似性信息为学生模型的训练提供更多有用的知识,且两模型的知识交互更有利于教师的知识迁移. Aiming at the problem that knowledge distillation nowadays tends to ignore global statistical information about intermediate layers,the global statistical knowledge of DCT frequency domain interpretation was proposed.The logistic classifier was also utilized for binary classification in terms of frequency domain features for each class,to extract information on inter-class differences.In addition,by referring to parameter update algorithm of teacher model(Meta Pseudo Labels,MPL),the as-proposed models can dynamically adjust the frequency domain features.The experimental models in CIFAR-10,CIFAR-100 and ImageNet 2012 data sets witness improvements of their accuracy rates ranging from 0.12%to 0.16%.The results demonstrate that frequency domain features and inter-class differences can make the training of student models more instructive while making knowledge interaction between both models more favorable to knowledge transfer by teachers.
作者 张灵 郭林威 ZHANG Ling;GUO Lin-wei(School of Computer Science,Guangdong University of Technology,Guangzhou 510000,China)
出处 《沈阳工业大学学报》 CAS 北大核心 2023年第3期313-318,共6页 Journal of Shenyang University of Technology
基金 广东省自然科学基金项目(2021A1515012233)。
关键词 知识蒸馏 元学习 离散余弦变换 自监督 中间特征层 Logistic分类器 全局统计特征知识 类间差异性信息 knowledge distillation meta learning discrete cosine transform self-supervision intermediate feature layer Logistic classifier global statistical feature knowledge inter-class difference information
  • 相关文献

参考文献5

二级参考文献10

共引文献68

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部