
Data-free model compression based on meta-learning
Abstract: To address the low efficiency of data synthesis and the insufficient performance of distilled models in existing data-free knowledge distillation frameworks, a fast data synthesis method based on meta-learning is proposed. The means and variances of the batch normalization layers are adjusted adaptively with momentum to extract reusable features, so that only a small number of updates is needed for each specific task, improving the efficiency of data synthesis. A homo-/heterogeneous teacher discriminator is proposed to extract double-sampled knowledge, addressing the diversity and generalization of synthesized samples. The traditional knowledge distillation loss is improved with student self-recovery distillation, in which a generative block further raises model performance. Experimental results show that the proposed method outperforms existing data-free distillation methods.
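To make the abstract's pipeline concrete, the following is a minimal PyTorch sketch of the standard data-free synthesis objective it builds on: optimizing a batch of random inputs so that their feature statistics match the means and variances stored in the teacher's batch-normalization layers, followed by the conventional temperature-scaled distillation loss that the paper's student self-recovery loss improves upon. The paper's specific contributions (momentum-adaptive statistic adjustment, the homo-/heterogeneous teacher discriminator, and the self-recovery generative block) are not reproduced here; the teacher model, image size, pseudo-labels, and step count are illustrative assumptions, not values from the paper.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F
import torchvision

class BNStatHook:
    """Records how far the statistics of the current (synthetic) batch
    are from the running mean/variance stored in one BatchNorm2d layer."""
    def __init__(self, bn: nn.BatchNorm2d):
        self.loss = torch.tensor(0.0)
        bn.register_forward_hook(self._hook)

    def _hook(self, module, inputs, output):
        x = inputs[0]
        mu = x.mean(dim=(0, 2, 3))
        var = x.var(dim=(0, 2, 3), unbiased=False)
        self.loss = (F.mse_loss(mu, module.running_mean)
                     + F.mse_loss(var, module.running_var))

# A frozen pretrained network stands in for the paper's teacher model.
teacher = torchvision.models.resnet18(num_classes=10).eval()
for p in teacher.parameters():
    p.requires_grad_(False)
hooks = [BNStatHook(m) for m in teacher.modules()
         if isinstance(m, nn.BatchNorm2d)]

# Optimize a random batch of "images" so its BN statistics reproduce
# those the teacher saw during training (data-free synthesis).
x = torch.randn(64, 3, 32, 32, requires_grad=True)  # synthetic batch (assumed size)
y = torch.randint(0, 10, (64,))                     # assumed pseudo-labels
opt = torch.optim.Adam([x], lr=0.1)

for step in range(200):                             # "a small number of updates"
    opt.zero_grad()
    logits = teacher(x)
    bn_loss = sum(h.loss for h in hooks)            # BN statistics matching
    ce_loss = F.cross_entropy(logits, y)            # keeps samples class-consistent
    (bn_loss + ce_loss).backward()
    opt.step()

def kd_loss(student_logits, teacher_logits, T: float = 4.0):
    """Conventional temperature-scaled KL distillation term, the
    baseline the paper's student self-recovery loss builds on."""
    p_t = F.softmax(teacher_logits / T, dim=1)
    log_p_s = F.log_softmax(student_logits / T, dim=1)
    return F.kl_div(log_p_s, p_t, reduction="batchmean") * T * T
```

The synthesized batch would then be fed to both networks and the student trained on kd_loss; the meta-learning aspect of the paper amounts to reusing the synthesis state across tasks so each new task needs only the short inner loop shown above.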
Authors: ZHANG Hao (张浩); GUO Rong-zuo (郭荣佐); CHENG Jia-wei (成嘉伟); WU Jian-cheng (吴建成); JIA Sen-hong (贾森泓) (College of Computer Science, Sichuan Normal University, Chengdu 610101, China)
Source: Computer Engineering and Design (《计算机工程与设计》), PKU Core Journal, 2024, Issue 7, pp. 2034-2040 (7 pages)
Funding: National Natural Science Foundation of China (11905153, 61701331).
Keywords: meta-learning; model compression; data-free; knowledge distillation; data synthesis; batch normalization; deep learning