摘要
为让学生模型学习到更全面的表征知识,提出一种多层次知识蒸馏方法,将教师模型的知识分为高、中、低多层次进行蒸馏。这3种层次的知识分别被指代为模型预测值、多尺度融合特征图以及特征层中的特征值,并以多层次知识为基础设计蒸馏项。首先基于低层次知识的特征蒸馏保证学生模型与教师模型的特征分布尽可能接近;再以中层次知识为基础,将图像的空间结构知识传递给学生模型;最后利用高层次的知识编码相邻帧间的依赖关系,并将该隐含知识传递给学生模型。此外定义的语义一致性损失也可有效改善像素点在前后帧标签预测不一致的情况。实验证明,该蒸馏方法能显著提升学生模型的精度,并在模型精度和轻量化方面取得了更好的平衡,具备良好的应用前景。
To learn more comprehensive representational knowledge,a multi-level knowledge distillation method was proposed for student model,which divided the knowledge of teacher model into high,middle and low levels for distillation.These levels of knowledge were respectively referred to as model predictions,multi-scale fusion feature map and activations in intermediate feature layers.Then,distillation items were designed on the basis of multi-level knowledge.Feature distillation based on low-level knowledge ensured the feature distribution between student model and teacher model could be as close as possible.Afterwards,based on middle-level knowledge,spatial structure knowledge of images was transferred to the student model.The high-level knowledge was used to encode the dependency between adjacent frames,and then this implicit knowledge was transmitted to the student model.Moreover,additional semantic consistency loss effectively improved the inconsistent predictions between adjacent frames.Experiments showed that the proposed distillation method could significantly improve the accuracy of the student model and achieved a better balance between accuracy and efficiency,which had a good application prospect.
作者
凌志
李幸
张婷
陈良
孙立宁
LING Zhi;LI Xing;ZHANG Ting;CHEN Liang;SUN Lining(Jiangsu Provincial Key Laboratory of Advanced Robotics,Soochow University,Suzhou 215123,China;School of Mechanical and Electric Engineering,Soochow University,Suzhou 215137,China;Momenta(Suzhou)Technology Company Limited,Suzhou 215133,China)
出处
《计算机集成制造系统》
EI
CSCD
北大核心
2023年第4期1244-1253,共10页
Computer Integrated Manufacturing Systems
基金
工信部工业互联网创新发展工程资助项目(TC190H3WR)。
关键词
语义分割
知识蒸馏
连续图像
轻量模型
相似性计算
semantic segmentation
knowledge distillation
continuous images
lightweight model
similarity calculation