Model pruning for object detection via strong correlation smoothing constraints

Abstract: Although research on lightweight object detection models has produced many representative results, existing methods still suffer from a cliff-like drop in detection accuracy when the model is pruned at a high ratio. In exploring the root cause of this pruning-induced performance degradation in mainstream object detection networks, the fluctuation of gradients after pruning is found to be the key factor affecting model performance. A Pruning Framework based on Strong Correlation Smoothing Constraints (SCSC) is therefore constructed. First, the historical gradient and the current gradient are defined as the teacher and the student of self-distillation theory; by having the student imitate the teacher, the student gradient is brought as close as possible to the teacher gradient, achieving gradient smoothing. Second, based on the gradient smoothing result, a pruning scheme with strong correlation constraints is proposed: the historical gradient and the current gradient form a strong correlation group, and the sparsity of the model weight parameters is enhanced by strengthening the contribution of the historical gradient to the current gradient update. On the PASCAL VOC2007 dataset, SCSC achieves a 2-percentage-point improvement in average precision over mainstream pruning methods; on the KITTI dataset, at a pruning rate of 80%, SCSC's recognition accuracy degrades by only 3 percentage points relative to the original network.
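The article does not include source code; the following minimal PyTorch-style sketch only illustrates the two steps as the abstract describes them. The class name SCSCPrunerSketch, the EMA decay beta, the blending weight alpha, and the magnitude-based pruning criterion are all illustrative assumptions, not the authors' published implementation: the historical gradient (teacher) is kept as an exponential moving average, the current gradient (student) is blended toward it before each optimizer step, and weights are pruned at a given ratio afterward.

```python
import torch


class SCSCPrunerSketch:
    """Illustrative sketch of the two SCSC steps described in the abstract.

    Step 1 (gradient self-distillation smoothing): an exponential moving
    average of past gradients acts as the "teacher"; the current "student"
    gradient is pulled toward it, damping post-pruning gradient fluctuation.
    Step 2 (strong correlation constraint): historical and current gradients
    form a group, and the historical gradient's contribution to the update is
    strengthened via the blending weight alpha; magnitude pruning then removes
    the weights driven toward zero. beta, alpha, and the pruning criterion
    are assumptions for illustration only.
    """

    def __init__(self, params, beta=0.9, alpha=0.5):
        self.params = list(params)
        self.beta = beta    # EMA decay defining the historical (teacher) gradient
        self.alpha = alpha  # share of the teacher in the blended (student) update
        self.hist = [torch.zeros_like(p) for p in self.params]

    @torch.no_grad()
    def smooth_gradients(self):
        """Blend the current (student) gradient toward the historical (teacher) one."""
        for p, h in zip(self.params, self.hist):
            if p.grad is None:
                continue
            # Update the teacher as an EMA of observed gradients.
            h.mul_(self.beta).add_(p.grad, alpha=1.0 - self.beta)
            # Student imitates teacher: a convex combination smooths fluctuations
            # and strengthens the historical gradient's role in the update.
            p.grad.mul_(1.0 - self.alpha).add_(h, alpha=self.alpha)

    @torch.no_grad()
    def prune_by_magnitude(self, ratio=0.8):
        """Zero out the smallest-magnitude weights after training has sparsified them."""
        for p in self.params:
            k = int(ratio * p.numel())
            if k < 1:
                continue
            threshold = p.abs().flatten().kthvalue(k).values
            p.mul_((p.abs() > threshold).to(p.dtype))
```

In a training loop, smooth_gradients() would be called between loss.backward() and optimizer.step(), with prune_by_magnitude(0.8) applied after training to reach the 80% pruning rate reported on KITTI.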
Authors: KANG Bin; LI Zhuo; QIU Kun; DOU Haie; WANG Lei; ZHENG Baoyu (School of Internet of Things, Nanjing University of Posts and Telecommunications, Nanjing 210003, China; School of Applied Technology College, Nanjing University of Posts and Telecommunications, Nanjing 210042, China; School of Communications and Information Engineering, Nanjing University of Posts and Telecommunications, Nanjing 210003, China)
Source: Journal of Nanjing University of Posts and Telecommunications (Natural Science Edition), 2024, Issue 3, pp. 72-79 (8 pages); Peking University Core Journal (北大核心)
Funding: National Natural Science Foundation of China (62171232, 62071255, 62371253, 62001248); Key Research and Development Program of Jiangsu Province (BE2023087); Key Project of Jiangsu Higher Education Institutions (20KJA510009)
Keywords: convolutional neural networks; knowledge distillation; model pruning