Abstract
As the scale of convolutional neural networks continues to expand, the deployment and development of terminal intelligent devices face significant challenges due to their substantial computational and parameter requirements; it is therefore crucial to compress and accelerate models while preserving their accuracy as much as possible. Existing compression methods still fall short in terms of algorithm implementation, compression effectiveness, and compression efficiency. In this paper, we propose a convolutional neural network pruning method based on channel similarity. Specifically, we first investigate the similarity redundancy among feature channels in convolutional neural networks and introduce an efficient similarity metric to quantify the similarity between feature channels. Second, we prune the network by removing redundant channels throughout it with a similarity ranking algorithm. Third, we load the parameters of the retained channels and fine-tune them to mitigate the impact of pruning on the model's classification performance. To improve compression efficiency, we adopt a one-shot pruning strategy, which requires lower time complexity. Finally, experimental results on the CIFAR-10 and CIFAR-100 datasets with the VGG-16, ResNet-56, ResNet-110, and GoogLeNet models demonstrate that the proposed method compresses models more efficiently than existing methods while maintaining good accuracy.
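The abstract describes ranking channels by pairwise similarity and removing the most redundant ones in one shot. The snippet below is a minimal illustrative sketch of that idea for a single convolutional layer, assuming cosine similarity over flattened filter weights as the metric and "maximum similarity to any other channel" as the redundancy score; the paper's actual metric and ranking algorithm may differ.

```python
import numpy as np

def channel_similarity_pruning(weights, prune_ratio):
    """Rank the output channels of one conv layer by how similar each is
    to its nearest neighbour (cosine similarity over flattened filters),
    then drop the most redundant fraction. Returns indices to keep.

    weights: array of shape (out_channels, in_channels, kH, kW)
    prune_ratio: fraction of channels to remove, in [0, 1)
    """
    n = weights.shape[0]
    flat = weights.reshape(n, -1)
    # Normalise each filter so the dot product gives cosine similarity.
    unit = flat / (np.linalg.norm(flat, axis=1, keepdims=True) + 1e-12)
    sim = unit @ unit.T               # pairwise cosine similarity matrix
    np.fill_diagonal(sim, -np.inf)    # ignore self-similarity
    redundancy = sim.max(axis=1)      # similarity to the closest other channel
    n_prune = int(round(n * prune_ratio))
    order = np.argsort(redundancy)    # least redundant channels first
    return np.sort(order[: n - n_prune])

# Toy usage: 8 channels of a 3x3 conv over 3 input channels, prune 25%.
rng = np.random.default_rng(0)
w = rng.standard_normal((8, 3, 3, 3))
kept = channel_similarity_pruning(w, 0.25)
print(len(kept))  # 6 channels remain
```

In a full pipeline this selection would be applied to every layer, after which the retained channels' parameters are reloaded and the network fine-tuned, as the abstract outlines.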
Authors
CHENG Dian, ZHENG Haibin, CHEN Jinyin (College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China; Institute of Cyberspace Security, Zhejiang University of Technology, Hangzhou 310023, China)
Source
Journal of Chinese Computer Systems (《小型微型计算机系统》)
CSCD
Peking University Core Journal (北大核心)
2024, No. 11, pp. 2656-2662 (7 pages)
Funding
Supported by the National Natural Science Foundation of China (62072406), the Zhejiang Provincial Natural Science Foundation (LDQ23F020001), and the Key Laboratory of Information System Security Technology Fund (61421110502).
Keywords
convolutional neural network
model pruning
model compression
channel similarity