Abstract
Tensor train (TT) decomposition and Tucker decomposition are two effective compression methods for convolutional neural networks. However, TT decomposition suffers from loss of spatial structure information, while Tucker decomposition incurs high computational complexity. To address these problems, this paper takes into account the information retention rate and resource occupancy of the network structure, adopts the constrained compression framework of the learning-compression (LC) algorithm, and proposes a pre-training-free LC compression method for convolutional neural networks based on TT-Tucker decomposition (TT-LC). The TT-LC method consists of two parts: a learning step and a compression step. The learning step requires no pre-training and adopts an exponential cyclic learning rate schedule to improve training accuracy. In the compression step, exploiting the complementary advantages of TT and Tucker decomposition and the ability of Bayesian rules to select globally optimal ranks, empirical variational Bayesian matrix factorization (EVBMF) and Bayesian optimization (BayesOpt) are used to select reasonable ranks to guide the tensor decomposition, and the trained model is then compressed with the TT-LC method. TT-LC not only reduces the loss of spatial structure information and the computational complexity, but also avoids the significant accuracy drop caused by unreasonable rank selection; it realizes double Bayesian rank selection and double compression of the model and yields an optimal compressed model. Finally, experiments were carried out on the CIFAR10 and CIFAR100 datasets with ResNets and VGG networks. The results show that, for the ResNet32 network, compared with the baseline method, the proposed method achieves a 69.6% compression rate in the number of parameters and a 66.7% compression rate in floating-point operations while maintaining an accuracy of 92.22%.
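To make the compression step concrete, below is a minimal sketch (not the authors' implementation) of factorizing a single convolution kernel with both Tucker and tensor-train (TT) decompositions using the TensorLy library. The toy kernel shape and the fixed ranks are illustrative placeholders standing in for the EVBMF- and BayesOpt-selected ranks described in the abstract; the rank search itself and the LC alternating optimization are not shown.

# Minimal, illustrative sketch (assumptions: TensorLy installed, numpy backend;
# ranks are placeholders, not values produced by EVBMF/BayesOpt).
import numpy as np
import tensorly as tl
from tensorly.decomposition import tucker, tensor_train

tl.set_backend("numpy")

# Toy convolution kernel: (out_channels, in_channels, kernel_h, kernel_w)
weight = np.random.randn(64, 32, 3, 3)

# Tucker decomposition: compress the channel modes, keep the spatial modes small.
tucker_ranks = [16, 8, 3, 3]          # placeholder ranks (paper: chosen by EVBMF)
core, factors = tucker(tl.tensor(weight), rank=tucker_ranks)

# Tensor-train decomposition of the same kernel.
tt_ranks = [1, 8, 8, 3, 1]            # placeholder ranks (paper: chosen by BayesOpt)
tt_cores = tensor_train(tl.tensor(weight), rank=tt_ranks)

# Rough parameter counts before and after, to gauge the compression ratio.
orig_params = weight.size
tucker_params = core.size + sum(f.size for f in factors)
tt_params = sum(c.size for c in tt_cores)
print(f"original: {orig_params}, Tucker: {tucker_params}, TT: {tt_params}")

In the paper's framework, such a decomposition would be applied inside the compression step of the LC loop rather than as a one-shot post-training factorization.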
Authors
LIU Weirong, ZHANG Zhiqiang, ZHANG Ning, MENG Jiahao, ZHANG Min, LIU Jie
(College of Electrical and Information Engineering, Lanzhou University of Technology, Lanzhou 730050, Gansu, China)
Source
Journal of South China University of Technology (Natural Science Edition) (《华南理工大学学报(自然科学版)》)
Indexed in: EI, CAS, CSCD, Peking University Core Journal List (北大核心)
2024, Issue 7, pp. 29-38 (10 pages)
Funding
National Natural Science Foundation of China (62261032)
Natural Science Foundation of Gansu Province (22JR5RA272)
Key Talent Project of Gansu Province
Keywords
convolutional neural network
network compression
tensor decomposition
Bayesian optimization
constrained compression