
Space Efficient Quantization for Deep Convolutional Neural Networks

Abstract: Deep convolutional neural networks (DCNNs) have shown outstanding performance in the fields of computer vision, natural language processing, and complex system analysis. As performance improves with deeper layers, DCNNs incur higher computational complexity and larger storage requirements, making it extremely difficult to deploy DCNNs on resource-limited embedded systems (such as mobile devices or Internet of Things devices). Network quantization efficiently reduces the storage space required by DCNNs. However, the performance of DCNNs often drops rapidly as the quantization bit width is reduced. In this article, we propose a space-efficient quantization scheme which uses eight or fewer bits to represent the original 32-bit weights. We adopt the singular value decomposition (SVD) method to decrease the parameter size of fully-connected layers for further compression. Additionally, we propose a weight clipping method based on a dynamic boundary to improve performance when using lower precision. Experimental results demonstrate that our approach can achieve up to approximately 14x compression while preserving almost the same accuracy as the full-precision models. The proposed weight clipping method can also significantly improve the performance of DCNNs when lower precision is required.
Source: Journal of Computer Science & Technology (SCIE, EI, CSCD), 2019, Issue 2, pp. 305-317 (13 pages). Chinese title: 计算机科学技术学报(英文版).
Funding: the National Natural Science Foundation of China (NSFC) under Grant Nos. 61772077 and 61370192; Beijing Natural Science Foundation of China under Grant No. 4192051; NSFC under Grant Nos. 61428203 and 61572347.
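The two main ideas in the abstract, uniform low-bit quantization with clipped weight boundaries and SVD-based compression of fully-connected layers, can be sketched as follows. This is a minimal illustration only, not the paper's actual algorithm: the percentile-based clipping boundary, the symmetric uniform quantizer, and the chosen rank are all assumptions standing in for the paper's dynamic-boundary and compression details.

```python
import numpy as np

def clip_and_quantize(w, bits=8, clip_pct=99.9):
    """Clip outlier weights, then uniformly quantize to `bits` bits.

    The percentile-based boundary is an assumed stand-in for the
    paper's dynamic-boundary clipping method.
    """
    bound = np.percentile(np.abs(w), clip_pct)
    w_clipped = np.clip(w, -bound, bound)
    # Symmetric uniform quantizer: map [-bound, bound] to signed integers.
    scale = bound / (2 ** (bits - 1) - 1)
    q = np.round(w_clipped / scale).astype(np.int8)
    return q, scale

def svd_compress(w, rank):
    """Low-rank factorization of a fully-connected weight matrix.

    W (m x n) is approximated by U_r (m x r) @ V_r (r x n),
    storing m*r + r*n values instead of m*n.
    """
    u, s, vt = np.linalg.svd(w, full_matrices=False)
    u_r = u[:, :rank] * s[:rank]   # fold singular values into U
    v_r = vt[:rank, :]
    return u_r, v_r

# Example: quantize and factor a random FC-layer weight matrix.
rng = np.random.default_rng(0)
w = rng.normal(0.0, 0.05, size=(512, 256)).astype(np.float32)

q, scale = clip_and_quantize(w)          # 8-bit weights: 4x smaller than fp32
u_r, v_r = svd_compress(w, rank=32)      # rank-32 factors: ~5x fewer parameters
```

In practice the two techniques compose: the low-rank factors of the fully-connected layers can themselves be quantized, which is how a combined scheme can reach the compression ratios the abstract reports.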