Multi-layer dynamic and asymmetric convolutions

Abstract: Dynamic networks have become popular for enhancing model capacity while maintaining efficient inference by dynamically generating weights from over-parameters. However, they introduce many more parameters and make training more difficult. In this paper, a multi-layer dynamic convolution (MDConv) is proposed. It scatters the over-parameters over multiple layers, which yields fewer parameters but stronger model capacity than scattering them horizontally; it uses the expanding form, where the attention is applied to the features, to facilitate training; and it uses the compact form, where the attention is applied to the weights, to maintain efficient inference. Moreover, a multi-layer asymmetric convolution (MAConv) is proposed, which has no extra parameters or computation cost at inference time compared with static convolution. Experimental results show that MDConv achieves better accuracy with fewer parameters and significantly facilitates training; MAConv improves accuracy without any extra storage or computation cost at inference time compared with static convolution.
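The relation between the expanding form and the compact form can be illustrated with a small sketch. It is not the paper's implementation: it assumes a single layer, a 1x1 convolution at one pixel (so each kernel is a small matrix), and hand-picked attention values. The names `matvec`, `expanded_form`, and `compact_form` are hypothetical. Because convolution is linear, mixing the outputs of several kernels (attention on the features) gives the same result as mixing the kernels first and applying the fused kernel once (attention on the weights).

```python
# Toy sketch under stated assumptions, not the MDConv implementation:
# K candidate kernels for a 1x1 convolution at a single pixel,
# so each kernel is just a small weight matrix.

def matvec(W, x):
    """Multiply matrix W (a list of rows) by vector x."""
    return [sum(w * v for w, v in zip(row, x)) for row in W]

def expanded_form(kernels, attention, x):
    """Attention on the features: apply every kernel, then mix the outputs."""
    outputs = [matvec(W, x) for W in kernels]
    return [sum(a * out[i] for a, out in zip(attention, outputs))
            for i in range(len(outputs[0]))]

def compact_form(kernels, attention, x):
    """Attention on the weights: mix the kernels once, then apply one kernel."""
    rows, cols = len(kernels[0]), len(kernels[0][0])
    fused = [[sum(a * W[r][c] for a, W in zip(attention, kernels))
              for c in range(cols)] for r in range(rows)]
    return matvec(fused, x)

# Hypothetical values chosen only for illustration.
kernels = [
    [[1.0, 2.0], [0.0, -1.0]],
    [[0.5, 0.0], [3.0, 1.0]],
    [[-2.0, 1.0], [1.0, 1.0]],
]
attention = [0.2, 0.5, 0.3]  # assumed attention weights (sum to 1)
x = [1.0, -2.0]              # input feature vector at one pixel

y_expanded = expanded_form(kernels, attention, x)
y_compact = compact_form(kernels, attention, x)

# Both forms agree up to floating-point rounding.
print(all(abs(a - b) < 1e-9 for a, b in zip(y_expanded, y_compact)))
```

In this toy setting the compact form runs one matrix-vector product instead of K, which is the kind of saving that makes it attractive at inference time; how the paper stacks several such layers is not specified in the abstract and is not modeled here.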

Authors: LUO Chunjie (罗纯杰); ZHAN Jianfeng (Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, P.R. China; University of Chinese Academy of Sciences, Beijing 100049, P.R. China)

Affiliations: Institute of Computing Technology; University of Chinese Academy of Sciences

Source: High Technology Letters (高技术通讯（英文版）), 2022, No. 3, pp. 227-236 (10 pages). Indexed in EI and CAS.

Funding: Supported by the National Key Research and Development Program of China (No. 2016YFB1000601) and the Standardization Pilot Research Project of Chinese Academy of Sciences (No. 20194620).

Keywords: neural network; dynamic network; attention; image classification

Classification code: TP183 [Automation and Computer Technology: Control Theory and Control Engineering]

