Abstract
The pooling layer in convolutional neural networks performs subsampling on the basis of the local correlation principle, reducing the data size while retaining useful information, which helps improve generalization and effectively enlarges the receptive field. Classical max-pooling adopts a winner-take-all strategy, which can sometimes hurt the generalization ability of the network. To address this, a simple and effective pooling method named ensemble max-pooling is introduced, which can replace the pooling layer in conventional convolutional neural networks. In each local pooling region, ensemble max-pooling drops the neuron with the maximum activation with probability p and outputs the neuron with the second-largest activation. Ensemble max-pooling can be viewed as an ensemble of many basic underlying networks, and it can also be viewed as classical max-pooling applied to an input under certain local distortions. Experimental results show that ensemble max-pooling outperforms classical pooling methods and other related pooling approaches. DFN-MR is a recent derivative of the mainstream ResNet architecture; compared with ResNet, it contains more basic underlying networks while avoiding extremely deep networks. Keeping all other hyperparameters unchanged, replacing each stride-2 convolutional layer in DFN-MR with a tandem structure, i.e., an ensemble max-pooling layer followed by a stride-1 convolutional layer, yields significant performance gains.
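The pooling rule described above can be sketched in a few lines of NumPy. This is a minimal illustration, not the authors' implementation: the function name, the single-channel 2-D input shape, and the non-overlapping pooling window are assumptions made here for clarity.

```python
import numpy as np

def ensemble_max_pool(x, pool=2, p=0.5, rng=None):
    """Sketch of ensemble max-pooling on a 2-D feature map (assumed shape H x W).

    In each pool x pool region, with probability p the neuron with the
    maximum activation is dropped and the second-largest activation is
    output; otherwise the region behaves like classical max-pooling.
    """
    rng = np.random.default_rng() if rng is None else rng
    h, w = x.shape
    out = np.empty((h // pool, w // pool), dtype=x.dtype)
    for i in range(0, h - pool + 1, pool):
        for j in range(0, w - pool + 1, pool):
            region = np.sort(x[i:i + pool, j:j + pool].ravel())
            # region[-1] is the maximum, region[-2] the second largest
            out[i // pool, j // pool] = region[-2] if rng.random() < p else region[-1]
    return out
```

With p=0 this reduces to classical max-pooling; with p=1 it always outputs the second-largest activation. At test time, p would typically be set to 0 (or the output rescaled), mirroring the usual treatment of dropout-style stochastic layers.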
Keywords
convolutional neural network
pooling layer
network ensemble
data augmentation