期刊文献+

MgNet: A unified framework of multigrid and convolutional neural network 被引量:2

MgNet: A unified framework of multigrid and convolutional neural network
原文传递
导出
摘要 We develop a unified model, known as MgNet, that simultaneously recovers some convolutional neural networks (CNN) for image classification and multigrid (MG) methods for solving discretized partial differential equations (PDEs). This model is based on close connections that we have observed and uncovered between the CNN and MG methodologies. For example, pooling operation and feature extraction in CNN correspond directly to restriction operation and iterative smoothers in MG, respectively. As the solution space is often the dual of the data space in PDEs, the analogous concept of feature space and data space (which are dual to each other) is introduced in CNN. With such connections and new concept in the unified model, the function of various convolution operations and pooling used in CNN can be better understood. As a result, modified CNN models (with fewer weights and hyperparameters) are developed that exhibit competitive and sometimes better performance in comparison with existing CNN models when applied to both CIFAR-10 and CIFAR-100 data sets. We develop a unified model, known as MgNet, that simultaneously recovers some convolutional neural networks(CNN) for image classification and multigrid(MG) methods for solving discretized partial differential equations(PDEs). This model is based on close connections that we have observed and uncovered between the CNN and MG methodologies. For example, pooling operation and feature extraction in CNN correspond directly to restriction operation and iterative smoothers in MG, respectively. As the solution space is often the dual of the data space in PDEs, the analogous concept of feature space and data space(which are dual to each other) is introduced in CNN. With such connections and new concept in the unified model, the function of various convolution operations and pooling used in CNN can be better understood. As a result,modified CNN models(with fewer weights and hyperparameters) are developed that exhibit competitive and sometimes better performance in comparison with existing CNN models when applied to both CIFAR-10 and CIFAR-100 data sets.
出处 《Science China Mathematics》 SCIE CSCD 2019年第7期1331-1354,共24页 中国科学:数学(英文版)
基金 supported by the Elite Program of Computational and Applied Mathematics for PhD Candidates of Peking University supported in part by the National Science Foundation of USA (Grant No. DMS-1819157) the US Department of Energy Office of Science Office of Advanced Scientific Computing Research Applied Mathematics Program (Grant No. DE-SC0014400)
关键词 convolutional NEURAL network MULTIGRID UNIFIED FRAMEWORK NET WORK architecture convolutional neural network multigrid unified framework network architecture
  • 相关文献

共引文献3

同被引文献1

引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部