
Information Geometric Analysis of Pruning Algorithm (修剪算法的信息几何分析)
Abstract: Pruning is one of the principal methods for determining and optimizing the structure of a neural network. Most current research on pruning concentrates on describing the algorithms themselves; the underlying mechanism of pruning has received comparatively little attention, although understanding that mechanism would provide a theoretical basis for choosing pruning strategies. This paper analyzes the mechanism of pruning from the viewpoint of information geometry and gives an information-geometric interpretation of structural pruning for neural networks. Exploiting the hierarchical structure of the neural manifold's parameter space, the pruning process is formulated as a sequence of information projections from the current model manifold onto its submanifolds. A new pruning algorithm is proposed on the basis of this analysis, and its feasibility and effectiveness are verified experimentally.
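As a reading aid only (the paper's exact formulation is not reproduced in this record), the "information projection" mentioned in the abstract can be sketched with the standard m-projection of information geometry. Let S denote the manifold of distributions realizable by the current network and let M_i ⊂ S be the submanifold obtained by constraining the i-th prunable parameter to zero. The projection of the current model p onto M_i is the point of M_i closest in Kullback-Leibler divergence:

q_i^* = \arg\min_{q \in M_i} D_{\mathrm{KL}}(p \,\|\, q), \qquad D_{\mathrm{KL}}(p \,\|\, q) = \sum_x p(x) \log \frac{p(x)}{q(x)}.

A pruning pass of the kind the abstract describes would then compare the divergences D_{\mathrm{KL}}(p \,\|\, q_i^*) over the candidate submanifolds, move to the submanifold with the smallest information loss, and repeat; the hierarchical nesting of submanifolds within S is what allows the projection step to be iterated. The symbols S, M_i, and q_i^* are illustrative notation, not necessarily the paper's own.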
Source: Journal of Computer Research and Development (《计算机研究与发展》; EI, CSCD, Peking University Core Journal), 2006, No. 9, pp. 1609-1614 (6 pages).
Funding: National Natural Science Foundation of China (60373029); Doctoral Program Foundation of the Ministry of Education of China (20020004020); Science and Technology Foundation of Beijing Jiaotong University (2005RC044).
Keywords: pruning; information geometry; neural manifold; information projection