期刊文献+

基于MIC的深度置信网络研究 被引量:2

Deep Belief Networks Research Based on Maximum Information Coefficient
下载PDF
导出
摘要 传统的深度置信网络(DBNs)训练过程采用重构误差作为RBM网络的评价指标,它能在一定程度上反映网络对训练样本的似然度,但它并不是可靠的。而最大信息系数(MIC)能反映两个属性间的相关度,保留相关度较大的属性,且MIC较稳健,不易受异常值的影响,可作为网络评价指标。故提出一种基于最大信息系数(MIC)的深度置信网络方法,一方面用MIC对数据进行降维预处理,提高数据与网络的拟合度,降低网络分类误差;另一方面将MIC作为网络评价标准,改进重构误差的不可靠性。分别利用传统方法与基于MIC的深度置信网络方法对手写数据集MNIST和USPS进行分类实验,结果表明,基于MIC的深度置信网络方法能有效地提高识别率。 The traditional deep belief networks use reconstruction error as the evaluation criteria of restricted boltzmann machine(RBM) networks in the training process, which can reflect the likelihood between RBM network and training samples to some extent. However, it is not reliable. Maximum information coefficient (MIC), based on the estimations of Shannon entropy and conditional entropy, identifies interesting relationships between pairs of variables in large data sets and captures a subset of highly related features. The MIC can be used as a criterion for evaluating a network since it is robust to outliers. In order to construct models that fit data well and reduce classification error, a deep belief net- works based on MIC method was proposed. MIC is used not only in dimensionality reduction, but also in improving the unreliability of the reconstruction error. Classification experiments were performed on handwriting data sets of MNIST and USPS by several traditional methods and deep belief networks based on MIC method. The results show that the lat- ter can improve the recognition rate effectively.
作者 曾安 郑齐弥 ZENG An ZHENG Qi-mi(School of Computer Science and Technology, Guangdong University of Technology, Guangzhou 510000, China)
出处 《计算机科学》 CSCD 北大核心 2016年第8期249-253,共5页 Computer Science
基金 国家自然科学基金项目(61300107) 广东省自然科学基金项目(S2012010010212) 广州市科技计划项目(201504301341059 201505031501397)资助
关键词 深度置信网络 最大信息系数 重构误差 降维 DBNs,MIC,Reconstruction error,Dimensionality reduction
  • 相关文献

参考文献3

二级参考文献125

  • 1罗彬,游志胜,曹刚.基于边缘直方图的快速汽车标志识别方法[J].计算机应用研究,2004,21(6):150-151. 被引量:25
  • 2史忠值.神经网络[M].北京:高等教育出版社,2009.
  • 3李彦宏.2012百度年会主题报告:相信技术的力量[R].北京:百度,2013.
  • 4Hopfield J J. Neural networks and physical systems with emergent col|ective computational abilities [J]. Proceedings of the National Academy of Sciences of the United States of America, 1982, 79(8):2554-2558.
  • 5Hinton G E, Sejnowski T J. Optimal perceptual inference [C]//Proc of the 1983 IEEE Conf on Computer Vision and Pattern Recognition. Los Alamitos, CA: IEEE Computer Society, 1983: 448-453.
  • 6Hinton G E, Sejnowski T J. Analyzing cooperative computation [C] //Proc of the 5th Annual Congress of the Cognitive Science Society. New York: ACM, 1983: 2554- 2558.
  • 7Hofstadter D R. The copycat project: An experiment in nondeterminism and creative analogies [DB/OL:. MIT Artificial Intelligence Laboratory Memo 755. (1984- 01-01) [2004-10-01]. http://hdl, handle, net/1721.1/5648.
  • 8Hofstadter D R. A Non-Deterministic Approach to Analogy, Involving the Ising Model of Ferromagnetism[M] //The Physics of Cognitive Processes. Hackensack: World Scientific, 1987.
  • 9Smolensky P. Information Processing in Dynamical Systems: Foundations of Harmony Theory [M]//Parallel Distributed Processing, Vol 1: Foundations. Cambridge: MIT Press, 1986: 194-281.
  • 10Ackley D H, Hinton G E, Sejnowski T J. A learning algorithm for Boltzmann machines [J]. Cognitive Science, 1985, 9(1): 147-169.

共引文献298

同被引文献16

引证文献2

二级引证文献8

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部