期刊文献+

一种基于强化学习的深度信念网络设计方法 被引量:4

A Design Method for Deep Belief Network Based on Reinforcement Learning
下载PDF
导出
摘要 近年来,基于深度学习思想发展起来的深度信念网络(Deep Belief Networks, DBN)在人工智能和大数据预测分析中得到了成功的应用。由于DBN的隐含层数较多,传统的DBN有监督精调(Fine-tuning)方法-BP算法很难得到令人满意的学习精度,甚至会因为梯度扩散(Gradient Diffusion)导致精度调节失败,且网络鲁棒性差。针对此问题,提出一种基于强化学习策略的DBN模型(RL-DBN)及其算法。首先利用自适应对比散度(Adaptive Contrastive Divergence, ACD)算法来快速预训练DBN的隐含层以获取较优的初始权值,然后用强化学习算法代替BP算法对DBN进行精调以提高有监督学习的精度和网络的鲁棒性。实验结果表明,相较于现有的类似模型,RL-DBN在学习速度、精度以及鲁棒性能等方面均有较大提高。 In recent years, deep learning-based deep belief network(DBN) has achieved successful applications in artificial intelligence and big data prediction analysis. However, too many hidden layers in DBN easily leads to a poor learning accuracy of supervised fine-tuning method(BP algorithm), even failure because of gradient diffusion, and robustness is poor. For this problem, an improved DBN based on reinforcement learning(RL-DBN) is proposed. First, adaptive contrastive divergence(ACD) algorithm is used to fast pre-train the hidden layers of DBN so that the better initial weight can be achieved, then the RL algorithm is used to replace BP algorithm to fine-tune DBN so that higher accuracy and better robustness can be achieved. The experimental results show that, compared with several existing similar models, the proposed RL-DBN achieves better performance in learning rate, accuracy and robustness.
作者 邢海霞 程乐 XING Hai-xia;CHENG Le(Jiangsu Software Testing Engineering Technology Research and Development Center,Huai'an 223003,China)
出处 《控制工程》 CSCD 北大核心 2019年第11期2115-2120,共6页 Control Engineering of China
基金 江苏省高校自然科学基金项目(16KJB520049)
关键词 深度信念网络 强化学习 自适应对比散度 鲁棒性能 Deep belief network reinforcement learning adaptive contrastive divergence robustness
  • 相关文献

参考文献2

二级参考文献8

共引文献25

同被引文献35

引证文献4

二级引证文献8

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部