Abstract
This paper studies the IGA algorithm in a dynamic learning environment. To overcome the limitation of a constant step size along the gradient direction, it introduces a variable learning rate and presents the WoLF principle as the method for adjusting that rate, which accelerates convergence. Finally, the Q-learning algorithm is improved on the basis of this method, and simulation experiments demonstrate the effectiveness of the resulting algorithm.
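The abstract gives no implementation detail, so the following is only a minimal sketch of how gradient ascent with a WoLF-adjusted learning rate is commonly set up, not the paper's own code. It assumes a two-player, two-action matrix game in which each player knows an equilibrium strategy, and it uses the usual reading of WoLF ("Win or Learn Fast"): take small steps when the current strategy does better than the equilibrium strategy would, and large steps otherwise. The names `run_iga`, `value`, `eta`, `l_min`, and `l_max`, and all numeric values, are hypothetical choices for illustration.

```python
def value(M, a, b):
    """Expected payoff under matrix M when the row player picks action 0
    with probability a and the column player picks action 0 with probability b."""
    return (a * b * M[0][0] + a * (1 - b) * M[0][1]
            + (1 - a) * b * M[1][0] + (1 - a) * (1 - b) * M[1][1])


def clip01(x):
    """Project a probability back into [0, 1]."""
    return min(1.0, max(0.0, x))


def run_iga(R, C, eq, start, steps=20000, eta=0.001, l_min=1.0, l_max=1.0):
    """Gradient ascent for both players of a 2x2 game.

    R, C  : payoff matrices of the row and column player
    eq    : (a_eq, b_eq), an equilibrium assumed to be known (an assumption)
    l_min : step multiplier while "winning"; l_max: multiplier while "losing"
    With l_min == l_max the step size is constant (plain gradient ascent).
    """
    a, b = start
    a_eq, b_eq = eq
    u_r = R[0][0] - R[0][1] - R[1][0] + R[1][1]
    u_c = C[0][0] - C[0][1] - C[1][0] + C[1][1]
    for _ in range(steps):
        # Partial derivatives of each player's expected payoff
        # with respect to that player's own probability.
        grad_a = b * u_r - (R[1][1] - R[0][1])
        grad_b = a * u_c - (C[1][1] - C[1][0])
        # WoLF rule: a player is "winning" if its current strategy earns more
        # against the opponent's current strategy than its equilibrium one.
        l_a = l_min if value(R, a, b) > value(R, a_eq, b) else l_max
        l_b = l_min if value(C, a, b) > value(C, a, b_eq) else l_max
        a, b = clip01(a + eta * l_a * grad_a), clip01(b + eta * l_b * grad_b)
    return a, b


# Matching pennies: zero-sum, with the mixed equilibrium at (0.5, 0.5).
R = [[1, -1], [-1, 1]]
C = [[-1, 1], [1, -1]]
wolf = run_iga(R, C, eq=(0.5, 0.5), start=(0.7, 0.4), l_min=1.0, l_max=4.0)
const = run_iga(R, C, eq=(0.5, 0.5), start=(0.7, 0.4))
```

In this toy game the constant-step run keeps circling the equilibrium, while the WoLF run spirals in toward (0.5, 0.5). This is the kind of behaviour the variable learning rate is meant to produce.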
Source
《长春工程学院学报(自然科学版)》
2009, Issue 4, pp. 81-83 (3 pages)
Journal of Changchun Institute of Technology:Natural Sciences Edition