摘要
为了提高机器臂轨迹跟踪控制器的工作性能,提出基于增强学习的机械臂轨迹跟踪控制方法。介绍了增强学习的基本原理,提出基于SARSA算法的增强学习补偿控制策略。利用比例—微分(PD)控制器完成了基本的稳定任务后,再利用增强学习算法实现了对未知干扰因素的补偿,提升了对不同未知情况的适应能力。实验结果验证了自适应离散化增强学习方法在机械臂轨迹跟踪问题中的可行性和有效性,明显提高了控制器的学习速度。
To improve the working performance of robotic trajectory tracking controller,the robotic trajectory tracking control method based on reinforcement learning was proposed.The basic principle of reinforcement learning was introduced,and then the robot trajectory tracking control strategy based on SARSA was proposed.By using the reinforcement learning,the unknown disturbance factors were compensated and the adaptability to the unknown was improved after the PD control method was applied.The experimental results verified the feasibility and effectiveness of the reinforcement learning method in the trajectory tracking problem of robot arms,and the learning speed of the controller was enhanced.
作者
刘卫朋
邢关生
陈海永
孙鹤旭
LIU Weipeng;XING Guansheng;CHEN Haiyong;SUN Hexu(School of Control Science and Engineering,Hebei University of Technology,Tianjin 300130,China;School of Automation and Electronic Engineering,Qingdao University of Science and Technology,Qingdao 266042,China)
出处
《计算机集成制造系统》
EI
CSCD
北大核心
2018年第8期1996-2004,共9页
Computer Integrated Manufacturing Systems
基金
河北省科技计划资助项目(17211804D)
天津市教委科研计划资助项目(20140728)
河北省自然科学基金资助项目(F2018202078)
河北省首批青年拔尖人才支持计划资助项目(210003)
天津市自然科学基金资助项目(16JCQNJC04200)~~