摘要
针对结构固定认知模型中存在的学习浪费与计算浪费问题,在具有内发动机机制的感知行动认知模型基础上,根据操作条件反射学习特性,借鉴潜在动作原理,建立起一种具有发育机制的感知行动认知模型D-SSCM (Development-sensorimotor cognitive model),具体为一个14元组,包含离散学习时间集、内部可感知离散状态集、可输出动作集、有效输出动作空间集、潜在动作关系集、可输出动作空间探索率集及发育算法等.针对模型发育过程,分别设计了模型结构扩展式发育方法和算法以及缩减式发育方法和算法,定义了模型的发育式学习过程.使用两轮机器人自平衡任务对设计的学习模型进行验证,实验结果表明,发育机制下的感知行动认知模型D-SSCM具有更快的学习速度及更稳定的学习效果.
Aiming at the problems of learning waste and computing waste that exist in the cognitive models with fixed structure,and according to the operate conditioning learning characteristics as well as drawing on the affordance theory,a new kind of sensorimotor cognitive model named D-SSCM with the developmental mechanism is established based on the sensorimotor cognitive model with the mechanism of intrinsic motivation.D-SSCM is a fourteen tuple in specific,including discrete learning time set,internal sensible discrete state set,optional motion set,effective output motion space set,affordance relationship set,optional motion space exploration rate set,developmental algorithm and etc.In view of D-SSCM’s developmental learning,extended developmental method and algorithm as well as reduced developmental method and algorithm are designed.Model’s developmental learning process is defined.Using two-wheeled robot self-balancing task to test this designed model,results show that D-SSCM is with faster learning speed and more stable learning effect.
作者
张晓平
阮晓钢
王力
李志军
闫佳庆
毕松
ZHANG Xiao-Ping;RUAN Xiao-Gang;WANG Li;LI Zhi-Jun;YAN Jia-Qing;BI Song(School of Electrical and Control Engineering,North China University of Technology,Beijing 100144;Faculty of In-formation Technology,Beijing University of Technology,Beijing 100124)
出处
《自动化学报》
EI
CAS
CSCD
北大核心
2021年第2期391-403,共13页
Acta Automatica Sinica
基金
北京市自然科学基金(4204096,4184086)
国家自然科学基金(61903006,61174116)
北京市教委项目(KM201610009001)
北京高校高精尖学科建设项目
北方工业大学青年毓优人才项目
北方工业大学科研启动基金项目资助。
关键词
发育
操作条件反射
潜在动作
感知行动认知
两轮机器人
Development
operant conditioning
affordance
sensorimotor cognitive
two-wheeled robot