Linguistic reward-oriented Takagi-Sugeno (T-S) fuzzy reinforcement learning
Abstract: This paper presents a learning method that addresses two important subproblems in reinforcement learning together: continuous spaces and linguistic rewards. The linguistic-reward-oriented Takagi-Sugeno fuzzy reinforcement learning (LRTSFRL) agent is built by combining the Q-learning method with Takagi-Sugeno (T-S) fuzzy inference systems. The method is suited to complex learning tasks in continuous domains and can also be used to design Takagi-Sugeno fuzzy logic controllers. Experiments on a double inverted pendulum control system demonstrated the improved performance of the scheme.
Source: Journal of Tsinghua University (Science and Technology) (《清华大学学报(自然科学版)》), indexed in EI, CAS, CSCD, and the Peking University Core Journals list; 2002, No. 10, pp. 1393-1396 (4 pages)
Funding: National Key Basic Research Development Program of China ("973" Program), Grant G1999032707
Keywords: reinforcement learning; linguistic rewards; Takagi-Sugeno fuzzy reinforcement learning; Takagi-Sugeno fuzzy inference systems; neuro-fuzzy control; function approximation; Q-learning; expert systems; fuzzy numbers
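
The abstract describes combining Q-learning with a Takagi-Sugeno fuzzy inference system and learning from linguistic rewards. The record does not give the paper's equations, so the following is only a minimal sketch of that general idea under stated assumptions, not the authors' LRTSFRL algorithm. It assumes a one-dimensional state, Gaussian rule antecedents, zero-order (constant) T-S consequents, a small discrete action set, and linguistic rewards modeled as triangular fuzzy numbers that are defuzzified by their centroid. The names `FuzzyQ`, `LINGUISTIC_REWARDS`, and `triangular_centroid` are hypothetical.

```python
import math

def triangular_centroid(a, b, c):
    """Centroid of a triangular fuzzy number with support [a, c] and peak b."""
    return (a + b + c) / 3.0

# Hypothetical reward vocabulary: each word is a triangular fuzzy number.
LINGUISTIC_REWARDS = {
    "bad": (-1.0, -1.0, 0.0),
    "fair": (-0.5, 0.0, 0.5),
    "good": (0.0, 1.0, 1.0),
}

class FuzzyQ:
    """Q-function approximated by a zero-order T-S fuzzy system (assumed form)."""

    def __init__(self, centers, width, n_actions, alpha=0.1, gamma=0.9):
        self.centers = centers      # centers of the Gaussian antecedents
        self.width = width          # shared width of the antecedents
        self.alpha = alpha          # learning rate
        self.gamma = gamma          # discount factor
        # q[i][a] is the constant consequent of rule i for action a.
        self.q = [[0.0] * n_actions for _ in centers]

    def firing(self, x):
        """Normalized firing strengths of all rules for state x."""
        w = [math.exp(-((x - c) / self.width) ** 2) for c in self.centers]
        total = sum(w) or 1.0       # guard against underflow to zero
        return [wi / total for wi in w]

    def q_values(self, x):
        """T-S output: firing-strength-weighted average of rule consequents."""
        phi = self.firing(x)
        n_actions = len(self.q[0])
        return [sum(p * row[a] for p, row in zip(phi, self.q))
                for a in range(n_actions)]

    def update(self, x, action, reward_word, x_next):
        """One Q-learning step driven by a linguistic reward."""
        r = triangular_centroid(*LINGUISTIC_REWARDS[reward_word])
        target = r + self.gamma * max(self.q_values(x_next))
        td_error = target - self.q_values(x)[action]
        # Each rule is credited in proportion to how strongly it fired.
        for i, p in enumerate(self.firing(x)):
            self.q[i][action] += self.alpha * td_error * p
        return td_error

agent = FuzzyQ(centers=[-1.0, 0.0, 1.0], width=0.5, n_actions=2)
td = agent.update(x=0.0, action=1, reward_word="good", x_next=0.1)
```

In this sketch the fuzzy rules act as a function approximator over the continuous state, which is one common way to carry tabular Q-learning into continuous domains. A controller with continuous outputs, as in the double inverted pendulum task, would need consequents that produce actions rather than per-action values.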
