期刊文献+
共找到1篇文章
< 1 >
每页显示 20 50 100
Optimal Policies for Quantum Markov Decision Processes 被引量:2
1
作者 ming-sheng ying Yuan Feng Sheng-Gang ying 《International Journal of Automation and computing》 EI CSCD 2021年第3期410-421,共12页
Markov decision process(MDP)offers a general framework for modelling sequential decision making where outcomes are random.In particular,it serves as a mathematical framework for reinforcement learning.This paper intro... Markov decision process(MDP)offers a general framework for modelling sequential decision making where outcomes are random.In particular,it serves as a mathematical framework for reinforcement learning.This paper introduces an extension of MDP,namely quantum MDP(q MDP),that can serve as a mathematical model of decision making about quantum systems.We develop dynamic programming algorithms for policy evaluation and finding optimal policies for q MDPs in the case of finite-horizon.The results obtained in this paper provide some useful mathematical tools for reinforcement learning techniques applied to the quantum world. 展开更多
关键词 Quantum Markov decision processes quantum machine learning reinforcement learning dynamic programming decision making
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部