期刊文献+

一种基于强化学习的五子棋博弈程序的设计与实现

Design and implementation of a gobang game program based on reinforcement learning
下载PDF
导出
摘要 提出了一种基于蒙特卡洛树和深度神经网络的强化学习方法,用于训练一个具有较高棋力水平的五子棋算法模型。该模型利用蒙特卡洛树搜索在给定的棋盘状态下进行自我对弈,通过策略价值网络评估每个可行的落子位置的先验概率和最终价值,并选择最优的落子方案。实验结果表明该模型具有较强的泛化能力,以此设计的五子棋博弈程序在2022年中国大学生计算机博弈大赛暨中国计算机博弈锦标赛中获得一等奖。 A reinforcement learning method based on Monte Carlo trees and deep neural networks has been proposed to train a gobang algorithm model with high chess power levels.The model uses the Monte Carlo tree search to conduct self play under the given chessboard state,evaluates the prior probability and final value of each feasible drop position through the strategic value net-work,and selects the optimal drop scheme.The experimental results indicate that the model has strong generalization ability,and the Gobang game program designed based on this won first prize in the 2022 China University Computer Game Competition and China Computer Game Championship.
作者 刘克 曹杨 金张根 孔维立 Liu Ke;Cao Yang;Jin Zhanggen;Kong Weili(School of Information and Control Engineering,Liaoning Shihua University,Fushun 113001,China;School of Artificial Intelligence and Software,Liaoning Shihua University,Fushun 113001,China)
出处 《现代计算机》 2023年第19期102-105,共4页 Modern Computer
基金 辽宁省大学生创新创业训练项目(S202210148038) 辽宁省教育厅科学研究项目(LJKMZ20220754)。
关键词 五子棋 博弈 卷积神经网络 强化学习 Gobang game convolutional neural network reinforcement learning
  • 相关文献

参考文献2

二级参考文献15

  • 1[1]Von NEUMANN J,MORGENSTERN O.Theory of games and economic behavior[M].Princeton:Princeton University Press,1944.
  • 2[2]SHANNON C E.Programming a computer for playing chess[J].Philosophical Magazine,1950,41:256-275.
  • 3[3]TURING A.Digital computers applied to games[C]//Faster than Thought.London,1953:286-295.
  • 4[4]FULLER S H,GASCHING J G,GILLOGLY J J.An analysis of the alpha-beta pruning algorithm[D].Pittsburg:Carnegie-Mellon University,1973.
  • 5[5]KNUTH D E,MOORE R N.An analysis of alpha-beta pruning[J].Artificial Intelligence,1975(6):293-326.
  • 6[6]KORF R.Iterative deepening:an optimal admissible tree search[J].Artificial Intelligence,1985,27(1):97-109.
  • 7[7]ELIZABETH P.Breakthrough of the year:human genetic vaviation[J].Science,2007,318(5858):1842-1849.
  • 8[9]潘丽娟.打扑克人脑险胜电脑[EB/OL].[2007-07-27].http://sports.sohu.com.
  • 9[17]摩尔根与果蝇[EB/OL].[2008-01-06].http://basic.shsmu.edu.cn/jpkc/Marx_philosophy/yxyzx/12.ppt.
  • 10[18]何黎.扑克牌里的博弈之道[EB/OL].[2008-01-06].http://bbs.mso.com.cn/viewthread.php?tid=645174.

共引文献39

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部