Journal Articles
3 articles found
1. Research on Path Planning Based on the DDPG Algorithm (cited by: 1)
Authors: Zhang Yi, Guo Kun. Computer Knowledge and Technology (《电脑知识与技术》), 2021, Issue 4, pp. 193-194, 200 (3 pages)
Path planning is a classic problem in artificial intelligence, with wide applications in national defense, road traffic, robot simulation, and many other fields. However, most existing path-planning algorithms suffer from restriction to a single environment, discrete action spaces, and the need for manually constructed models. Reinforcement learning is a machine learning method in which the agent interacts with the environment on its own, without manually provided training data, and the development of deep reinforcement learning has further improved its ability to solve real-world problems. This paper applies the DDPG (Deep Deterministic Policy Gradient) algorithm of deep reinforcement learning to path planning, achieving path planning in continuous action spaces and complex environments.
Keywords: path planning, deep reinforcement learning, DDPG, actor-critic, continuous action space
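DDPG's core loop pairs a deterministic actor with a bootstrapped critic and slowly tracking target networks. The sketch below illustrates one critic update with Polyak averaging; it is not the paper's implementation, and the linear function approximators, dimensions, and learning rate are invented for the example:

```python
import numpy as np

rng = np.random.default_rng(0)
state_dim, action_dim = 3, 1
gamma, tau, lr = 0.99, 0.05, 0.01

# Hypothetical linear approximators: Q(s, a) = w_q . [s; a], mu(s) = W_mu @ s
w_q = rng.normal(size=state_dim + action_dim)
W_mu = rng.normal(size=(action_dim, state_dim))
w_q_t, W_mu_t = w_q.copy(), W_mu.copy()        # target networks

def mu(W, s):                                  # deterministic policy
    return W @ s

def q(w, s, a):                                # action-value estimate
    return w @ np.concatenate([s, a])

# One critic step on a single transition (s, a, r, s2)
s = rng.normal(size=state_dim)
a = np.array([0.1])
r, s2 = 1.0, rng.normal(size=state_dim)

# Bootstrapped target uses the *target* actor and critic: y = r + gamma * Q'(s2, mu'(s2))
y = r + gamma * q(w_q_t, s2, mu(W_mu_t, s2))
td_err = q(w_q, s, a) - y
w_q = w_q - lr * td_err * np.concatenate([s, a])   # gradient of squared TD error

# Polyak averaging keeps the targets slowly tracking the learned networks
w_q_t = tau * w_q + (1 - tau) * w_q_t
```

Because the policy is deterministic, exploration in practice comes from action noise added during data collection, which is what allows continuous action spaces without discretization.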
2. Multi-Agent Deep Reinforcement Learning for Cross-Layer Scheduling in Mobile Ad-Hoc Networks
Authors: Xinxing Zheng, Yu Zhao, Joohyun Lee, Wei Chen. China Communications (SCIE, CSCD), 2023, Issue 8, pp. 78-88 (11 pages)
Due to the fading characteristics of wireless channels and the burstiness of data traffic, how to deal with congestion in ad-hoc networks with effective algorithms remains open and challenging. In this paper, we focus on enabling congestion control to minimize network transmission delays through flexible power control. To effectively solve the congestion problem, we propose a distributed cross-layer scheduling algorithm, which is empowered by graph-based multi-agent deep reinforcement learning. The transmit power is adaptively adjusted in real time by our algorithm based only on local information (i.e., channel state information and queue length) and local communication (i.e., information exchanged with neighbors). Moreover, the training complexity of the algorithm is low due to the regional cooperation based on the graph attention network. In the evaluation, we show that our algorithm can reduce the transmission delay of data flows under severe signal interference and drastically changing channel states, and demonstrate its adaptability and stability in different topologies. The method is general and can be extended to various types of topologies.
Keywords: ad-hoc network, cross-layer scheduling, multi-agent deep reinforcement learning, interference elimination, power control, queue scheduling, actor-critic methods, Markov decision process
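The scheduling approach above relies only on local observations (queue length, channel state) and one-hop neighbor exchange, aggregated via graph attention. A toy sketch of that attention-weighted aggregation step follows; the topology, observation layout, and all weights here are hypothetical illustrations, not the paper's trained network:

```python
import numpy as np

rng = np.random.default_rng(1)
n = 4                                       # nodes in the ad-hoc network
adj = np.array([[0, 1, 1, 0],
                [1, 0, 1, 1],
                [1, 1, 0, 1],
                [0, 1, 1, 0]], float)       # hypothetical one-hop topology

# Local observation per node: (queue length, channel gain)
obs = np.stack([rng.uniform(0, 10, n),      # queue lengths
                rng.uniform(0.1, 1, n)]).T  # channel state info

W = rng.normal(size=(2, 2))                 # shared feature transform
h = obs @ W                                 # node embeddings

# Attention scores computed only over one-hop neighbours (local communication)
scores = h @ h.T
scores[adj == 0] = -np.inf                  # mask non-neighbours
alpha = np.exp(scores - scores.max(axis=1, keepdims=True))
alpha[adj == 0] = 0.0                       # masked entries contribute nothing
alpha /= alpha.sum(axis=1, keepdims=True)   # rows sum to 1

agg = alpha @ h                             # neighbour-aggregated features
# Each agent maps its aggregated features to a transmit power level in (0, 1)
power = 1.0 / (1.0 + np.exp(-(agg @ rng.normal(size=2))))
```

In the actual method these per-node outputs would feed an actor-critic learner, so each agent's power decision reflects both its own queue/channel state and its neighbors' — without any global coordinator.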
3. A new noise network and gradient parallelisation-based asynchronous advantage actor-critic algorithm
Authors: Zhengshun Fei, Yanping Wang, Jinglong Wang, Kangling Liu, Bingqiang Huang, Ping Tan. IET Cyber-Systems and Robotics (EI), 2022, Issue 3, pp. 175-188 (14 pages)
The asynchronous advantage actor-critic (A3C) algorithm is a commonly used policy optimization algorithm in reinforcement learning, in which "asynchronous" refers to parallel interactive sampling and training, and "advantage" refers to a multi-step reward-sampling estimation method for computing update weights. To address the low efficiency and insufficient convergence caused by the traditional heuristic exploration of the A3C algorithm, an improved A3C algorithm is proposed in this paper. In this algorithm, a noise network function, which updates the noise tensor in an explicit way, is constructed to train the agent. Generalised advantage estimation (GAE) is also adopted to compute the advantage function. Finally, a new mean-gradient parallelisation method is designed to update the parameters in both the primary and secondary networks by summing and averaging the gradients passed from all the sub-processes to the main process. Simulation experiments were conducted in a Gym environment using the PyTorch Agent Net (PTAN) reinforcement learning library, and the results show that the method enables the agent to complete training faster and converge better during training. The improved A3C algorithm outperforms the original algorithm and can provide new ideas for subsequent research on reinforcement learning algorithms.
Keywords: asynchronous advantage actor-critic (A3C), generalised advantage estimation (GAE), parallelisation, reinforcement learning
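GAE, which the improved A3C above adopts, has a compact concrete form: the advantage is an exponentially weighted sum of TD errors, A_t = Σ_l (γλ)^l δ_{t+l} with δ_t = r_t + γV(s_{t+1}) − V(s_t). A minimal sketch of the standard recursion (not taken from the paper's code):

```python
import numpy as np

def gae(rewards, values, gamma=0.99, lam=0.95):
    """Generalised advantage estimation over one trajectory.

    `values` has length len(rewards) + 1: per-state value estimates
    with a bootstrap value for the final state appended.
    """
    rewards = np.asarray(rewards, float)
    values = np.asarray(values, float)
    # One-step TD errors: delta_t = r_t + gamma * V(s_{t+1}) - V(s_t)
    deltas = rewards + gamma * values[1:] - values[:-1]
    adv = np.zeros_like(deltas)
    acc = 0.0
    # Backward recursion: A_t = delta_t + gamma * lam * A_{t+1}
    for t in reversed(range(len(deltas))):
        acc = deltas[t] + gamma * lam * acc
        adv[t] = acc
    return adv
```

Setting lam=0 recovers plain one-step TD errors (low variance, more bias), while lam=1 recovers full Monte Carlo returns minus the baseline (low bias, more variance); λ interpolates between the two, which is why GAE is a common drop-in for A3C's multi-step estimator.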