检索结果-维普期刊中文期刊服务平台

期刊文献⁺

任意字段

题名或关键词

题名

关键词

文摘

作者

第一作者

机构

刊名

分类号

参考文献

作者简介

基金资助

栏目信息

共找到4篇文章

< 1 >

每页显示 20 50 100

已选择0条

导出题录引用分析

统计分析

显示方式：

文摘详细列表

相关度排序被引量排序时效性排序

基于多智能体增强学习的公交驻站控制方法被引量：6: 1; 作者陈春晓陈治亚陈维亚《计算机工程与应用》 CSCD 北大核心 2015年第17期8-13,27,共7页; 车辆驻站是减少串车现象和改善公交服务可靠性的常用且有效控制策略,其执行过程需要在随机交互的系统环境中进行动态决策。考虑实时公交运营信息的可获得性,研究智能体完全合作环境下公交车辆驻站增强学习控制问题,建立基于多智能体系... 展开更多; 关键词驻站多智能体增强学习多智能体系统控制策略; 下载PDF 职称材料

面向工业5G+时间敏感网络的分布式流调度策略: 2; 作者李明妍刘厚灵古富强《移动通信》 2023年第8期2-8,共7页; 5G和时间敏感网络的融合是工业制造无线升级的关键技术。在3GPP发布的版本16中,研究的关键方向之一是增强5G系统以满足TSN支持的工业应用,助力提升工业互联网的实时转发与泛在感知能力。由于TSN和5G系统的服务质量保证机制相互独立,目... 展开更多; 关键词 5G+工业互联网时间敏感网络多智能体深度增强学习; 下载PDF 职称材料

A distributed algorithm for signal coordination of multiple agents with embedded platoon dispersion model: 3; 作者别一鸣王殿海 +1 位作者马东方朱自博《Journal of Southeast University(English Edition)》 EI CAS 2011年第3期311-315,共5页; In order to reduce average arterial vehicle delay, a novel distributed and coordinated traffic control algorithm is developed using the multiple agent system and the reinforce learning （RL）. The RL is used to minimi... 展开更多; 关键词 multiple agents signal coordination reinforce learning platoon dispersion model; 下载PDF 职称材料

Multi-agent reinforcement learning with cooperation based on eligibility traces: 4; 作者杨玉君程君实陈佳品《Journal of Harbin Institute of Technology(New Series)》 EI CAS 2004年第5期564-568,共5页; The application of reinforcement learning is widely used by multi-agent systems in recent years. An agent uses a multi-agent system to cooperate with other agents to accomplish the given task, and one agent′s behavio... 展开更多; 关键词 reinforcement learning MULTI-AGENT BEHAVIOR eligibility trace; 下载PDF 职称材料

	题名	作者	出处	发文年	被引量	操作
1	基于多智能体增强学习的公交驻站控制方法	陈春晓陈治亚陈维亚	《计算机工程与应用》 CSCD 北大核心	2015	6	下载PDF 职称材料
2	面向工业5G+时间敏感网络的分布式流调度策略	李明妍刘厚灵古富强	《移动通信》	2023	0	下载PDF 职称材料
3	A distributed algorithm for signal coordination of multiple agents with embedded platoon dispersion model	别一鸣王殿海马东方朱自博	《Journal of Southeast University(English Edition)》 EI CAS	2011	0	下载PDF 职称材料
4	Multi-agent reinforcement learning with cooperation based on eligibility traces	杨玉君程君实陈佳品	《Journal of Harbin Institute of Technology(New Series)》 EI CAS	2004	0	下载PDF 职称材料

已选择0条

导出题录引用分析

统计分析

使用帮助返回顶部