期刊文献+
共找到3篇文章
< 1 >
每页显示 20 50 100
面向多智能体博弈对抗的对手建模框架 被引量:8
1
作者 罗俊仁 张万鹏 +3 位作者 袁唯淋 胡振震 陈少飞 陈璟 《系统仿真学报》 CAS CSCD 北大核心 2022年第9期1941-1955,共15页
对手建模作为多智能体博弈对抗的关键技术,是一种典型的智能体认知行为建模方法。介绍了多智能体博弈对抗几类典型模型、非平稳问题和元博弈相关理论;梳理总结对手建模方法,归纳了对手建模前沿理论,并对其应用前景及面对的挑战进行分析... 对手建模作为多智能体博弈对抗的关键技术,是一种典型的智能体认知行为建模方法。介绍了多智能体博弈对抗几类典型模型、非平稳问题和元博弈相关理论;梳理总结对手建模方法,归纳了对手建模前沿理论,并对其应用前景及面对的挑战进行分析。基于元博弈理论,构建了一个包括对手策略识别与生成、对手策略空间重构和对手利用共三个模块的通用对手建模框架。期望为多智能体博弈对抗对手建模方面的理论与方法研究提供有价值的参考。 展开更多
关键词 多智能体 对手建模 认知行为建模 元博弈
下载PDF
Transformer in reinforcement learning for decision-making:a survey
2
作者 weilin yuan Jiaxing CHEN +4 位作者 Shaofei CHEN Dawei FENG Zhenzhen HU Peng LI Weiwei ZHAO 《Frontiers of Information Technology & Electronic Engineering》 SCIE EI CSCD 2024年第6期763-790,共28页
Reinforcement learning(RL)has become a dominant decision-making paradigm and has achieved notable success in many real-world applications.Notably,deep neural networks play a crucial role in unlocking RL’s potential i... Reinforcement learning(RL)has become a dominant decision-making paradigm and has achieved notable success in many real-world applications.Notably,deep neural networks play a crucial role in unlocking RL’s potential in large-scale decision-making tasks.Inspired by current major success of Transformer in natural language processing and computer vision,numerous bottlenecks have been overcome by combining Transformer with RL for decision-making.This paper presents a multiangle systematic survey of various Transformer-based RL(TransRL)models applied in decision-making tasks,including basic models,advanced algorithms,representative implementation instances,typical applications,and known challenges.Our work aims to provide insights into problems that inherently arise with the current RL approaches,and examines how we can address them with better TransRL models.To our knowledge,we are the first to present a comprehensive review of the recent Transformer research developments in RL for decision-making.We hope that this survey provides a comprehensive review of TransRL models and inspires the RL community in its pursuit of future directions.To keep track of the rapid TransRL developments in the decision-making domains,we summarize the latest papers and their open-source implementations at https://github.com/williamyuanv0/Transformer-in-Reinforcement-Learning-for-Decision-Making-A-Survey. 展开更多
关键词 TRANSFORMER Reinforcement learning(RL) Decision-making(DM) Deep neural network(DNN) Multi-agent reinforcement learning(MARL) Meta-reinforcement learning(Meta-RL)
原文传递
3D Surface velocity retrieval of mountain glacier using an offset tracking technique applied to ascending and descending SAR constellation data:a case study of the Yiga Glacier 被引量:1
3
作者 Qun Wang Jinghui Fan +6 位作者 Wei Zhou Liqiang Tong Zhaocheng Guo Guang Liu weilin yuan Joaquim Joao Sousa Zbigniew Perski 《International Journal of Digital Earth》 SCIE EI 2019年第6期614-624,共11页
COSMO-SkyMed is a constellation of four X-band high-resolution radar satellites with a minimum revisit period of 12 hours.These satellites can obtain ascending and descending synthetic aperture radar(SAR)images with v... COSMO-SkyMed is a constellation of four X-band high-resolution radar satellites with a minimum revisit period of 12 hours.These satellites can obtain ascending and descending synthetic aperture radar(SAR)images with very similar periods for use in the three-dimensional(3D)inversion of glacier velocities.In this paper,based on ascending and descending COSMO-SkyMed data acquired at nearly the same time,the surface velocity of the Yiga Glacier,located in the Jiali County,Tibet,China,is estimated in four directions using an offset tracking technique during the periods of 16 January to 3 February 2017 and 1 February to 19 February 2017.Through the geometrical relationships between the measurements and the SAR images,the least square method is used to retrieve the 3D components of the glacier surface velocity in the eastward,northward and upward directions.The results show that applying the offset tracking technique to COSMO-SkyMed images can be used to derive the true 3D velocity of a glacier’s surface.During the two periods,the Yiga Glacier had a stable velocity,and the maximum surface velocity,2.4 m/d,was observed in the middle portion of the glacier,which corresponds to the location of the steepest slope. 展开更多
关键词 Mountain glacier inversion of 3D movement offset tracking SAR constellation data
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部