期刊文献+
共找到3篇文章
< 1 >
每页显示 20 50 100
Mutual information oriented deep skill chaining for multi‐agent reinforcement learning
1
作者 Zaipeng Xie Cheng Ji +4 位作者 Chentai Qiao wenzhan song Zewen Li Yufeng Zhang Yujing Zhang 《CAAI Transactions on Intelligence Technology》 SCIE EI 2024年第4期1014-1030,共17页
Multi‐agent reinforcement learning relies on reward signals to guide the policy networks of individual agents.However,in high‐dimensional continuous spaces,the non‐stationary environment can provide outdated experi... Multi‐agent reinforcement learning relies on reward signals to guide the policy networks of individual agents.However,in high‐dimensional continuous spaces,the non‐stationary environment can provide outdated experiences that hinder convergence,resulting in ineffective training performance for multi‐agent systems.To tackle this issue,a novel reinforcement learning scheme,Mutual Information Oriented Deep Skill Chaining(MioDSC),is proposed that generates an optimised cooperative policy by incorporating intrinsic rewards based on mutual information to improve exploration efficiency.These rewards encourage agents to diversify their learning process by engaging in actions that increase the mutual information between their actions and the environment state.In addition,MioDSC can generate cooperative policies using the options framework,allowing agents to learn and reuse complex action sequences and accelerating the convergence speed of multi‐agent learning.MioDSC was evaluated in the multi‐agent particle environment and the StarCraft multi‐agent challenge at varying difficulty levels.The experimental results demonstrate that MioDSC outperforms state‐of‐the‐art methods and is robust across various multi‐agent system tasks with high stability. 展开更多
关键词 artificial intelligence techniques decision making intelligent multi‐agent systems
下载PDF
Fast Distributed Demand Response Algorithm in Smart Grid 被引量:2
2
作者 Qifen Dong Li Yu +3 位作者 wenzhan song Junjie Yang Yuan Wu Jun Qi 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2017年第2期280-296,共17页
This paper proposes a fast distributed demand response U+0028 DR U+0029 algorithm for future smart grid based on primaldual interior method and Gaussian belief propagation U+0028 GaBP U+0029 solver. At the beginning o... This paper proposes a fast distributed demand response U+0028 DR U+0029 algorithm for future smart grid based on primaldual interior method and Gaussian belief propagation U+0028 GaBP U+0029 solver. At the beginning of each time slot, each end-user U+002F energysupplier exchanges limited rounds of messages that are not private with its neighbors, and computes the amount of energy consumption U+002F generation locally. The proposed demand response algorithm converges rapidly to a consumption U+002F generation decision that yields the optimal social welfare when the demands of endusers are low. When the demands are high, each end-user U+002F energysupplier estimates its energy consumption U+002F generation quickly such that a sub-optimal social welfare is achieved and the power system is ensured to operate within its capacity constraints. The impact of distributed computation errors on the proposed algorithm is analyzed theoretically. The simulation results show a good performance of the proposed algorithm. © 2017 Chinese Association of Automation. 展开更多
关键词 Electric power transmission networks Energy utilization
下载PDF
Decentralized Multigrid for In-situ Big Data Computing 被引量:1
3
作者 Goutham Kamath Lei Shi +2 位作者 Edmond Chow wenzhan song Junjie Yang 《Tsinghua Science and Technology》 SCIE EI CAS CSCD 2015年第6期545-559,共15页
Modern seismic sensors are capable of recording high precision vibration data continuously for several months. Seismic raw data consists of information regarding earthquake’s origin time, location, wave velocity, etc... Modern seismic sensors are capable of recording high precision vibration data continuously for several months. Seismic raw data consists of information regarding earthquake’s origin time, location, wave velocity, etc.Currently, these high volume data are gathered manually from each station for analysis. This process restricts us from obtaining high-resolution images in real-time. A new in-network distributed method is required that can obtain a high-resolution seismic tomography in real time. In this paper, we present a distributed multigrid solution to reconstruct seismic image over large dense networks. The algorithm performs in-network computation on large seismic samples and avoids expensive data collection and centralized computation. Our evaluation using synthetic data shows that the proposed method accelerates the convergence and reduces the number of messages exchanged. The distributed scheme balances the computation load and is also tolerant to severe packet loss. 展开更多
关键词 distributed multigrid cyber physical system big da
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部