Cooperative Multi-Agent Reinforcement Learning with Constraint-Reduced DCOP

Cooperative Multi-Agent Reinforcement Learning with Constraint-Reduced DCOP

下载PDF

导出

摘要 Cooperative multi-agent reinforcement learning（ MARL） is an important topic in the field of artificial intelligence,in which distributed constraint optimization（ DCOP） algorithms have been widely used to coordinate the actions of multiple agents. However,dense communication among agents affects the practicability of DCOP algorithms. In this paper,we propose a novel DCOP algorithm dealing with the previous DCOP algorithms＇ communication problem by reducing constraints.The contributions of this paper are primarily threefold：（1） It is proved that removing constraints can effectively reduce the communication burden of DCOP algorithms.（2） An criterion is provided to identify insignificant constraints whose elimination doesn＇t have a great impact on the performance of the whole system.（3） A constraint-reduced DCOP algorithm is proposed by adopting a variant of spectral clustering algorithm to detect and eliminate the insignificant constraints. Our algorithm reduces the communication burdern of the benchmark DCOP algorithm while keeping its overall performance unaffected. The performance of constraint-reduced DCOP algorithm is evaluated on four configurations of cooperative sensor networks. The effectiveness of communication reduction is also verified by comparisons between the constraint-reduced DCOP and the benchmark DCOP. Cooperative multi-agent reinforcement learning（ MARL） is an important topic in the field of artificial intelligence,in which distributed constraint optimization（ DCOP） algorithms have been widely used to coordinate the actions of multiple agents. However,dense communication among agents affects the practicability of DCOP algorithms. In this paper,we propose a novel DCOP algorithm dealing with the previous DCOP algorithms＇ communication problem by reducing constraints.The contributions of this paper are primarily threefold：（1） It is proved that removing constraints can effectively reduce the communication burden of DCOP algorithms.（2） An criterion is provided to identify insignificant constraints whose elimination doesn＇t have a great impact on the performance of the whole system.（3） A constraint-reduced DCOP algorithm is proposed by adopting a variant of spectral clustering algorithm to detect and eliminate the insignificant constraints. Our algorithm reduces the communication burdern of the benchmark DCOP algorithm while keeping its overall performance unaffected. The performance of constraint-reduced DCOP algorithm is evaluated on four configurations of cooperative sensor networks. The effectiveness of communication reduction is also verified by comparisons between the constraint-reduced DCOP and the benchmark DCOP.

作者 Yi Xie Zhongyi Liu Zhao Liu Yijun Gu

机构地区 School of Information Technology and Cybersecurity School of Management School of Postgraduates

出处《Journal of Beijing Institute of Technology》 EI CAS 2017年第4期525-533,共9页 北京理工大学学报（英文版）

基金 Supported by the National Social Science Foundation of China(15ZDA034,14BZZ028) Beijing Social Science Foundation(16JDGLA036) JKF Program of People’s Public Security University of China(2016JKF01318)

关键词 reinforcement learning cooperative multi-agent system distributed constraint optimization （DCOP） constraint-reduced DCOP reinforcement learning cooperative multi-agent system distributed constraint optimization （DCOP） constraint-reduced DCOP

分类号 TP18 [自动化与计算机技术—控制理论与控制工程]

引文网络
相关文献

1Chao Chen,Xinbiao Lu,Chunming Wu,Xiao Jiang,Chen Mao.Origin and Geodynamic Implications of Concealed Granite in Shadong Tungsten Deposit, Xinjiang, China： Zircon U-Pb Chronology, Geochemistry, and Sr-Nd-Hf Isotope Constraint[J].Journal of Earth Science,2018,29(1):114-129. 被引量：10
2Hongguang Pan,Weimin Zhong,Zaiying Wang.An on-line constraint softening strategy to guarantee the feasibility of dynamic controller in double-layered MPC[J].Chinese Journal of Chemical Engineering,2017,25(12):1805-1811.
3Yu Lin-Sen,Liu Yong-Mei,Sun Guang-Lu,Li Peng.An Evolutional Learning Algorithm Based on Weighted Likelihood for Image Segmentation[J].国际计算机前沿大会会议论文集,2015(1):61-62.
4Shengbing Ren,Mengyu Jia,Fei Huang,Yuan Liu.Visualization Analysis Framework for Large-Scale Software Based on Software Network[J].国际计算机前沿大会会议论文集,2017(1):185-187.
5Yue Wang,Hongzhi Wang,Chen Ye,Hong Gao.Graph Similarity Join with K-Hop Tree Indexing[J].国际计算机前沿大会会议论文集,2015(1):13-14.
6王天成,刘相振,董泽政,王海波.一种自适应鲁棒最小体积高光谱解混算法[J].自动化学报,2017,43(12):2141-2159. 被引量：6
7Xiao Chen,Fazhi He,Yiteng Pan,Haojun Ai.Selective Image Matting with Scalable Variance and Model Rectification[J].国际计算机前沿大会会议论文集,2017(1):134-138.
8Chunyuan Zhang,Yujiao Chen.DCC: Distributed Cache Consistency[J].国际计算机前沿大会会议论文集,2017(2):89-90.

Journal of Beijing Institute of Technology

2017年第4期

浏览历史

内容加载中请稍等...

Cooperative Multi-Agent Reinforcement Learning with Constraint-Reduced DCOP

相关作者

相关机构

相关主题

浏览历史