Resource Allocation Algorithm of Urban Rail Train-to-Train Communication with A2C-ac
Abstract In urban rail transit train control systems, Train-to-Train (T2T) communication, a new generation of train communication mode, uses direct communication between trains to reduce communication delay and improve train operation efficiency. For the scenario in which T2T communication coexists with Train-to-Ground (T2G) communication, this paper proposes an improved Advantage Actor-Critic-ac (A2C-ac) resource allocation algorithm based on Multi-Agent Deep Reinforcement Learning (MADRL) that addresses the interference caused by reusing T2G links while guaranteeing user communication quality. First, with system throughput as the optimization objective and each T2T transmitter acting as an agent, the policy network uses a hierarchical output structure to guide the agent in selecting the spectrum resource to reuse and the power level. The agent then takes the corresponding action and interacts with the T2T communication environment to obtain the throughput of the T2G users and of the T2T users in that time slot. The value network evaluates the two separately, and a weight factor β is used to customize a weighted Temporal Difference (TD) error for each agent, so that the neural network parameters can be optimized flexibly. Finally, the agents jointly select the best spectrum resources and power levels according to the trained model. Simulation results show that, compared with the A2C and Deep Q-Network (DQN) algorithms, the proposed algorithm achieves clear improvements in convergence speed, T2T successful access rate, and throughput.
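The abstract says that the T2G and T2T throughputs are evaluated separately and that a weight factor β shapes a per-agent weighted TD error, but it does not give the formula. The sketch below is only one plausible reading, assuming a convex combination of two one-step TD errors. The function names, the discount factor `gamma`, and all numeric values are hypothetical and are not taken from the paper.

```python
def td_error(reward, value, next_value, gamma=0.9):
    """One-step temporal-difference error: r + gamma * V(s') - V(s)."""
    return reward + gamma * next_value - value


def weighted_td_error(td_t2g, td_t2t, beta):
    """Blend the T2G and T2T TD errors with an agent-specific weight.

    Assumption: beta lies in [0, 1] and the blend is a convex combination.
    The paper's actual weighting rule may differ.
    """
    if not 0.0 <= beta <= 1.0:
        raise ValueError("beta must lie in [0, 1]")
    return beta * td_t2g + (1.0 - beta) * td_t2t


# Hypothetical values for one agent in one time slot.
delta_t2g = td_error(reward=1.0, value=0.5, next_value=0.5)  # T2G throughput critic
delta_t2t = td_error(reward=2.0, value=1.0, next_value=1.0)  # T2T throughput critic
delta = weighted_td_error(delta_t2g, delta_t2t, beta=0.5)
```

Under this assumption, a larger β makes an agent's update more sensitive to the T2G users it interferes with, while a smaller β favors its own T2T throughput.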
Authors WANG Ruifeng; ZHANG Ming; HUANG Ziheng; HE Tao (School of Automation and Electrical Engineering, Lanzhou Jiaotong University, Lanzhou 730070, China; Automatic Control Institute, Lanzhou Jiaotong University, Lanzhou 730070, China)
Source Journal of Electronics & Information Technology (EI, CAS, CSCD, Peking University Core Journal), 2024, No. 4, pp. 1306-1313 (8 pages)
Funding Joint Fund for Railway Basic Research of the National Natural Science Foundation of China (U2268206).
Keywords Urban rail transit; Resource allocation; Train-to-Train (T2T) communication; Multi-Agent Deep Reinforcement Learning (MADRL); Advantage Actor-Critic-ac (A2C-ac) algorithm