Autonomous Vehicle Platoons In Urban Road Networks:A Joint Distributed Reinforcement Learning and Model Predictive Control Approach

下载PDF

导出

摘要 In this paper, platoons of autonomous vehicles operating in urban road networks are considered. From a methodological point of view, the problem of interest consists of formally characterizing vehicle state trajectory tubes by means of routing decisions complying with traffic congestion criteria. To this end, a novel distributed control architecture is conceived by taking advantage of two methodologies: deep reinforcement learning and model predictive control. On one hand, the routing decisions are obtained by using a distributed reinforcement learning algorithm that exploits available traffic data at each road junction. On the other hand, a bank of model predictive controllers is in charge of computing the more adequate control action for each involved vehicle. Such tasks are here combined into a single framework:the deep reinforcement learning output(action) is translated into a set-point to be tracked by the model predictive controller;conversely, the current vehicle position, resulting from the application of the control move, is exploited by the deep reinforcement learning unit for improving its reliability. The main novelty of the proposed solution lies in its hybrid nature: on one hand it fully exploits deep reinforcement learning capabilities for decisionmaking purposes;on the other hand, time-varying hard constraints are always satisfied during the dynamical platoon evolution imposed by the computed routing decisions. To efficiently evaluate the performance of the proposed control architecture, a co-design procedure, involving the SUMO and MATLAB platforms, is implemented so that complex operating environments can be used, and the information coming from road maps(links,junctions, obstacles, semaphores, etc.) and vehicle state trajectories can be shared and exchanged. Finally by considering as operating scenario a real entire city block and a platoon of eleven vehicles described by double-integrator models, several simulations have been performed with the aim to put in light the main f eatures of the proposed approach. Moreover, it is important to underline that in different operating scenarios the proposed reinforcement learning scheme is capable of significantly reducing traffic congestion phenomena when compared with well-reputed competitors.

作者 Luigi D’Alfonso Francesco Giannini Giuseppe Franzè Giuseppe Fedele Francesco Pupo Giancarlo Fortino

机构地区 IEEE the Department of Computer Engineering the Department of Mechanical Engineering

出处《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2024年第1期141-156,共16页 自动化学报（英文版）

关键词 Distributed model predictive control distributed reinforcement learning routing decisions urban road networks

分类号 U463.6 [机械工程—车辆工程] TP273 [自动化与计算机技术—检测技术与自动化装置] TP181 [自动化与计算机技术—控制理论与控制工程]

引文网络
相关文献

参考文献5

1Yifang Ma,Zhenyu Wang,Hong Yang,Lin Yang.Artificial Intelligence Applications in the Development of Autonomous Vehicles:A Survey[J].IEEE/CAA Journal of Automatica Sinica,2020,7(2):315-329. 被引量：23
2Yingxu Wang,Ming Hou,Konstantinos NPlataniotis,Sam Kwong,Henry Leung,Edward Tunstel,Imre JRudas,Ljiljana Trajkovic.Towards a Theoretical Framework of Autonomous Systems Underpinned by Intelligence and Systems Sciences[J].IEEE/CAA Journal of Automatica Sinica,2021,8(1):52-63. 被引量：2
3Liang Qi,Mengchu Zhou,Wenjing Luan.A Dynamic Road Incident Information Delivery Strategy to Reduce Urban Traffic Congestion[J].IEEE/CAA Journal of Automatica Sinica,2018,5(5):934-945. 被引量：4
4Ramij Raja Hossain,Ratnesh Kumar.Machine Learning Accelerated Real-Time Model Predictive Control for Power Systems[J].IEEE/CAA Journal of Automatica Sinica,2023,10(4):916-930. 被引量：1
5Bao-Lin Ye,Weimin Wu,Keyu Ruan,Lingxi Li,Tehuan Chen,Huimin Gao,Yaobin Chen.A Survey of Model Predictive Control Methods for Traffic Signal Control[J].IEEE/CAA Journal of Automatica Sinica,2019,6(3):623-640. 被引量：10

二级参考文献5

1席裕庚,李德伟,林姝.模型预测控制--现状与挑战[J].自动化学报,2013,39(3):222-236. 被引量：458
2Naiqi Wu,Zhiwu Li,Kamel Barkaoui,Xiaoou Li,Tadahiko Murata,MengChu Zhou.IoT-based Smart and Complex Systems：A Guest Editorial Report[J].IEEE/CAA Journal of Automatica Sinica,2018,5(1):69-73. 被引量：4
3CHEN Xue-mei,JIN Min,MIAO Yi-song,ZHANG Qiang.Driving decision-making analysis of car-following for autonomous vehicle under complex urban environment[J].Journal of Central South University,2017,24(6):1476-1482. 被引量：2
4Yifang Ma,Zhenyu Wang,Hong Yang,Lin Yang.Artificial Intelligence Applications in the Development of Autonomous Vehicles:A Survey[J].IEEE/CAA Journal of Automatica Sinica,2020,7(2):315-329. 被引量：23
5Long Chen,Xuemin Hu,Wei Tian,Hong Wang,Dongpu Cao,Fei-Yue Wang.Parallel Planning:A New Motion Planning Framework for Autonomous Driving[J].IEEE/CAA Journal of Automatica Sinica,2019,6(1):236-246. 被引量：18

共引文献34

1Zhe Chen,Jing Zhang,Dacheng Tao.Progressive LiDAR Adaptation for Road Detection[J].IEEE/CAA Journal of Automatica Sinica,2019,6(3):693-702. 被引量：11
2陆杰,陶菲,闫金伟,张帅倩,林霜.基于实时道路信息的个性化绕行指引方法研究[J].物流工程与管理,2020,42(9):122-125.
3Di Wu,Xin Luo.Robust Latent Factor Analysis for Precise Representation of High-Dimensional and Sparse Data[J].IEEE/CAA Journal of Automatica Sinica,2021,8(4):796-805. 被引量：5
4Yingxu Wang,Ming Hou,Konstantinos NPlataniotis,Sam Kwong,Henry Leung,Edward Tunstel,Imre JRudas,Ljiljana Trajkovic.Towards a Theoretical Framework of Autonomous Systems Underpinned by Intelligence and Systems Sciences[J].IEEE/CAA Journal of Automatica Sinica,2021,8(1):52-63. 被引量：2
5郑永胜,田盎然,尹鹏,范韬,刘浩宇,居俊,唐强.复杂环境下超宽深大基坑设计与施工技术分析——以X352县道改扩建工程项目为例[J].盐城工学院学报（自然科学版）,2021,34(1):60-65. 被引量：4
6陈虹宇,艾红,王晓,吕宜生,陈圆圆,王飞跃.社会交通中的社会信号分析与感知[J].自动化学报,2021,47(6):1256-1272. 被引量：6
7Wonje Jang,Junhyuk Hyun,Jhonghyun An,Minho Cho,Euntai Kim.A Lane-Level Road Marking Map Using a Monocular Camera[J].IEEE/CAA Journal of Automatica Sinica,2022,9(1):187-204. 被引量：1
8Wenwei Yue,Changle Li,Guoqiang Mao,Nan Cheng,Di Zhou.Evolution of Road Traffic Congestion Control:A Survey from Perspective of Sensing,Communication,and Computation[J].China Communications,2021,18(12):151-177. 被引量：1
9梁晨晨,田金鹏,宋春林.基于雷达和摄像头传感器融合的辅助驾驶目标检测算法[J].信息技术与信息化,2021(12):5-9. 被引量：5
10孙浩凯,刘博,佟世继.基于双模通信的路权控制系统设计[J].物联网技术,2022,12(1):90-94. 被引量：2

1Songjiao Bi,Langtao Hu,Quanjin Liu,Jianlan Wu,Rui Yang,Lei Wu.Deep Reinforcement Learning for IRS-Assisted UAV Covert Communications[J].China Communications,2023,20(12):131-141. 被引量：1
2Haiwei Su,Weikang Wang,Run Shi,Hua Tang,Lijuan Sun,Lele Wang,Qinqin Liu,Tierui Zhang.Recent advances in quantum dot catalysts for hydrogen evolution:Synthesis,characterization,and photocatalytic application[J].Carbon Energy,2023,5(9):1-37. 被引量：4
3Andrzej S Tarnawski.Editor-in-Chief articles of choice and comments at the year-end of 2023[J].World Journal of Gastroenterology,2024,30(1):1-8.
4如‘瓷’多彩中青年陶瓷艺术家精品展[J].景德镇陶瓷,2023,51(5).
5Maryam Bukhari,Sadaf Yasmin,Sheneela Naz,Mehr Yahya Durrani,Mubashir Javaid,Jihoon Moon,Seungmin Rho.A Smart Heart Disease Diagnostic System Using Deep Vanilla LSTM[J].Computers, Materials & Continua,2023,77(10):1251-1279. 被引量：2
6He Meng,Xukun Yang,Liang Gao,Changjin Wang.Research on railway BIM platform framework based on homemade graphics engine[J].High-Speed Railway,2023,1(3):204-210.
7陈祖倩,肖正泮,魏春洁,臧彧伟,从心黎,王大勇.靶向程序性细胞死亡配体1的多肽抑制剂筛选及抗肿瘤活性评价[J].中国热带医学,2023,23(11):1134-1140. 被引量：1
8何德峰,冯阳辉,穆建彬.基于扰动预测的网联车鲁棒协同巡航预测控制[J].浙江工业大学学报,2024,52(1):43-51.
9Justus Shunza,Mary Akinyemi,Chika Yinka-Banjo.Application of quantum computing in discrete portfolio optimization[J].Journal of Management Science and Engineering,2023,8(4):453-464.
10Yixin Tang.AnimeNet: A Deep Learning Approach for Detecting Violence and Eroticism in Animated Content[J].Computers, Materials & Continua,2023,77(10):867-891.

IEEE/CAA Journal of Automatica Sinica

2024年第1期

浏览历史

内容加载中请稍等...

Autonomous Vehicle Platoons In Urban Road Networks:A Joint Distributed Reinforcement Learning and Model Predictive Control Approach

参考文献5

二级参考文献5

共引文献34

相关作者

相关机构

相关主题

浏览历史