摘要
Car following (CF) models are an appealing research area because they fundamentally describe longitudinal interactions of vehicles on the road, and contribute significantly to an understanding of traffic flow. There is an emerging trend to use data-driven method to build CF models. One challenge to the data-driven CF models is their capability to achieve optimal longitudinal driven behavior because a lot of bad driving behaviors will be learnt from human drivers by the supervised learning manner. In this study, by utilizing the deep reinforcement learning (DRL) techniques trust region policy optimization (TRPO), a DRL based CF model for electric vehicle (EV) is built. The proposed CF model can learn optimal driving behavior by itself in simulation. The experiments on following standard driving cycle show that the DRL model outperforms the traditional CF model in terms of electricity consumption.
出处
《智能城市应用》
2019年第5期1-8,共8页
Smart City Application
基金
supported by national natural science foundation of China (61620106002 and 5170520). The authors acknowledge the help of Renzong Lian, who has helped us to perform traffic simulation using SUMO. The parameters of Roewe E50 is provided by SAIC Motor.