Deploying Traffic Smoothing Cruise Controllers Learned from Trajectory Data

Abstract: 

Autonomous vehicle-based traffic smoothing con-trollers are often not transferred to real-world use due to challenges in calibrating many-agent traffic simulators. We show a pipeline to sidestep such calibration issues by collecting trajectory data and learning controllers directly from trajectory data that are then deployed zero-shot onto the highway. We construct a dataset of 772.3 kilometers of recorded drives on the I–24. We then construct a simple simulator using the recorded drives as the lead vehicle in front of a simulated platoon consisting of one autonomous vehicle and five human followers. Using policy-gradient methods with an asymmetric critic to learn the controller, we show that we are able to improve average MPG by 11% in simulation on congested trajectories. We deploy this controller to a mixed platoon of 4 autonomous Toyota RAV-4's and 7 human drivers in a validation experiment and demonstrate that the expected time-gap of the controller is maintained in the real world test. Finally, we release the driving dataset [1], the simulator, and the trained controller at https://github.com/nathanlct/trajectory-training-icra.

Author: 
Lichtle, Nathan
Vinitsky, Eugene
Nice, Matthew
Seibold, Benjamin
Work, Dan
Bayen, Alexandre M.
Publication date: 
May 1, 2022
Publication type: 
Conference Paper
Citation: 
Lichtlé, N., Vinitsky, E., Nice, M., Seibold, B., Work, D., & Bayen, A. M. (2022). Deploying Traffic Smoothing Cruise Controllers Learned from Trajectory Data. 2022 International Conference on Robotics and Automation (ICRA), 2884–2890. https://doi.org/10.1109/ICRA46639.2022.9811912