ITS Berkeley

The I-24 Trajectory Dataset

Nice, Matthew
Lichtle, Nathan
Gumm, Gracie
Roman, Michael
Vinitsky, Eugene
Elmadani, Safwan
Bayen, Alexandre
2021

This dataset was created by recording CAN and GPS data from a single vehicle driving on I-24. The dataset includes values for Time, Velocity, Acceleration, Space Gap, Lateral Distance, Relative Velocity, Longitude GPS, Latitude GPS and more. This empirical dataset is useful for understanding/simulating real vehicle trajectories and vehicle controller performance.

Longitudinal Deep Truck: Deep Learning and Deep Reinforcement Learning for Modeling and Control of Longitudinal Dynamics of Heavy Duty Trucks

Albeaik, Saleh
Wu, Trevor
Vurimi, Ganeshnikhil
Lu, Xiao-Yun
Bayen, Alexandre
2021

Heavy duty truck mechanical configuration is often tailor designed and built for specific truck mission requirements. This renders the precise derivation of analytical dynamical models and controls for these trucks from first principles challenging, tedious, and often requires several theoretical and applied areas of expertise to carry through. This article investigates deep learning and deep reinforcement learning as truck-configuration-agnostic longitudinal modeling and control approaches for heavy duty trucks. The article outlines a process to develop and validate such models and...

Solving N-Player Dynamic Routing Games with Congestion: A Mean Field Approach

Cabannes, Theophile
Lauriere, Mathieu
Perolat, Julien
Marinier, Raphael
Girgin, Sertan
2021

The recent emergence of navigational tools has changed traffic patterns and has now enabled new types of congestion-aware routing control like dynamic road pricing. Using the fundamental diagram of traffic flows - applied in macroscopic and mesoscopic traffic modeling - the article introduces a new N-player dynamic routing game with explicit congestion dynamics. The model is well-posed and can reproduce heterogeneous departure times and congestion spill back phenomena. However, as Nash equilibrium computations are PPAD-complete, solving the game becomes intractable for large but realistic...

Inter-Level Cooperation in Hierarchical Reinforcement Learning

Kreidieh, Abdul Rahman
Berseth, Glen
Trabucco, Brandon
Parajuli, Samyak
Levine, Sergey
Bayen, Alexandre M.
2021

Hierarchies of temporally decoupled policies present a promising approach for enabling structured exploration in complex long-term planning problems. To fully achieve this approach an end-to-end training paradigm is needed. However, training these multi-level policies has had limited success due to challenges arising from interactions between the goal-assigning and goal-achieving levels within a hierarchy. In this article, we consider the policy optimization process as a multi-agent process. This allows us to draw on connections between communication and cooperation in multi-agent RL, and...

Multi-Adversarial Safety Analysis for Autonomous Vehicles

Bahati, Gilbert
Gibson, Marsalis
Bayen, Alexandre
2021

This work in progress considers reachability-based safety analysis in the domain of autonomous driving in multi-agent systems. We formulate the safety problem for a car following scenario as a differential game and study how different modelling strategies yield very different behaviors regardless of the validity of the strategies in other scenarios. Given the nature of real-life driving scenarios, we propose a modeling strategy in our formulation that accounts for subtle interactions between agents, and compare its Hamiltonian results to other baselines. Our formulation encourages...

Parallel Network Flow Allocation in Repeated Routing Games via LQR Optimal Control

Gibson, Marsalis
You, Yiling
Bayen, Alexandre
2021

In this article, we study the repeated routing game problem on a parallel network with affine latency functions on each edge. We cast the game setup in a LQR control theoretic framework, leveraging the Rosenthal potential formulation. We use control techniques to analyze the convergence of the game dynamics with specific cases that lend themselves to optimal control. We design proper dynamics parameters so that the conservation of flow is guaranteed. We provide an algorithmic solution for the general optimal control setup using a multiparametric quadratic programming approach (explicit MPC...

Learning Generalizable Multi-Lane Mixed-Autonomy Behaviors in Single Lane Representations of Traffic

Kreidieh, Abdul Rahman
Zhao, Yibo
Parajuli, Samyak
Bayen, Alexandre
2021

Reinforcement learning techniques can provide substantial insights into the desired behaviors of future autonomous driving systems. By optimizing for societal metrics of traffic such as increased throughput and reduced energy consumption, such methods can derive maneuvers that, if adopted by even a small portion of vehicles, may significantly improve the state of traffic for all vehicles involved. These methods, however, are hindered in practice by the difficulty of designing efficient and accurate models of traffic, as well as the challenges associated with optimizing for the behaviors of...

Boundary Control of Conservation Laws Exhibiting Shocks

Bayen, Alexandre
Monache, Maria Laura Delle
Garavello, Mauro
Goatin, Paola
Piccoli, Benedetto
2022

This chapter focuses on control of systems of conservation laws with boundary data. Problems with one or two boundaries are considered and, in particular, we focus on cases where shocks may be developed by the solution. However, for completeness we briefly discuss in Sect. 2.2 other existing results where singularities are prevented via suitable feedback controls such as in [32].

Boundary Control of Conservation Laws Exhibiting Shocks

Bayen, Alexandre
Monache, Maria Laura Delle
Garavello, Mauro
Goatin, Paola
Piccoli, Benedetto
2022

This chapter focuses on control of systems of conservation laws with boundary data. Problems with one or two boundaries are considered and, in particular, we focus on cases where shocks may be developed by the solution. However, for completeness we briefly discuss in Sect. 2.2 other existing results where singularities are prevented via suitable feedback controls such as in [32].