Optimal Transport and Mean-field games
Last time we had an introductory look at OT, the definition and computation of the Wasserstein distance.
A few years ago, I wrote a solver for this variational formulation of mean-field games, as part of a project. This eventually led to a small C++ library as a toy project, which might or might not be usable.
Mean-Field Games
The dynamics are defined by where is a Brownian motion. Typically could be an -brownian motion with independent components.
where is the marginal distribution of the random variable . The optimality condition for this stochastic control problem where established by Lasry and Lions: they are known as the mean-field game equations. They are a system of coupled Fokker-Planck and Hamilton-Jacobi-Bellman equations: These are nonlinear equations due to the control cost term in the second, HJB partial differential equation. The first PDE is a linear, Fokker-Planck PDE. The optimal control is given in feedback form as . Numerical methods for these equations can involve a fair bit of finite-difference discretisation, Krylov subspace methods...
Here, we lay out a solution method for this particular case of MFG (some might say the control Hamiltonian has a rather specific form...) using optimal transport theory.
A variational approach
Define and similarly . The variational approach separates the probability distribution path as a problem variable, and directly optimizes over the feedback control function , which we recall should be drift coefficient of the underlying SDE. That formulation, reads The optimality conditions of this equation, should yield the HJB equation from before. There are several ways of solving this problem directly, by casting it as a nonlinear program: the transcription step will discretise the continuous space, (using a grid, triangulated mesh, PL manifold...), and discretise the integral and constraint in time. The obtained finite-dimensional program will be a traditional NLP, but with the structure of a discrete-time optimal control problem (DTOCP). Benamou and Carlier introduced an augmented Lagrangian method for this problem in [1].
[2] introduces a Lagrangian variational formulation of the mean-field game, which reads where the unknown is a measure over the space of trajectories , and denotes the marginal operator at time , which is the pushforward of the evaluation map .1 The important part is the kinetic energy term
Now, you may ask where the dynamics went. It's all very subtle here, but they're actually not that far. They're actually baked into the optimization objective over path distributions .
The result is that the above problem is an optimal transport problem! Granted, it has an uncountably infinite number of marginals, but we can still discretize it. We consider time steps (for convenience, with fixed step size ), and corresponding marginals .
Then, the discrete-time counterpart is a multimarginal optimal transport problem
- [1] Jean-David Benamou, Guillaume Carlier, Augmented Lagrangian Methods for Transport Optimization, Mean-Field Games and Degenerate PDEs. https://www.ceremade.dauphine.fr/~carlier/ALG2_Draft.pdf
- [2] Jean-David Benamou, Guillaume Carlier, Filippo Santambrogio, Variational Mean Field Games. https://hal.archives-ouvertes.fr/hal-01295299/document
- [3] Jean-David Benamou, Guillaume Carlier, Simone Marino, Luca Nenna, An entropy minimization approach to second-order variational mean-field games. https://hal.archives-ouvertes.fr/hal-01848370v4/document
Meaning takes a measure over the space of paths, and maps it to the marginal distribution of given .