
Optimistic Planning by Regularized Dynamic Programming

By Antoine Moulin and Gergely Neu
We propose a new method for optimistic planning in infinite-horizon discounted Markov decision processes based on the idea of adding regularization to the updates of an otherwise standard approximate value iteration procedure. This technique allows us to avoid contraction and monotonicity arguments that are typically required by existing analyses of...
March 3, 2023