Sign in

Sufficient Exploration for Convex Q-learning

By Fan Lu and others
In recent years there has been a collective research effort to find new formulations of reinforcement learning that are simultaneously more efficient and more amenable to analysis. This paper concerns one approach that builds on the linear programming (LP) formulation of optimal control of Manne. A primal version is called... Show more
October 17, 2022
=
0
Loading PDF…
Loading full text...
Similar articles
Loading recommendations...
=
0
x1
Sufficient Exploration for Convex Q-learning
Click on play to start listening