Sign in

Offline Primal-Dual Reinforcement Learning for Linear MDPs

By Germano Gabbianelli and others
Offline Reinforcement Learning (RL) aims to learn a near-optimal policy from a fixed dataset of transitions collected by another policy. This problem has attracted a lot of attention recently, but most existing methods with strong theoretical guarantees are restricted to finite-horizon or tabular settings. In constrast, few algorithms for infinite-horizon... Show more
May 22, 2023
=
0
Loading PDF…
Loading full text...
Similar articles
Loading recommendations...
=
0
x1
Offline Primal-Dual Reinforcement Learning for Linear MDPs
Click on play to start listening