Reinforcement Learning in Non-Markovian Environments

By Siddharth Chandak and others
Motivated by the novel paradigm developed by Van Roy and coauthors for reinforcement learning in arbitrary non-Markovian environments, we propose a related formulation and explicitly pin down the error caused by non-Markovianity of observations when the Q-learning algorithm is applied to this formulation. Based on this observation, we propose that...
February 13, 2024