Sign in

Discovering hierarchies using Imitation Learning from hierarchy aware policies

By Ameet Deshpande and others
Learning options that allow agents to exhibit temporally higher order behavior has proven to be useful in increasing exploration, reducing sample complexity and for various transfer scenarios. Deep Discovery of Options (DDO) is a generative algorithm that learns a hierarchical policy along with options directly from expert trajectories. We perform... Show more
August 5, 2019
=
0
Loading PDF…
Loading full text...
Similar articles
Loading recommendations...
=
0
x1
Discovering hierarchies using Imitation Learning from hierarchy aware policies
Click on play to start listening