Choice Set Misspecification in Reward Inference

By Rachel Freedman and others
Specifying reward functions for robots that operate in environments without a natural reward signal can be challenging, and incorrectly specified rewards can incentivise degenerate or dangerous behavior. A promising alternative to manually specifying reward functions is to enable robots to infer them from human feedback, like demonstrations or corrections. To...
January 19, 2021