Learning Reward Functions
Infer a reward function from demonstrations or feedback instead of specifying it by hand.
Inverse RL
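Inverse RL (IRL) infers the reward function that an expert's demonstrations are assumed to optimize. Max-margin IRL picks a reward under which the expert's behavior scores higher than alternative policies by as large a margin as possible.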
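As a minimal sketch of the max-margin idea, assume the reward is linear in state features, r(s) = w · phi(s), and that each policy is summarized by its feature expectations. The function name, the hinge-loss subgradient solver, and the toy numbers below are illustrative assumptions, not a reference implementation of any specific published algorithm.

```python
def dot(a, b):
    """Inner product of two equal-length lists."""
    return sum(x * y for x, y in zip(a, b))


def max_margin_weights(mu_expert, mu_others, lr=0.1, reg=0.01, steps=500):
    """Find reward weights w so that w . mu_expert beats w . mu_other
    by a margin of about 1, using subgradient descent on
    (reg / 2) * ||w||^2 + sum_i max(0, 1 - w . (mu_expert - mu_i)).
    """
    w = [0.0] * len(mu_expert)
    for _ in range(steps):
        # Gradient of the L2 regularizer.
        grad = [reg * wi for wi in w]
        for mu in mu_others:
            diff = [e - o for e, o in zip(mu_expert, mu)]
            if dot(w, diff) < 1.0:
                # Margin violated: the hinge term contributes -diff.
                grad = [g - d for g, d in zip(grad, diff)]
        w = [wi - lr * g for wi, g in zip(w, grad)]
    return w


# Toy feature expectations (hypothetical values, two features).
mu_expert = [0.9, 0.1]
mu_others = [[0.2, 0.8], [0.5, 0.5], [0.4, 0.1]]

w = max_margin_weights(mu_expert, mu_others)
expert_score = dot(w, mu_expert)
other_scores = [dot(w, mu) for mu in mu_others]
```

Under the learned weights, expert_score should exceed every entry of other_scores. A full IRL loop would alternate this step with solving the RL problem for the current w and adding the resulting policy's feature expectations to mu_others.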
Learning from Feedback
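Preference learning fits a reward model to comparisons, such as which of two trajectories a person prefers. Reward regression fits a reward model directly to numeric reward labels or ratings.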
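One common way to turn pairwise preferences into a reward is a Bradley-Terry style model, where the probability that A is preferred to B is sigmoid(r(A) - r(B)). The sketch below assumes a linear reward over trajectory features and fits it by gradient ascent on the log-likelihood; the function name, hyperparameters, and toy data are hypothetical.

```python
import math


def dot(a, b):
    """Inner product of two equal-length lists."""
    return sum(x * y for x, y in zip(a, b))


def fit_preference_reward(pairs, dim, lr=0.5, steps=200):
    """Fit linear reward weights from (preferred, rejected) feature pairs
    by maximizing the mean of log sigmoid(w . (preferred - rejected)).
    """
    w = [0.0] * dim
    for _ in range(steps):
        grad = [0.0] * dim
        for preferred, rejected in pairs:
            diff = [p - r for p, r in zip(preferred, rejected)]
            # Model probability that the preferred item wins.
            p_correct = 1.0 / (1.0 + math.exp(-dot(w, diff)))
            # d/dw log sigmoid(w . diff) = (1 - p_correct) * diff
            grad = [g + (1.0 - p_correct) * d for g, d in zip(grad, diff)]
        w = [wi + lr * g / len(pairs) for wi, g in zip(w, grad)]
    return w


# Toy comparisons (hypothetical): each pair is (preferred, rejected).
pairs = [
    ([1.0, 0.0], [0.0, 1.0]),
    ([0.8, 0.3], [0.2, 0.6]),
    ([0.6, 0.1], [0.5, 0.9]),
]

w = fit_preference_reward(pairs, dim=2)
agreements = [dot(w, a) > dot(w, b) for a, b in pairs]
```

Each entry of agreements records whether the fitted reward ranks a pair the same way the labeler did. Reward regression would use the same linear model but minimize squared error against numeric labels instead of this pairwise likelihood.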
Applications
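Imitation: recover a reward from demonstrations, then optimize a policy against it. Preference-based RL: train a policy on a reward model learned from preference feedback.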
Key Takeaways
- Inverse RL recovers a reward from expert demonstrations
- Rewards can also be learned from preferences
- Reward estimation replaces hand-specified rewards