← Back to Data Science

All Topics

Advertisement

Learn/Data Science/Machine Learning

Reward Modeling

Topic: RL

Advertisement

Learning Reward Functions

Infer rewards from demonstrations.

Inverse RL

Infer reward from expert. Max-margin IRL.

Learning from Feedback

Preference learning. Reward regression.

Applications

Imitation. Preference-based RL.

Key Takeaways

  1. Inverse RL
  2. Learn from preferences
  3. Reward estimation

Advertisement

Advertisement

Need More Practice?

Get personalized data science help from ChatWhole's AI-powered platform.

Get Expert Help →