Learn Environment Model
Model-based RL: learn a model of the environment from experience, then use it to plan or to train a policy.
World Models
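Learn a dynamics model that predicts the next state from the current state and action. Dream to control: train the controller on rollouts imagined by the model rather than only on real environment steps.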
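Learning a dynamics model can be sketched as a regression from (state, action) to next state. The example below is a minimal illustration, not a World Models implementation: the point-mass environment `true_step`, the helper names, and the choice of a linear least-squares model are all assumptions made for the example. Practical systems typically use neural networks, often over learned latent states.

```python
import numpy as np

def true_step(state, action):
    # Hypothetical ground-truth environment: a 1-D point mass (position, velocity).
    dt = 0.1
    pos, vel = state
    return np.array([pos + dt * vel, vel + dt * action])

def collect_transitions(n, rng):
    # Sample random states and actions, and record what the environment does.
    states = rng.uniform(-1.0, 1.0, size=(n, 2))
    actions = rng.uniform(-1.0, 1.0, size=(n, 1))
    next_states = np.array([true_step(s, a[0]) for s, a in zip(states, actions)])
    return states, actions, next_states

def fit_linear_dynamics(states, actions, next_states):
    # Least-squares fit of next_state as a linear function of [state, action].
    inputs = np.hstack([states, actions])
    W, *_ = np.linalg.lstsq(inputs, next_states, rcond=None)
    return W

def predict(W, state, action):
    # One-step prediction from the learned model.
    return np.concatenate([state, [action]]) @ W

rng = np.random.default_rng(0)
S, A, S_next = collect_transitions(200, rng)
W = fit_linear_dynamics(S, A, S_next)
```

Because the toy environment is itself linear and noise-free, the fit here is essentially exact. With real data the model is only approximate, and its errors compound over long rollouts.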
Planning
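Model-predictive control (MPC): at each step, optimize an action sequence over a short horizon using the model, execute only the first action, then replan. Monte Carlo tree search (MCTS): grow a search tree from simulated rollouts and use its statistics to pick the next action.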
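One simple way to do MPC is random shooting: sample candidate action sequences, score each by rolling it out in the model, and execute the first action of the best one. The sketch below assumes a hypothetical point-mass model `model_step` standing in for a learned model, and a hand-picked quadratic cost. The horizon, candidate count, and function names are illustrative choices, not prescribed values.

```python
import numpy as np

def model_step(state, action):
    # Stand-in for a learned dynamics model (hypothetical 1-D point mass).
    dt = 0.1
    pos, vel = state
    return np.array([pos + dt * vel, vel + dt * action])

def cost(state, action):
    # Assumed objective: drive position to zero with small velocity and effort.
    pos, vel = state
    return pos**2 + 0.1 * vel**2 + 0.01 * action**2

def mpc_action(state, rng, horizon=15, n_candidates=200):
    # Random shooting: sample action sequences, roll each out in the model,
    # and return the first action of the lowest-cost sequence.
    sequences = rng.uniform(-1.0, 1.0, size=(n_candidates, horizon))
    best_cost, best_action = np.inf, 0.0
    for seq in sequences:
        s, total = state, 0.0
        for a in seq:
            total += cost(s, a)
            s = model_step(s, a)
        if total < best_cost:
            best_cost, best_action = total, seq[0]
    return best_action

rng = np.random.default_rng(0)
state = np.array([1.0, 0.0])
start_distance = abs(state[0])
for _ in range(50):
    # Replan at every step; the model doubles as the environment in this toy loop.
    state = model_step(state, mpc_action(state, rng))
final_distance = abs(state[0])
```

Replanning at every step is what lets MPC tolerate model error: only the first planned action is ever executed before the plan is recomputed from the newly observed state.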
Benefits
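Sample efficiency: fewer real environment interactions are needed, since the model supplies extra training data. Imaginary rollouts: trajectories simulated in the model, at no cost in real interactions.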
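The sample-efficiency argument can be made concrete: a handful of real states can seed many model-generated transitions. In the sketch below, `model_step` is a hypothetical stand-in for a learned model, `policy` is an assumed hand-written controller, and the rollout length is arbitrary. Whether the imagined data actually helps depends on how accurate the model is.

```python
import numpy as np

def model_step(state, action):
    # Stand-in for a learned dynamics model (hypothetical 1-D point mass).
    dt = 0.1
    pos, vel = state
    return np.array([pos + dt * vel, vel + dt * action])

def policy(state):
    # Assumed simple feedback controller, clipped to the action range.
    pos, vel = state
    return float(np.clip(-1.0 * pos - 0.5 * vel, -1.0, 1.0))

def imagine_rollouts(model, policy, start_states, horizon):
    # Branch a short imagined rollout from every real start state.
    transitions = []
    for s in start_states:
        for _ in range(horizon):
            a = policy(s)
            s_next = model(s, a)
            transitions.append((s, a, s_next))
            s = s_next
    return transitions

rng = np.random.default_rng(0)
real_states = rng.uniform(-1.0, 1.0, size=(10, 2))  # pretend these came from the real environment
imagined = imagine_rollouts(model_step, policy, real_states, horizon=5)
print(len(real_states), "real states ->", len(imagined), "imagined transitions")
# prints: 10 real states -> 50 imagined transitions
```

The imagined transitions can then be added to the training data for a policy or value function. Keeping the horizon short limits how far model errors can compound.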
Key Takeaways
- Learn a transition model from experience
- Plan with the model, e.g. MPC or MCTS
- Sample efficient: fewer real interactions needed