Evaluate Policy
Policy evaluation computes the value function of a given policy: the expected return obtained by following that policy.
Prediction
Prediction means computing the state-value function V or the action-value function Q for a fixed policy.
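For reference, V and Q can be written out as below. The notation is an assumption, not something the text above fixes: \pi is the policy, \gamma \in [0, 1) a discount factor, and R_{t+k+1} the reward received k+1 steps after time t.

```latex
V^{\pi}(s)    = \mathbb{E}_{\pi}\left[ \sum_{k=0}^{\infty} \gamma^{k} R_{t+k+1} \,\middle|\, S_t = s \right]
Q^{\pi}(s, a) = \mathbb{E}_{\pi}\left[ \sum_{k=0}^{\infty} \gamma^{k} R_{t+k+1} \,\middle|\, S_t = s,\ A_t = a \right]
V^{\pi}(s)    = \sum_{a} \pi(a \mid s)\, Q^{\pi}(s, a)
```

The last line links the two: the value of a state is the policy-weighted average of its action values.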
Methods
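Monte Carlo (MC): average sampled returns over complete episodes. Temporal difference (TD): update each estimate toward the reward plus the current estimate of the next state's value. Dynamic programming (DP): apply Bellman backups using a known model of the environment.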
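A minimal sketch of the three methods side by side, assuming a made-up 3-state chain in which the fixed policy has already been folded into the transition table. The names MODEL, GAMMA, dp_evaluate, mc_evaluate and td0_evaluate are hypothetical, chosen only for this illustration.

```python
import random

# Hypothetical chain under a fixed policy; state 2 is terminal.
# MODEL[s] lists (probability, next_state, reward) outcomes.
MODEL = {
    0: [(0.5, 0, 0.0), (0.5, 1, 0.0)],
    1: [(1.0, 2, 1.0)],
    2: [],
}
GAMMA = 0.9  # assumed discount factor


def dp_evaluate(model, gamma, tol=1e-10):
    """DP: sweep Bellman expectation backups until values stop changing."""
    v = {s: 0.0 for s in model}
    while True:
        delta = 0.0
        for s, outcomes in model.items():
            new_v = sum(p * (r + gamma * v[s2]) for p, s2, r in outcomes)
            delta = max(delta, abs(new_v - v[s]))
            v[s] = new_v
        if delta < tol:
            return v


def sample_step(model, s, rng):
    """Sample one (next_state, reward) transition from state s."""
    outcomes = model[s]
    weights = [p for p, _, _ in outcomes]
    _, s2, r = rng.choices(outcomes, weights=weights, k=1)[0]
    return s2, r


def mc_evaluate(model, gamma, episodes, rng, start=0):
    """Every-visit Monte Carlo: average the returns observed from each state."""
    totals = {s: 0.0 for s in model}
    counts = {s: 0 for s in model}
    for _ in range(episodes):
        s, trajectory = start, []
        while model[s]:  # run until a terminal state (no outcomes)
            s2, r = sample_step(model, s, rng)
            trajectory.append((s, r))
            s = s2
        g = 0.0
        for s, r in reversed(trajectory):  # accumulate returns backwards
            g = r + gamma * g
            totals[s] += g
            counts[s] += 1
    return {s: totals[s] / counts[s] if counts[s] else 0.0 for s in model}


def td0_evaluate(model, gamma, episodes, rng, alpha=0.05, start=0):
    """TD(0): move V(s) toward r + gamma * V(s') after every step."""
    v = {s: 0.0 for s in model}
    for _ in range(episodes):
        s = start
        while model[s]:
            s2, r = sample_step(model, s, rng)
            v[s] += alpha * (r + gamma * v[s2] - v[s])
            s = s2
    return v


if __name__ == "__main__":
    rng = random.Random(0)
    print("DP:", dp_evaluate(MODEL, GAMMA))
    print("MC:", mc_evaluate(MODEL, GAMMA, 5000, rng))
    print("TD:", td0_evaluate(MODEL, GAMMA, 5000, rng))
```

DP reads the transition table directly, while MC and TD only draw samples from it, so their estimates are noisy approximations of the DP result. The constant step size alpha in the TD sketch is a simplification; a decaying step size is the usual choice when exact convergence matters.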
Control
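Control improves the policy using the result of evaluation, typically by acting greedily with respect to the estimated values. Alternating evaluation and improvement is policy iteration.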
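The evaluate-then-improve loop can be sketched as follows, assuming a made-up 2-state MDP with a known model. The table P, the action names "stay" and "go", and all function names are hypothetical and serve only as an illustration.

```python
# Hypothetical MDP: P[s][a] lists (probability, next_state, reward) outcomes.
P = {
    0: {"stay": [(1.0, 0, 0.0)], "go": [(1.0, 1, 1.0)]},
    1: {"stay": [(1.0, 1, 2.0)], "go": [(1.0, 0, 0.0)]},
}
GAMMA = 0.9  # assumed discount factor


def q_value(P, v, s, a, gamma):
    """One-step lookahead: expected reward plus discounted next-state value."""
    return sum(p * (r + gamma * v[s2]) for p, s2, r in P[s][a])


def evaluate(P, policy, gamma, tol=1e-10):
    """Policy evaluation: iterate Bellman backups for a fixed policy."""
    v = {s: 0.0 for s in P}
    while True:
        delta = 0.0
        for s in P:
            new_v = q_value(P, v, s, policy[s], gamma)
            delta = max(delta, abs(new_v - v[s]))
            v[s] = new_v
        if delta < tol:
            return v


def improve(P, v, gamma):
    """Policy improvement: pick the greedy action in every state."""
    return {s: max(P[s], key=lambda a: q_value(P, v, s, a, gamma)) for s in P}


def policy_iteration(P, gamma):
    """Alternate evaluation and improvement until the policy stops changing."""
    policy = {s: next(iter(P[s])) for s in P}  # arbitrary initial policy
    while True:
        v = evaluate(P, policy, gamma)
        new_policy = improve(P, v, gamma)
        if new_policy == policy:
            return policy, v
        policy = new_policy


if __name__ == "__main__":
    policy, v = policy_iteration(P, GAMMA)
    print(policy, v)
```

On this toy problem the loop settles on "go" in state 0 and "stay" in state 1. The stopping test compares policies directly, which is safe here because no two actions tie in value; with ties, a consistent tie-breaking rule is needed to guarantee termination.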
Key Takeaways
- Value function estimation
- MC, TD, DP methods
- Policy iteration
Topic: RL