Policy Evaluation

Topic: Reinforcement Learning (RL)

Evaluate Policy

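Policy evaluation computes the value function of a given policy: the expected discounted return an agent collects when it follows that policy.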

Prediction

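Prediction is the problem of computing the state-value function V(s) or the action-value function Q(s, a) for a fixed policy. The policy itself is not changed.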
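Computing V for a fixed policy can be sketched with iterative policy evaluation, which repeatedly applies the Bellman expectation backup until the values stop changing. This is a minimal sketch, not a definitive implementation: the function name `evaluate_policy`, the layout of `P`, `R`, and `policy`, and the two-state MDP are all assumptions made up for illustration.

```python
def evaluate_policy(P, R, policy, gamma=0.9, tol=1e-10):
    """Iterative policy evaluation for a small tabular MDP.

    Hypothetical input layout (an assumption of this sketch):
      P[s][a]      -> list of (probability, next_state) pairs
      R[s][a]      -> expected immediate reward for action a in state s
      policy[s][a] -> probability of choosing action a in state s
    """
    n_states = len(P)
    V = [0.0] * n_states
    while True:
        delta = 0.0
        for s in range(n_states):
            # Bellman expectation backup: average over actions and next states.
            v_new = sum(
                policy[s][a]
                * (R[s][a] + gamma * sum(p * V[s2] for p, s2 in P[s][a]))
                for a in range(len(P[s]))
            )
            delta = max(delta, abs(v_new - V[s]))
            V[s] = v_new  # in-place update
        if delta < tol:
            return V


# Illustrative 2-state, 2-action MDP: action 0 moves to state 0,
# action 1 moves to state 1.
P = [
    [[(1.0, 0)], [(1.0, 1)]],
    [[(1.0, 0)], [(1.0, 1)]],
]
R = [[0.0, 1.0], [0.0, 2.0]]
policy = [[0.0, 1.0], [0.0, 1.0]]  # always choose action 1

V = evaluate_policy(P, R, policy)
print(V)
```

Under this policy the agent ends up in state 1 and collects a reward of 2 at every step, so V(1) should approach 2 / (1 - 0.9) = 20 and V(0) should approach 1 + 0.9 * 20 = 19.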

Methods

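Three families of methods are used. Monte Carlo (MC) averages the returns observed over complete sampled episodes. Temporal-difference (TD) learning updates an estimate after every step by bootstrapping from the current estimate of the next state. Dynamic programming (DP) applies the Bellman expectation equation repeatedly and requires a known model of the environment's transitions and rewards.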
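The two sample-based methods can be contrasted on the same recorded episodes. The sketch below assumes each episode is a list of (state, reward) pairs, where the reward is the one received on leaving that state. The function names `mc_first_visit` and `td0`, the step size `alpha`, and the two toy episodes are illustrative assumptions.

```python
from collections import defaultdict


def mc_first_visit(episodes, gamma=1.0):
    """First-visit Monte Carlo: average the return that follows the
    first visit to each state in each episode."""
    totals, counts = defaultdict(float), defaultdict(int)
    for episode in episodes:
        G = 0.0
        first_return = {}
        # Walk backwards so G accumulates the discounted return.
        for state, reward in reversed(episode):
            G = reward + gamma * G
            first_return[state] = G  # earlier visits overwrite later ones
        for state, g in first_return.items():
            totals[state] += g
            counts[state] += 1
    return {s: totals[s] / counts[s] for s in totals}


def td0(episodes, gamma=1.0, alpha=0.1):
    """TD(0): after each step, move V(s) toward reward + gamma * V(next state)."""
    V = defaultdict(float)
    for episode in episodes:
        for t, (state, reward) in enumerate(episode):
            # The value after the final step of an episode is taken as 0.
            if t + 1 < len(episode):
                next_value = V[episode[t + 1][0]]
            else:
                next_value = 0.0
            V[state] += alpha * (reward + gamma * next_value - V[state])
    return dict(V)


# Two hypothetical episodes: A -> B -> end, and B -> end.
episodes = [[("A", 0.0), ("B", 1.0)], [("B", 0.0)]]

mc_values = mc_first_visit(episodes)
td_values = td0(episodes)
print(mc_values, td_values)
```

MC needs each episode to finish before it can compute a return, while TD(0) updates after every transition. With so little data and a small step size, the TD estimates stay close to their zero initialisation, whereas the MC estimates are exact sample averages.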

Control

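Control uses the evaluated value function to improve the policy, typically by making the policy greedy with respect to the current values. Alternating evaluation and improvement until the policy stops changing is called policy iteration.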
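One way to sketch improvement from evaluation is policy iteration on a small tabular MDP: evaluate the current deterministic policy, switch each state to an action with a strictly higher action value, and stop when nothing changes. The helper names, the input layout, and the two-state MDP below are assumptions for illustration only.

```python
def q_values(P, R, V, s, gamma):
    """Action values for state s under the value estimate V."""
    return [
        R[s][a] + gamma * sum(p * V[s2] for p, s2 in P[s][a])
        for a in range(len(P[s]))
    ]


def evaluate(P, R, policy, gamma, tol=1e-10):
    """Iterative evaluation of a deterministic policy (one action per state)."""
    V = [0.0] * len(P)
    while True:
        delta = 0.0
        for s in range(len(P)):
            v_new = q_values(P, R, V, s, gamma)[policy[s]]
            delta = max(delta, abs(v_new - V[s]))
            V[s] = v_new
        if delta < tol:
            return V


def policy_iteration(P, R, gamma=0.9):
    policy = [0] * len(P)  # start with action 0 everywhere
    while True:
        V = evaluate(P, R, policy, gamma)
        improved = []
        for s in range(len(P)):
            q = q_values(P, R, V, s, gamma)
            best = max(range(len(q)), key=q.__getitem__)
            # Switch only on a strict improvement, so ties cannot cause cycling.
            improved.append(best if q[best] > q[policy[s]] + 1e-9 else policy[s])
        if improved == policy:
            return policy, V
        policy = improved


# Hypothetical MDP: P[s][a] is a list of (probability, next_state) pairs,
# R[s][a] is the expected immediate reward.
P = [
    [[(1.0, 0)], [(1.0, 1)]],
    [[(1.0, 0)], [(1.0, 1)]],
]
R = [[0.0, 1.0], [0.0, 2.0]]

best_policy, best_V = policy_iteration(P, R)
print(best_policy, best_V)
```

In this toy problem, action 1 pays more in both states and leads to the better state, so the loop should settle on choosing action 1 everywhere after a single improvement step.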

Key Takeaways

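  1. Policy evaluation estimates the value function (V or Q) of a fixed policy
  2. MC, TD, and DP are the main evaluation methods
  3. Policy iteration alternates evaluation and improvement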
