Large State Spaces
Approximate value functions.
Linear
Features. Fourier basis. Tile coding.
Non-Linear
Neural networks. Deep Q-learning.
Challenges
Bootstrapping + off-policy + approximation.
Key Takeaways
- Function approximation for large spaces
- Linear and nonlinear approximation
- Deadly triad