TD Learning Fundamentals
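Temporal-difference (TD) learning bootstraps: it updates a value estimate from other value estimates instead of waiting for the complete return.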
TD(0)
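One-step bootstrap. After each transition (s, r, s'), move V(s) towards the TD target r + γV(s') by a step size α: V(s) ← V(s) + α[r + γV(s') − V(s)].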
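A minimal sketch of the tabular TD(0) update, assuming the value function is stored in a plain dict. The function name td0_update, the done flag, and the example states and numbers are illustrative assumptions, not part of the original notes.

```python
def td0_update(V, s, r, s_next, alpha=0.1, gamma=0.9, done=False):
    """Apply one tabular TD(0) update to V in place and return the TD error.

    V is a dict mapping state -> estimated value (an assumed representation).
    """
    # TD target: observed reward plus discounted estimate of the next state.
    # A terminal next state contributes no future value.
    target = r + (0.0 if done else gamma * V[s_next])
    delta = target - V[s]      # TD error
    V[s] += alpha * delta      # move V(s) a fraction alpha towards the target
    return delta


# Hypothetical two-state example.
V = {"A": 0.0, "B": 1.0}
delta = td0_update(V, "A", r=0.5, s_next="B", alpha=0.1, gamma=0.9)
```

Here the target is 0.5 + 0.9 * 1.0 = 1.4, so V["A"] moves from 0.0 to 0.14.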
TD(λ)
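Multi-step returns. The target is a λ-weighted average of n-step returns, where the n-step return gets weight (1 − λ)λ^(n−1). Eligibility traces compute the same idea incrementally.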
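The weighting of multi-step returns can be sketched directly for a single finished episode. This is only an illustration under assumed conventions: rewards[k] is the reward received after step k, values[k] is the current estimate V(s_k), and values has one extra entry of 0.0 for the terminal state. The function names are hypothetical.

```python
def n_step_return(rewards, values, t, n, gamma):
    """n-step return from time t, truncated at the end of the episode."""
    T = len(rewards)
    end = min(t + n, T)
    # Discounted sum of the next n rewards (fewer if the episode ends first).
    g = sum(gamma ** (k - t) * rewards[k] for k in range(t, end))
    # Bootstrap from the value estimate of the state reached after n steps.
    return g + gamma ** (end - t) * values[end]


def lambda_return(rewards, values, t, gamma, lam):
    """Lambda-return: n-step returns weighted by (1 - lam) * lam**(n - 1)."""
    T = len(rewards)
    total = 0.0
    for n in range(1, T - t):
        total += (1 - lam) * lam ** (n - 1) * n_step_return(rewards, values, t, n, gamma)
    # All remaining weight goes to the full (Monte Carlo) return.
    total += lam ** (T - t - 1) * n_step_return(rewards, values, t, T - t, gamma)
    return total


# Hypothetical 3-step episode; the last entry of values is the terminal state.
rewards = [1.0, 1.0, 1.0]
values = [0.5, 0.5, 0.5, 0.0]
g_td = lambda_return(rewards, values, t=0, gamma=1.0, lam=0.0)   # one-step TD target
g_mc = lambda_return(rewards, values, t=0, gamma=1.0, lam=1.0)   # Monte Carlo return
g_mid = lambda_return(rewards, values, t=0, gamma=1.0, lam=0.5)
```

With λ = 0 the result is the one-step TD target (1.5 here), with λ = 1 it is the full return (3.0), and intermediate values of λ land in between.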
Eligibility Traces
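Backward view of TD(λ). Each state keeps a trace that is incremented when the state is visited and decays by γλ every step; each TD error then updates all states in proportion to their traces. The λ parameter sets how far back credit reaches: λ = 0 gives TD(0), λ = 1 approaches Monte Carlo.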
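A rough sketch of the backward view with accumulating traces, assuming a tabular value function in a dict and an episode given as a list of (s, r, s_next, done) tuples. The function name, the tuple layout, and the example episode are assumptions made for illustration; other trace variants (for example replacing traces) exist.

```python
def td_lambda_episode(V, transitions, alpha=0.1, gamma=0.9, lam=0.8):
    """Run tabular TD(lambda) with accumulating traces over one episode.

    V: dict state -> value, updated in place. Every non-terminal state that
    appears in transitions must already be a key of V.
    transitions: list of (s, r, s_next, done) tuples (assumed format).
    """
    e = {s: 0.0 for s in V}                 # eligibility trace per state
    for s, r, s_next, done in transitions:
        target = r + (0.0 if done else gamma * V[s_next])
        delta = target - V[s]               # one-step TD error
        e[s] += 1.0                         # mark the visited state as eligible
        for x in V:
            V[x] += alpha * delta * e[x]    # credit states by their trace
            e[x] *= gamma * lam             # decay all traces
    return V


# Hypothetical episode A -> B -> terminal, with reward 1 on the last step.
episode = [("A", 0.0, "B", False), ("B", 1.0, None, True)]
V_traces = td_lambda_episode({"A": 0.0, "B": 0.0}, episode, alpha=0.5, gamma=1.0, lam=1.0)
V_td0 = td_lambda_episode({"A": 0.0, "B": 0.0}, episode, alpha=0.5, gamma=1.0, lam=0.0)
```

With λ = 1 the final reward is credited to both A and B in a single pass, while with λ = 0 only B is updated, which is the TD(0) behaviour.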
Key Takeaways
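- TD learning bootstraps: it updates value estimates from other estimates rather than from complete returns
- TD(λ) uses eligibility traces to spread each TD error over recently visited states
- Multi-step returns interpolate between one-step TD (λ = 0) and Monte Carlo (λ = 1)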