RL from Fixed Data
Learn from logged data.
Challenges
Distribution shift. Extrapolation error.
Methods
CQL. IQL. Conservative Q-learning.
Data
D4RL dataset. Batch RL.
Key Takeaways
- Offline RL from logged data
- Conservative Q-learning
- D4RL benchmarks
Topic: Offline RL
Advertisement
Learn from logged data.
Distribution shift. Extrapolation error.
CQL. IQL. Conservative Q-learning.
D4RL dataset. Batch RL.
Advertisement
Advertisement
Get personalized data science help from ChatWhole's AI-powered platform.
Get Expert Help →