Entropy Regularization
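Entropy regularization adds the entropy of the policy, weighted by a temperature coefficient, to the reinforcement learning objective. The agent is then rewarded for keeping its action distribution spread out as well as for collecting reward.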
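As a rough illustration of adding entropy to an objective, the sketch below computes the Shannon entropy of a discrete action distribution and adds it to an expected reward. The function names, the coefficient `alpha`, and the example distributions are assumptions made for this illustration, not part of any particular library.

```python
import numpy as np

def entropy(probs, eps=1e-12):
    """Shannon entropy (in nats) of a categorical distribution."""
    probs = np.asarray(probs, dtype=float)
    # eps guards against log(0) for actions with zero probability
    return float(-np.sum(probs * np.log(probs + eps)))

def regularized_objective(expected_reward, probs, alpha=0.01):
    """Expected reward plus alpha times the policy entropy (hypothetical helper)."""
    return expected_reward + alpha * entropy(probs)

# Two hypothetical policies over four actions with the same expected reward
uniform = [0.25, 0.25, 0.25, 0.25]
peaked = [0.97, 0.01, 0.01, 0.01]

print(regularized_objective(1.0, uniform, alpha=0.1))
print(regularized_objective(1.0, peaked, alpha=0.1))
```

With equal expected reward, the uniform policy scores higher because its entropy is larger. A larger `alpha` strengthens that preference.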
Benefits
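Exploration: the entropy bonus discourages the policy from collapsing onto a single action too early, so the agent keeps trying alternatives. Policy improvement: a policy that stays stochastic can keep improving instead of settling prematurely on a suboptimal deterministic choice.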
Soft Actor-Critic
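Soft Actor-Critic (SAC) is an off-policy actor-critic algorithm built on entropy-regularized RL: its policy is trained to maximize expected return together with policy entropy.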
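One place where the entropy term appears in SAC-style methods is the critic's bootstrap target, which uses a "soft" state value. The sketch below assumes a discrete action space so the expectation can be written as a sum. The function names and the default values of `alpha` and `gamma` are assumptions for illustration, and a full SAC implementation involves more than this (for example, twin critics and target networks).

```python
import numpy as np

def soft_state_value(q_values, probs, alpha, eps=1e-12):
    """Soft value: sum over actions of pi(a|s) * (Q(s,a) - alpha * log pi(a|s))."""
    q_values = np.asarray(q_values, dtype=float)
    probs = np.asarray(probs, dtype=float)
    return float(np.sum(probs * (q_values - alpha * np.log(probs + eps))))

def soft_td_target(reward, done, next_q_values, next_probs, alpha=0.2, gamma=0.99):
    """One-step target: r + gamma * (1 - done) * soft value of the next state."""
    next_v = soft_state_value(next_q_values, next_probs, alpha)
    return reward + gamma * (1.0 - float(done)) * next_v

# Hypothetical next-state Q-values and policy probabilities for two actions
print(soft_td_target(1.0, False, [1.0, 1.0], [0.5, 0.5], alpha=0.2))
```

Setting `alpha` to zero recovers the ordinary expected value of Q under the policy, which is one way to see the entropy term as an add-on to the standard target.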
Maximum Entropy
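The maximum entropy formulation maximizes the expected sum of rewards plus the policy's entropy, with a coefficient controlling the trade-off between the two.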
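Written out, a common form of this objective looks as follows. The notation is an assumption chosen for this sketch: $\alpha$ is the temperature coefficient, $\gamma$ the discount factor, and $\mathcal{H}$ the entropy of the policy's action distribution at a state.

```latex
J(\pi) = \mathbb{E}_{\tau \sim \pi}\left[ \sum_{t} \gamma^{t} \Big( r(s_t, a_t) + \alpha \, \mathcal{H}\big(\pi(\cdot \mid s_t)\big) \Big) \right],
\qquad
\mathcal{H}\big(\pi(\cdot \mid s)\big) = -\,\mathbb{E}_{a \sim \pi(\cdot \mid s)}\big[ \log \pi(a \mid s) \big]
```

With $\alpha = 0$ this reduces to the standard expected discounted return.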
Key Takeaways
- An entropy bonus encourages exploration
- SAC maximizes return together with policy entropy
- Keeping the policy stochastic supports continued policy improvement
Topic: RL