juniorQ-Learning
What is epsilon-greedy strategy?
Updated May 17, 2026
Short answer
A strategy balancing exploration and exploitation.
Deep explanation
With probability epsilon, a random action is chosen (exploration), otherwise the best-known action is selected (exploitation).
Real-world example
Used in recommendation systems to explore new content.
Common mistakes
- Setting epsilon too high or too low permanently.
Follow-up questions
- Why is exploration important?