What is epsilon-greedy strategy?

Updated May 17, 2026

Short answer

A strategy balancing exploration and exploitation.

Deep explanation

With probability epsilon, a random action is chosen (exploration), otherwise the best-known action is selected (exploitation).

Real-world example

Used in recommendation systems to explore new content.

Common mistakes

  • Setting epsilon too high or too low permanently.

Follow-up questions

  • Why is exploration important?

More Q-Learning interview questions

View all →