What is data locality in Hadoop?

Updated May 16, 2026

Short answer

Executing computation near the data location.

Deep explanation

Reduces network traffic by running tasks where data blocks reside.

Real-world example

Processing logs on same server storing them.

Common mistakes

  • Ignoring locality leading to network bottlenecks.

Follow-up questions

  • Types of locality?
  • Why important?

More Hadoop interview questions

View all →