mid | Hadoop
What is data locality in Hadoop?
Updated May 16, 2026
Short answer
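Data locality means moving the computation to the node where the data is stored, instead of moving the data across the network to the computation.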
Deep explanation
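HDFS splits files into blocks and stores replicas of each block on different DataNodes. When a job runs, the scheduler tries to place each task on a node that already holds the block the task will read. If no such node is free, it falls back to a node in the same rack, and only then to a node elsewhere in the cluster. Reading from local disk avoids shipping the block over the network, which reduces network traffic and usually shortens job run time.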
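The placement preference can be illustrated with a minimal sketch. It assumes a simplified model with three locality levels (node-local, rack-local, off-rack) and ignores everything else a real scheduler weighs, such as queue capacity and wait times. The names `Node`, `locality_level`, and `pick_node` are hypothetical and are not Hadoop APIs.

```python
from dataclasses import dataclass

# Hypothetical model for illustration only; not part of any Hadoop API.
@dataclass(frozen=True)
class Node:
    host: str
    rack: str

# Locality levels, ordered so that a lower value means closer to the data.
NODE_LOCAL, RACK_LOCAL, OFF_RACK = 0, 1, 2

def locality_level(candidate, replicas):
    """Classify how close `candidate` is to any replica of one block."""
    if any(candidate.host == r.host for r in replicas):
        return NODE_LOCAL   # candidate stores a replica itself
    if any(candidate.rack == r.rack for r in replicas):
        return RACK_LOCAL   # a replica sits on another node in the same rack
    return OFF_RACK         # the block must cross racks

def pick_node(free_nodes, replicas):
    """Return the free node with the best (lowest) locality level."""
    return min(free_nodes, key=lambda n: locality_level(n, replicas))

# Made-up cluster: one block with three replicas, three nodes with free slots.
replicas = [Node("dn1", "rackA"), Node("dn2", "rackA"), Node("dn5", "rackB")]
free_nodes = [Node("dn7", "rackC"), Node("dn3", "rackA"), Node("dn5", "rackB")]

chosen = pick_node(free_nodes, replicas)
print(chosen.host, locality_level(chosen, replicas))  # prints "dn5 0"
```

Here `dn5` wins because it stores a replica; if it were busy, `dn3` would be the rack-local fallback, and `dn7` would be used only as a last resort.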
Real-world example
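A job that parses web server logs runs each task on the same server that stores the log block it reads, so the block comes from local disk rather than over the network.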
Common mistakes
- Ignoring locality, so that tasks pull their input blocks from remote nodes and the network becomes the bottleneck.
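A rough back-of-the-envelope sketch shows why this matters. It assumes every input block is a full 128 MB (the default HDFS block size in Hadoop 2 and later) and that each non-local task transfers its whole block over the network. The function name `network_mb` and the block counts are hypothetical, chosen only for illustration.

```python
BLOCK_SIZE_MB = 128  # assumed block size; configurable per cluster

def network_mb(num_blocks, local_blocks, block_mb=BLOCK_SIZE_MB):
    """MB of input read over the network when `local_blocks` tasks run node-local."""
    remote_blocks = num_blocks - local_blocks
    return remote_blocks * block_mb

# Illustrative job with 1000 input blocks under three scheduling outcomes.
for local in (1000, 900, 0):
    print(local, "local tasks ->", network_mb(1000, local), "MB over the network")
```

Under these assumptions, losing locality on every task moves the entire input across the network, while keeping most tasks local cuts that traffic proportionally.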
Follow-up questions
- What are the types of data locality?
- Why is data locality important?