Pavel's Blog: Finding The Median In Large Sets Of Numbers Split Across 1000 Servers
How would you find the median across a thousand servers holding a billion numbers each? The question invites a lot of discussion because it is quite vague and forces you to make some assumptions. Obviously we can't sort everything in memory, because a trillion numbers won't fit in the memory of a single machine. We may be able to fit one server's set at a time: assuming 32-bit integers, a billion numbers take about 4 GB of memory.
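One common way to approach the question, sketched here under stated assumptions and not necessarily the method the full article uses, is to binary search on the value itself: pick a candidate, ask every server how many of its numbers are less than or equal to it, and narrow the range until the candidate sits at the median rank. The sketch assumes integer values with known bounds `lo` and `hi`. The names `count_at_most` and `distributed_median` are hypothetical, and each "server" is simulated by a plain Python list in place of a real remote call.

```python
from typing import List, Sequence


def count_at_most(shard: Sequence[int], pivot: int) -> int:
    """Stand-in for a remote call: count the values on one server that are <= pivot."""
    return sum(1 for x in shard if x <= pivot)


def distributed_median(shards: List[Sequence[int]], lo: int, hi: int) -> int:
    """Return the lower median of all values across the shards.

    lo and hi are assumed inclusive bounds on the values,
    for example the 32-bit integer range.
    """
    total = sum(len(s) for s in shards)
    k = (total + 1) // 2  # 1-based rank of the lower median

    # Invariant: the answer is the smallest value v in [lo, hi]
    # such that at least k numbers are <= v.
    while lo < hi:
        mid = (lo + hi) // 2
        if sum(count_at_most(s, mid) for s in shards) >= k:
            hi = mid
        else:
            lo = mid + 1
    return lo


if __name__ == "__main__":
    # Three tiny "servers" standing in for the thousand real ones.
    servers = [[5, 1, 9], [3, 7], [2, 8, 4, 6]]
    print(distributed_median(servers, lo=0, hi=10))
```

Only one count per server crosses the network in each round, and for 32-bit values the search needs about 32 rounds, so no server ever has to ship or globally sort its billion numbers. For an even total this returns the lower of the two middle values; averaging the two would need a second search for rank `k + 1`.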