All About Programming: Redis new data structure: the HyperLogLog

Redis new data structure: the HyperLogLog - Antirez weblog

Long story short this is what HyperLogLog does: it hashes every new element you observe. Part of the hash is used to index a register (the coin+paper pair, in our previous example. Basically we are splitting the original set into m subsets). The other part of the hash is used to count the longest run of leading zeroes in the hash (our run of heads). The probability of a run of N+1 zeroes is half the probability of a run of length N, so observing the value of the different registers, that are set to the maximum run of zeroes observed so far for a given subset, HyperLogLog is able to provide a very good approximated cardinality.

Read full article from Redis new data structure: the HyperLogLog - Antirez weblog

Redis new data structure: the HyperLogLog - Antirez weblog

No comments:

Post a Comment

Labels

Popular Posts