Of Algebirds, Monoids, Monads, and other Bestiary for Large-Scale Data Analytics - Michael G. Noll
Have you ever asked yourself what monoids and monads are, and particularly why they seem to be so attractive in the field of large-scale data processing? Twitter recently open-sourced Algebird, a JVM library for working with such algebraic data structures. Algebird is already being used in Big Data tools such as Scalding and Summingbird, which means you can use Algebird as a mechanism to plug your own data structures (e.g. Bloom filters, HyperLogLog) directly into large-scale data processing platforms such as Hadoop and Storm. In this post I will show you how to get started with Algebird, introduce you to monoids and monads, and address the question of why you should be interested in them in the first place.