Apache Hama is an Apache Top-Level open source project, allowing you to do advanced analytics beyond MapReduce. Many data analysis techniques such as machine learning and graph algorithms require iterative computations, this is where Bulk Synchronous Parallel model can be more effective than "plain" MapReduce. Therefore To run such iterative data analysis applications more efficiently, Hama offers pure Bulk Synchronous Parallel computing engine. Hama can make jobs that take hours in Hadoop run in minutes. For details, see the Performance Benchmarks |
Read full article from Hama - a general BSP framework on top of Hadoop
No comments:
Post a Comment