All About Programming: Introduction to Cassandra

Introduction to Cassandra

Massively scalable
Linearly scalable (If 2 nodes handle x traffic, then 4 nodes handle 2x!)
NoSQL Database
Failed nodes can be replaced with no downtime.
Integrates with Hadoop and has MapReduce support. Also supports Apache Pig and Hive.
Can store structured and unstructured data alike.

Cassandra is essentially a hybrid between a key-value and a column-oriented (or tabular) database.
Cassandra's column-family resembles a Table in RDBMS.
Column families contain rows and columns.
Each row has multiple columns, each of which has a name, value, and a timestamp.
Different rows in the same column family do not have to share the same set of columns.
Columns may be added to one or multiple rows at any time.
Each key in Cassandra identifies one row with multiple columns (like a Primary Key).