Why Data Science?

It cannot be denied that data is increasingly shaping the world we live in, from helping us pick the next TV series we may like to influencing the course of elections. To quote The Economist: "Data are to this century what oil was to the last one: a driver of growth and …

HDFS

When your data outgrows the capacity of a single machine, it becomes necessary to split it across separate machines. This need gave rise to distributed filesystems, and since these are network-based, all the complications of network programming kick in, such as tolerating node failure (when one of the systems supporting the distributed filesystem becomes …