Tag Archives: MapReduce

Learning the Ropes – Hadoop

Hadoop is a technology based on two ideas: 1. the Hadoop Distributed File System (HDFS) and 2. the MapReduce algorithm. HDFS (Java-based) provides scalable and reliable data storage. It was designed to span large clusters of commodity servers (meaning expensive … Continue reading

Posted in Big data analytics, Distributed Computing, Hadoop, MapReduce

The Journey to Hadoop

In 1965, Intel co-founder Gordon Moore noticed that the number of transistors per square inch on integrated circuits had doubled every year since their invention. This later became known as “Moore’s Law” (REF). A common corollary is that the frequency … Continue reading

Posted in Big data analytics, Data Science, Distributed Computing, Hadoop, Parallel Computing