Category Archives: MapReduce

Learn the Ropes – Pig

Pig is a high-level platform for creating MapReduce programs that run on Hadoop, originally developed at Yahoo in 2006. It is a powerful tool for querying data in a Hadoop cluster, and it makes MapReduce jobs easier to write. Pig is handy … Continue reading

Posted in Big data analytics, Data Science, Distributed Computing, Hadoop, MapReduce

Learning the Ropes – Hadoop

Hadoop is a technology based on two ideas: (1) the Hadoop Distributed File System (HDFS) and (2) the MapReduce algorithm. HDFS (written in Java) provides scalable and reliable data storage. It was designed to span large clusters of commodity servers (meaning expensive … Continue reading

Posted in Big data analytics, Distributed Computing, Hadoop, MapReduce