Friday, April 11, 2014

Introduction to Hadoop Ecosystem

Everyone knows data is getting bigger. We are stepping into the next generation of data analysis and storage. Apache Hadoop is an open-source framework that aims to solve two problems: storing huge volumes of data and processing large data sets at scale.

The name "Hadoop" comes from a toy elephant that belonged to the son of Doug Cutting, the project's founder. Doug used the name for his open-source project because it was easy to pronounce and easy to find on Google.

Hadoop MapReduce is a programming model for large-scale data processing.
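To make the programming model a little more concrete, here is a minimal sketch of the classic word-count example in plain Python. It does not use Hadoop or any Hadoop API; the function names (map_phase, shuffle, reduce_phase, word_count) are made up for illustration, and everything runs in a single process. It only shows the shape of the idea: a map step emits key/value pairs, the pairs are grouped by key, and a reduce step combines the values for each key.

```python
from collections import defaultdict


def map_phase(document):
    """Map step: emit a (word, 1) pair for every word in one input record."""
    for word in document.lower().split():
        yield word, 1


def shuffle(pairs):
    """Group values by key, standing in for the grouping done between map and reduce."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce step: combine all values seen for one key into a single result."""
    return key, sum(values)


def word_count(documents):
    """Run map, shuffle, and reduce in sequence over a list of text records."""
    pairs = (pair for doc in documents for pair in map_phase(doc))
    return dict(reduce_phase(key, values) for key, values in shuffle(pairs).items())


if __name__ == "__main__":
    print(word_count(["big data is big", "data is everywhere"]))
```

In a real cluster the map and reduce steps would run in parallel on many machines, with the framework handling the grouping and the movement of data between them. The sketch leaves all of that out on purpose.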

Apache Hadoop's MapReduce and HDFS (Hadoop Distributed File System) components were originally derived from Google's MapReduce and Google File System (GFS) papers, respectively.



