Introduction to Hadoop (Big Data)

This presentation introduces the Hadoop Distributed File System (HDFS), an Apache open-source distributed file system designed to run on commodity hardware.

We'll cover:

- Origins of HDFS and the Google File System (GFS)
- How a file is split into blocks before being distributed across a cluster
- NameNode and DataNode basics
- Technical architecture of HDFS
- Sample HDFS commands
- Rack Awareness
- Synchronous write pipeline
- How a client reads a file
