1. Introduction
  2. Prerequistes
  3. Set Environment Variables
  4. Setup SSH daemon
  5. Download hadoop and place it in the home directory.
  6. Unpack hadoop
  7. Configure Hadoop
  8. Format the namenode
  9. Setup hadoop plugin
  10. Start the cluster
  11. Setup hadoop location
  12. Upload data
  13. Create and run a test project.
Bookmark and Share

Introduction

Hadoop is a powerful framework for automatic parallelization of computing tasks. Unfortunately programming for it poses certain challenges. It is really hard to understand and debug Hadoop programs. One way to make it a little easier is to have a simplified version of the Hadoop cluster that runs locally on the developer's machine. This tutorial describes how to set up such a cluster on a computer running Microsoft Windows. It also describes how to integrate this cluster with Eclipse, a prime Java development environment.

Organization of the tutorial.

Since Hadoop is a very complex environment, this tutorial has been broken down into several small steps. Each step involves setting up some aspect of the system and verifying that it has been set up correctly. For better understanding, each tutorial is accompanied by screenshots and a recorded video of the steps.

 

Questions, Suggestions Comments

Please leave questions, suggestions and comments about this tutorial here.

 

Continue

Bookmark and Share