Simple example setup for Apache Hadoop on Docker.

simon-forb/docker-hadoop

Docker Hadoop Intro

Source: This repository is based on the overview page of the official Apache Hadoop Docker image.

Prerequisites: Docker and Docker Compose installed on your machine.

Installation

Clone or download this repository. Then, build and run the containers using Docker Compose:

docker compose up -d --build
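
To confirm that a container actually came up after this step, a small check along these lines may help. It is a minimal sketch run on the host: `container_running` is a hypothetical helper name, not part of this repository, and `namenode` is the container name used in the Usage section.

```shell
# Hypothetical helper: succeed only if the named container is currently running.
# Uses `docker inspect` with a Go template to read the container's running state.
container_running() {
  [ "$(docker inspect -f '{{.State.Running}}' "$1" 2>/dev/null)" = "true" ]
}

# Example call (container name taken from the Usage section):
# container_running namenode && echo "namenode is running"
```

If the check fails, `docker compose ps` and `docker compose logs` are the usual places to look for the cause.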

Usage

Log into a node (here, the namenode) via:

docker exec -it namenode bash

Web Interfaces

Access the namenode UI via: http://localhost:9870

Access the ResourceManager UI via: http://localhost:8088
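
The web interfaces may take a few seconds to respond after the containers start. The following is a minimal sketch of a polling check with `curl`, run on the host. `wait_for_ui` is a hypothetical helper name, and the retry count and two-second delay are arbitrary assumptions; the URLs are the ones listed above.

```shell
# Hypothetical helper: poll a URL until it responds or the attempts run out.
# Usage: wait_for_ui URL [ATTEMPTS]
wait_for_ui() {
  url="$1"
  attempts="${2:-30}"   # assumed default, adjust as needed
  i=1
  while [ "$i" -le "$attempts" ]; do
    # -f: fail on HTTP errors, -s: silent, -o /dev/null: discard the body
    if curl -fs -o /dev/null "$url"; then
      echo "up: $url"
      return 0
    fi
    sleep 2
    i=$((i + 1))
  done
  echo "not reachable: $url" >&2
  return 1
}

# Example calls using the ports listed above:
# wait_for_ui http://localhost:9870
# wait_for_ui http://localhost:8088
```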

Example MapReduce Job

To test the setup, run an example MapReduce job. First log into the namenode as described above, then run:

yarn jar share/hadoop/mapreduce/hadoop-mapreduce-examples-3.3.5.jar pi 10 15

Note: The version number in the path may vary based on the installed Hadoop version (3.3.5 in this case).
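
To avoid hard-coding the version, the jar can be located with a glob instead. This is a minimal sketch, assuming it is run inside the namenode container from the same working directory as the command above, and that exactly one examples jar sits in `share/hadoop/mapreduce`. `find_examples_jar` is a hypothetical helper name.

```shell
# Hypothetical helper: print the first MapReduce examples jar found in a directory.
# Usage: find_examples_jar [DIR]   (DIR defaults to the path used in this README)
find_examples_jar() {
  dir="${1:-share/hadoop/mapreduce}"
  for jar in "$dir"/hadoop-mapreduce-examples-*.jar; do
    # When the glob matches nothing, the loop sees the literal pattern; skip it.
    [ -e "$jar" ] || continue
    printf '%s\n' "$jar"
    return 0
  done
  echo "no examples jar found in $dir" >&2
  return 1
}

# Example call, equivalent to the command above but version-independent:
# yarn jar "$(find_examples_jar)" pi 10 15
```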

Shutdown Cluster

Shut down the cluster via:

docker compose down
