Apache Hadoop docker image
- Updated
Feb 1, 2024 - Shell
Apache Hadoop docker image
Big Data essentials: Hadoop, MapReduce, Spark. Explore tutorials and demos in Jupyter notebooks—most are self-contained and live, ready to run with a click.
Jumbune, an open source BigData APM & Data Quality Management Platform for Data Clouds. Enterprise feature offering is available at http://jumbune.com. More details of open source offering are at,
A Docker container with a full Hadoop cluster setup with Spark and Zeppelin
ansible playbook to deploy cloudera hadoop components to the cluster
Dockerizing an Apache Spark Standalone Cluster
A System is designed to analyse BigData collect from Wifi probe
HokStack - Run Hadoop Stack on Kubernetes
A fully-functional Hadoop Yarn cluster as docker-compose deployment.
Apache Ignite Guide
Toy Hadoop cluster combining various SQL-on-Hadoop variants
Hadoop3.2 single/cluster mode with web terminal gotty, spark, jupyter pyspark, hive, eco etc.
Taller del Máster Profesional de Informática UGR. Curso de CloudComputing.
Run Hadoop Cluster within Docker Containers
Dockerfile for running Apache Knox (http://knox.apache.org/) in Docker
A storage reference to a comprehensive guide on installing Hadoop on Windows
Docker image builds for Hadoop sandbox.
Colelction of various clustering algorithms including K means, HAC, DBscan. Also includes Hadoop, MapReduce, implementation of K mean algorithm
This is a self-documentation of learning distributed data storage, parallel processing, and Linux OS using Apache Hadoop, Apache Spark and Raspbian OS. In this project, 3-node cluster will be setup using Raspberry Pi 4, install HDFS and run Spark processing jobs via YARN.
Hadoop in docker cluster, created by docker-compose. Create Hadoop cluster in less than 5mins.
Add a description, image, and links to the hadoop-cluster topic page so that developers can more easily learn about it.
To associate your repository with the hadoop-cluster topic, visit your repo's landing page and select "manage topics."