hadoop-cluster

Here are 156 public repositories matching this topic...

big-data-europe / docker-hadoop

Apache Hadoop docker image

docker hadoop hadoop-cluster hadoop-docker docker-hadoop

Updated Feb 1, 2024
Shell

groda / big_data

Big Data essentials: Hadoop, MapReduce, Spark. Explore tutorials and demos in Jupyter notebooks—most are self-contained and live, ready to run with a click.

docker big-data spark apache-spark hadoop bigdata jupyter-notebook pyspark hadoop-cluster mapreduce gutenberg-ebooks hadoop-mapreduce spark-sql mrjob bigtop hadoop-hdfs testdfsio mapreduce-bash apache-sedona

Updated Nov 24, 2025
Jupyter Notebook

Impetus / jumbune

Jumbune, an open source BigData APM & Data Quality Management Platform for Data Clouds. Enterprise feature offering is available at http://jumbune.com. More details of open source offering are at,

yarn hadoop apm developer-tools data-analysis hadoop-cluster devops-tools data-quality optimization-framework cluster-monitoring monitoring-tool hadoop-monitor yarn-hadoop-cluster aiops hadoop-monitoring

Updated Jan 1, 2023
Java

Segence / docker-hadoop

A Docker container with a full Hadoop cluster setup with Spark and Zeppelin

docker spark hadoop hadoop-cluster zeppelin-notebook

Updated Feb 2, 2020
Shell

sergevs / ansible-cloudera-hadoop

Ansible playbook to deploy Cloudera Hadoop components to a cluster

kafka impala hbase hadoop-cluster oozie cloudera-hadoop

Updated Sep 8, 2018
Shell

Wittline / apache-spark-docker

Dockerizing an Apache Spark Standalone Cluster

docker apache-spark hive docker-compose pyspark hdfs hadoop-cluster hue hadoop-docker dataengineering hive-metastore dataengineer

Updated Jun 29, 2022
VBA

rainmaple / WIFI_BussinessBigDataAnalyseSystem

A system designed to analyse big data collected from Wi-Fi probes

spark realtime hbase hadoop-cluster echarts

Updated Dec 31, 2018
JavaScript

hokstack / hok-helm

HokStack - Run Hadoop Stack on Kubernetes

kubernetes automation hadoop bigdata dataops operator hadoop-cluster devops-tools hdp hadoop-hdfs

Updated May 10, 2020
Shell

hadoop-sandbox / hadoop-sandbox

A fully functional Hadoop YARN cluster as a docker-compose deployment.

docker hadoop docker-compose hadoop-cluster hadoop-hdfs hadoop-yarn

Updated Nov 30, 2025
Shell

mikeroyal / Apache-Ignite-Guide

Apache Ignite Guide

data-science streaming database hadoop nosql stream-processing nosql-databases hadoop-cluster nosql-data-storage ignite

Updated Oct 14, 2021

waltherg / distributable_docker_sql_on_hadoop

Toy Hadoop cluster combining various SQL-on-Hadoop variants

Updated Nov 16, 2017
Shell

hyeonsangjeon / dataplatform

Hadoop 3.2 in single-node or cluster mode with the gotty web terminal, Spark, Jupyter PySpark, Hive, and other ecosystem components.

hive hadoop hadoop-cluster hadoop-mapreduce hadoop-docker pyspark-notebook zeppelin-notebook hadoop-ecosystem

Updated Nov 7, 2019
Shell

manuparra / MasterDegreeCC_Practice

Workshop for the Professional Master's in Computer Science at UGR. Cloud Computing course.

docker practice hadoop docker-container virtual-machine cluster hdfs hadoop-cluster opennebula cloudcomputing docker-cluster

Updated May 6, 2019

lyingbo / hadoop-cluster-docker

Run Hadoop Cluster within Docker Containers

hadoop-cluster hadoop-docker hadoop-3-2-0

Updated Jan 19, 2020
Shell

pfisterer / apache-knox-docker

Dockerfile for running Apache Knox (http://knox.apache.org/) in Docker

dockerfile hadoop rest-api hadoop-cluster hadoop-ecosystem apache-knox gateway-server

Updated Mar 21, 2022
Dockerfile

Shwetabhdixit / Hadoop-2.7.3-Installation-Guide-for_windows

A reference repository for a comprehensive guide to installing Hadoop on Windows

hadoop-cluster hadoop-mapreduce hadoop-framework

Updated Jun 11, 2018
Shell

hadoop-sandbox / hadoop-sandbox-images

Docker image builds for Hadoop sandbox.

docker hadoop hdfs hadoop-cluster hadoop-hdfs hadoop-yarn

Updated Nov 29, 2025
Dockerfile

MitaliBhiwande / Clustering-Algorithms

Collection of various clustering algorithms, including K-means, HAC, and DBSCAN. Also includes a Hadoop MapReduce implementation of the K-means algorithm.

hadoop-cluster mapreduce kmeans-clustering hierarchical-clustering density-based-clustering

Updated Mar 4, 2018
Python

aimanamri / raspberry-pi4-hadoop-spark-cluster

Self-documentation of learning distributed data storage, parallel processing, and Linux using Apache Hadoop, Apache Spark, and Raspbian OS. In this project, a 3-node cluster is set up on Raspberry Pi 4 boards, HDFS is installed, and Spark processing jobs are run via YARN.

big-data yarn pyspark hdfs distributed-storage hadoop-cluster parallel-processing spark-shell spark-cluster raspberry-pi-4

Updated Jul 13, 2024
Shell

MengmSun / hadoop-in-docker

Hadoop cluster in Docker, created with docker-compose. Create a Hadoop cluster in less than 5 minutes.

docker hadoop docker-compose hdfs hadoop-cluster hadoop-docker hdfs-docker hdfs-cluster

Updated Apr 17, 2022
Shell
