Apache Hadoop docker image
- Updated
Feb 1, 2024 - Shell
Apache Hadoop docker image
A Docker container with a full Hadoop cluster setup with Spark and Zeppelin
ansible playbook to deploy cloudera hadoop components to the cluster
HokStack - Run Hadoop Stack on Kubernetes
A fully-functional Hadoop Yarn cluster as docker-compose deployment.
Toy Hadoop cluster combining various SQL-on-Hadoop variants
Hadoop3.2 single/cluster mode with web terminal gotty, spark, jupyter pyspark, hive, eco etc.
Run Hadoop Cluster within Docker Containers
A storage reference to a comprehensive guide on installing Hadoop on Windows
This is a self-documentation of learning distributed data storage, parallel processing, and Linux OS using Apache Hadoop, Apache Spark and Raspbian OS. In this project, 3-node cluster will be setup using Raspberry Pi 4, install HDFS and run Spark processing jobs via YARN.
Hadoop in docker cluster, created by docker-compose. Create Hadoop cluster in less than 5mins.
Analyses the customer logs for bigdata components like HDFS, Hive, HBase, Yarn, MapReduce, Storm, Spark, Spark 2, Knox, Ambari Metrics, Nifi, Accumulo, Kafka, Flume, Oozie, Falcon, Atlas & Zookeeper.
BigData Cluster with Docker
Apache Hadoop Cluster Docker images
deploy bigdata platform on kubernetes
A repository for some scripts that can help in creating a distributed Big data ecosystem using the platform Grid5000.
This project create an Hadoop and Spark cluster on Amazon AWS with Terraform
Containerized Hadoop cluster with Spark, Hive, Pig, HBase, and Zookeeper for scalable Big Data processing using Docker.
Automated Installation CD for Hadoop Cluster
Add a description, image, and links to the hadoop-cluster topic page so that developers can more easily learn about it.
To associate your repository with the hadoop-cluster topic, visit your repo's landing page and select "manage topics."