Jumbune, an open source BigData APM & Data Quality Management Platform for Data Clouds. Enterprise feature offering is available at http://jumbune.com. More details of open source offering are at,
- Updated
Jan 1, 2023 - Java
Jumbune, an open source BigData APM & Data Quality Management Platform for Data Clouds. Enterprise feature offering is available at http://jumbune.com. More details of open source offering are at,
In this task, we had to calculate the average temperature for each year from the given dataset using Hadoop HDFS. We had to create a MapReduce function to perform this task.
The project deals on how to perform Spatio-temporal hot-spot analysis using Apache Spark.
Design, build, and execute effective big data strategies with advanced Hadoop concepts
Analytics on 22years of Flight Data- AWS/MapReduce
pagerank hadoop
MapReduce in Cluster.
📖 Apache Hadoop Based Clustering Tutorial
Implementation of Hadoop and Spark
Single-node Hadoop cluster to implement MapReduce K-means and compare complexity with non-parallel K-means algorithms
Assignments of Big Data course during the Spring 2017 semester at Sapienza
Simple inverted indexing algorithm implemented with Hadoop
Implemented the PageRank algorithm in Hadoop MapReduce framework and Spark.
Count the number of times a word occurs in 1GB (Big Data) Dataset of books using hadoop map-reduce
Running Map reduce jobs on Hadoop Cluster with customized parameter
A MapReduce program to compute the top 100 word pairs sorted in a decreasing order of relative frequency.
In this project, we used both Hadoop / MapReduce and Spark to do distributed computing. The first task was to perform a series of operations using a Mapper and Reduce java file that was implemented on a Hadoop server. The second task was to perform similar operations, but on Spark instead.
Add a description, image, and links to the hadoop-cluster topic page so that developers can more easily learn about it.
To associate your repository with the hadoop-cluster topic, visit your repo's landing page and select "manage topics."