The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
Data cleaning, pre-processing, and analytics on a million movies using Spark and Scala.
Link prediction is the task of predicting future connections in a graph. In this project, it means predicting whether two authors will collaborate on a future paper, given the graph of authors who have co-authored at least one paper together.
This repository was created by Dharshan Kumar K S and Siva Prakash as part of our semester project for the 'Big Data Analysis' subject.
The goal of this project is to identify flood-prone areas and estimate the probability of flooding in counties on a future date, using Spark MLlib.
Titian: Data Provenance Support in Spark (VLDB 2016) / Adding Data Provenance Support to Apache Spark (VLDB Journal)
An assignment for the Cloud Computing and Big Data Ecosystems Design subject that aims to predict flight arrival time using Apache Spark and Scala.
Space filling curve library for Spark
Media Recommendations Using Big Data Analytics.
SANSA RDF Library
Build a large data-intensive application using real-world data to show interactive visualizations of the evolution of temperatures over time all over the world.
Which American cities are the best for tech jobs?
Performance of Aircraft in the US from 1987 to 2008.
The U.S. Department of Transportation's (DOT) Bureau of Transportation Statistics tracks the on-time performance of domestic flights operated by large air carriers. Summary information on the number of on-time, delayed, canceled, and diverted flights is published in DOT's monthly Air Travel Consumer Report and in this dataset of 2015 flight dela…
(Semester 4) Big Data Analytics - End Semester Project
Influencer detection on Twitter and real-time current trend analysis using twitter4j and Spark
Hotspot analysis on Big Data of a major taxi company using Apache Spark and Scala