Here are 77 public repositories matching this topic...
Real Time Analytics and Data Pipelines based on Spark Streaming
Updated Oct 24, 2019 Scala Declarative text based tool for data analysts and engineers to extract, load, transform and orchestrate their data pipelines.
Updated Dec 24, 2025 Scala Hadoop splittable InputFormat for ROS. Process rosbag with Hadoop Spark and other HDFS compatible systems.
Updated Nov 13, 2020 Scala A set of connectors for Monix. 🔛
Updated Aug 12, 2024 Scala 全套大数据基础学习教程,包含最基础的centos、maven。大数据主要包含hdfs、mr、yarn、hbase、kafka、scala、sparkcore、sparkstreaming、sparksql。教程包含所有的源代码演示以及在线文档说明。
Updated Oct 4, 2022 Scala Updated Jan 3, 2019 Scala 全国大数据竞赛三等奖解决方案,省赛二等奖解决方案。一键安装大数据环境脚本,自动部署集群环境,包括zookeeper、hadoop、mysql、hive、spark以及一些基础环境。已通过实际服务器测试,效果极佳,仅需要输入密码等少量人为干预。解放安装部署配置所需人力。并添加若干scala案例,结合spark用以进行数据准备。
Updated Sep 26, 2024 Scala A light Kafka to HDFS/S3 ETL library based on Apache Spark
Updated Jun 29, 2017 Scala Components for building stream loaders from Kafka to arbitrary storages
Updated Nov 4, 2025 Scala WASP is a framework to build complex real time big data applications. It relies on a kind of Kappa/Lambda architecture mainly leveraging Kafka and Spark. If you need to ingest huge amount of heterogeneous data and analyze them through complex pipelines, this is the framework for you.
Updated Oct 28, 2025 Scala Bucketing and partitioning system for Parquet
Updated May 22, 2018 Scala FITS data source for Spark SQL and DataFrames
Updated Apr 12, 2023 Scala Updated Feb 7, 2023 Scala Updated Jul 19, 2023 Scala An sbt plugin for publishing artifacts to HDFS.
Updated Nov 29, 2017 Scala A bunch of low-level basic methods for data processing and monitoring with Scala Spark
Updated Jun 29, 2018 Scala Updated Jan 11, 2017 Scala Change data capture realization using Spark and Sqoop
Updated Sep 7, 2018 Scala Spark, Spark Streaming and Kafka Streaming examples
Updated Dec 23, 2025 Scala Updated Oct 15, 2023 Scala Improve this page Add a description, image, and links to the hdfs topic page so that developers can more easily learn about it.
Curate this topic
Add this topic to your repo To associate your repository with the hdfs topic, visit your repo's landing page and select "manage topics."
Learn more
You can’t perform that action at this time.