Here are 7 public repositories matching this topic...
PDF DataSource for Apache Spark, allow to read PDF files directly to the DataFrame and ocr it
Updated Apr 27, 2025 Scala Custom PySpark Data Sources
Updated Oct 3, 2025 Python Google BigQuery data source for Apache Spark
Updated Oct 1, 2024 Scala PySpark custom data source for Hugging Face Datasets
Updated Aug 12, 2025 Python Allows reading ROOT TTrees into Apache Spark as DataFrames
Updated Jun 14, 2023 Java Contains the code and examples for my article on Medium, which explains how to create a custom JDBC read-only data source in Apache Spark 3
Updated Oct 28, 2024 Scala Scala/Spark Netcdf for reading Netcdf files
Updated Jul 14, 2025 Scala Improve this page Add a description, image, and links to the spark-datasource topic page so that developers can more easily learn about it.
Curate this topic
Add this topic to your repo To associate your repository with the spark-datasource topic, visit your repo's landing page and select "manage topics."
Learn more
You can’t perform that action at this time.