SaaSHub helps you find the best software and product alternatives Learn more →
Top 23 Jupyter Notebook Data Science Projects
- Project mention: Microsoft's Open-Source ML Curriculum Is Best to Learn ML from Scratch | news.ycombinator.com | 2025-04-07
-
Stream
Stream - Scalable APIs for Chat, Feeds, Moderation, & Video. Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.
-
-
-
Probabilistic-Programming-and-Bayesian-Methods-for-Hackers
aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming with a computation/understanding-first, mathematics-second point of view. All in pure Python ;)
Link: GitHub A fun, practical guide to understanding Bayesian statistics through Python.
-
-
-
View on GitHub
-
InfluxDB
InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
-
python-machine-learning-book
The "Python Machine Learning (1st edition)" book code repository and info resource
-
-
-
amazon-sagemaker-examples
Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.
-
numerical-linear-algebra
Free online textbook of Jupyter notebooks for fast.ai Computational Linear Algebra course
-
tpot
A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.
-
-
-
H2O
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
View the Project on GitHub
-
evidently
Evidently is an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. From tabular data to Gen AI. 100+ metrics.
- Project mention: TabPFN-2.5 – SOTA foundation model for tabular data | news.ycombinator.com | 2025-11-06
-
machine_learning_complete
A comprehensive machine learning repository containing 30+ notebooks on different concepts, algorithms and techniques.
-
-
ML-foundations
Machine Learning Foundations: Linear Algebra, Calculus, Statistics & Computer Science
-
-
MachineLearningNotebooks
Python notebooks with ML and deep learning examples with Azure Machine Learning Python SDK | Microsoft
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Jupyter Notebook Data Science discussion
Jupyter Notebook Data Science related posts
-
Data Science for Beginners
-
Show HN: DeepShot – NBA game predictor with 70% accuracy using ML and stats
-
Show HN: DeepShot – NBA game predictor with 70% accuracy using ML and stats
-
Show HN: DeepShot – NBA game predictor with 70% accuracy using ML and stats
-
Show HN: DeepShot – NBA game predictor with 70% accuracy using ML and stats
-
Show HN: DeepShot – NBA game predictor with 70% accuracy using ML and stats
-
Show HN: DeepShot – NBA game predictor with 70% accuracy using ML and stats
- A note from our sponsor - SaaSHub www.saashub.com | 23 Dec 2025
Index
What are some of the best open-source Data Science projects in Jupyter Notebook? This list will help you:
| # | Project | Stars |
|---|---|---|
| 1 | ML-For-Beginners | 82,381 |
| 2 | Made-With-ML | 44,375 |
| 3 | Data-Science-For-Beginners | 31,644 |
| 4 | Probabilistic-Programming-and-Bayesian-Methods-for-Hackers | 28,357 |
| 5 | fastbook | 24,052 |
| 6 | machine-learning-for-trading | 15,993 |
| 7 | Virgilio | 14,299 |
| 8 | python-machine-learning-book | 12,546 |
| 9 | ML-Papers-of-the-Week | 12,156 |
| 10 | python-training | 10,920 |
| 11 | amazon-sagemaker-examples | 10,828 |
| 12 | numerical-linear-algebra | 10,620 |
| 13 | tpot | 10,032 |
| 14 | pycaret | 9,648 |
| 15 | tsfresh | 9,056 |
| 16 | H2O | 7,446 |
| 17 | evidently | 6,920 |
| 18 | TabPFN | 5,317 |
| 19 | machine_learning_complete | 4,971 |
| 20 | nlpaug | 4,637 |
| 21 | ML-foundations | 4,413 |
| 22 | probability | 4,397 |
| 23 | MachineLearningNotebooks | 4,337 |