Jupyter Notebook Data Science

Open-source Jupyter Notebook projects categorized as Data Science

Top 23 Jupyter Notebook Data Science Projects

  1. ML-For-Beginners

    12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all

    Project mention: Microsoft's Open-Source ML Curriculum Is Best to Learn ML from Scratch | news.ycombinator.com | 2025-04-07
  3. Made-With-ML

    Learn how to design, develop, deploy and iterate on production-grade ML applications.

  4. Data-Science-For-Beginners

    10 Weeks, 20 Lessons, Data Science for All!

    Project mention: Data Science for Beginners | news.ycombinator.com | 2025-11-15
  5. Probabilistic-Programming-and-Bayesian-Methods-for-Hackers

    aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming with a computation/understanding-first, mathematics-second point of view. All in pure Python ;)

    Project mention: 5 Free Books for Data Science & Machine Learning | dev.to | 2025-10-04

    A fun, practical guide to understanding Bayesian statistics through Python.

  6. fastbook

    The fastai book, published as Jupyter Notebooks

  7. machine-learning-for-trading

    Code for Machine Learning for Algorithmic Trading, 2nd edition.

  8. Virgilio

    Your new Mentor for Data Science E-Learning.

    Project mention: Top 5 GitHub Repositories for Data Science in 2026 | dev.to | 2025-09-20

  10. python-machine-learning-book

    The "Python Machine Learning (1st edition)" book code repository and info resource

  11. ML-Papers-of-the-Week

    🔥Highlighting the top ML papers every week.

    Project mention: ML Papers of the Week | news.ycombinator.com | 2025-02-11
  12. python-training

    Python training for business analysts and traders

  13. amazon-sagemaker-examples

    Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.

  14. numerical-linear-algebra

    Free online textbook of Jupyter notebooks for fast.ai Computational Linear Algebra course

  15. tpot

    A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.

  16. pycaret

    An open-source, low-code machine learning library in Python

  17. tsfresh

    Automatic extraction of relevant features from time series

  18. H2O

    H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.

    Project mention: H2O: Your New Best Friend for Scalable Machine Learning | dev.to | 2025-05-05

  19. evidently

    Evidently is an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. From tabular data to Gen AI. 100+ metrics.

  20. TabPFN

    ⚡ TabPFN: Foundation Model for Tabular Data ⚡

    Project mention: TabPFN-2.5 – SOTA foundation model for tabular data | news.ycombinator.com | 2025-11-06
  21. machine_learning_complete

    A comprehensive machine learning repository containing 30+ notebooks on different concepts, algorithms and techniques.

  22. nlpaug

    Data augmentation for NLP

  23. ML-foundations

    Machine Learning Foundations: Linear Algebra, Calculus, Statistics & Computer Science

  24. probability

    Probabilistic reasoning and statistical analysis in TensorFlow

  25. MachineLearningNotebooks

    Python notebooks with ML and deep learning examples with Azure Machine Learning Python SDK | Microsoft

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Jupyter Notebook Data Science related posts

  • Data Science for Beginners

    1 project | news.ycombinator.com | 15 Nov 2025
  • Show HN: DeepShot – NBA game predictor with 70% accuracy using ML and stats

    1 project | news.ycombinator.com | 8 Nov 2025
  • Show HN: DeepShot – NBA game predictor with 70% accuracy using ML and stats

    1 project | news.ycombinator.com | 7 Nov 2025
  • Show HN: DeepShot – NBA game predictor with 70% accuracy using ML and stats

    1 project | news.ycombinator.com | 6 Nov 2025
  • Show HN: DeepShot – NBA game predictor with 70% accuracy using ML and stats

    1 project | news.ycombinator.com | 5 Nov 2025
  • Show HN: DeepShot – NBA game predictor with 70% accuracy using ML and stats

    1 project | news.ycombinator.com | 3 Nov 2025
  • Show HN: DeepShot – NBA game predictor with 70% accuracy using ML and stats

    1 project | news.ycombinator.com | 31 Oct 2025

Index

What are some of the best open-source Data Science projects in Jupyter Notebook? This list will help you:

# Project Stars
1 ML-For-Beginners 82,381
2 Made-With-ML 44,375
3 Data-Science-For-Beginners 31,644
4 Probabilistic-Programming-and-Bayesian-Methods-for-Hackers 28,357
5 fastbook 24,052
6 machine-learning-for-trading 15,993
7 Virgilio 14,299
8 python-machine-learning-book 12,546
9 ML-Papers-of-the-Week 12,156
10 python-training 10,920
11 amazon-sagemaker-examples 10,828
12 numerical-linear-algebra 10,620
13 tpot 10,032
14 pycaret 9,648
15 tsfresh 9,056
16 H2O 7,446
17 evidently 6,920
18 TabPFN 5,317
19 machine_learning_complete 4,971
20 nlpaug 4,637
21 ML-foundations 4,413
22 probability 4,397
23 MachineLearningNotebooks 4,337
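
The page states that projects are ordered by GitHub star count. As a minimal sketch, assuming each index row is plain text in the form "rank project stars" with comma-grouped star counts, the rows could be parsed and the ordering checked as follows. The helper name `parse_index_row` is hypothetical and only the first three rows are reproduced here.

```python
# Hypothetical helper: split one "rank project stars" row into typed fields.
# Assumes project names contain no spaces, as in the index above.
def parse_index_row(row: str) -> tuple[int, str, int]:
    rank, name, stars = row.split()
    return int(rank), name, int(stars.replace(",", ""))


# First three rows copied from the index above.
rows = [
    "1 ML-For-Beginners 82,381",
    "2 Made-With-ML 44,375",
    "3 Data-Science-For-Beginners 31,644",
]

parsed = [parse_index_row(r) for r in rows]

# If the list is ordered by stars, counts never increase as rank increases.
is_sorted_by_stars = all(a[2] >= b[2] for a, b in zip(parsed, parsed[1:]))

print(parsed[0])
print(is_sorted_by_stars)
```

The same check can be run over all 23 rows by pasting them into `rows`.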
