Python ML

Open-source Python projects categorized as ML

Top 23 Python ML Projects

  1. yolov5

    YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

    Project mention: Teaching AI to Read Emotions: Science, Challenges, and Innovation Behind Facial Emotion Detection with YOLOv11 on Raspberry Pi | dev.to | 2025-11-23

    Ultralytics YOLO Documentation

  2. InfluxDB

    InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.

    InfluxDB logo
  3. MLflow

    The open source developer platform to build AI agents and models with confidence. Enhance your AI applications with end-to-end tracking, observability, and evaluations, all in one integrated platform.

    Project mention: "As Cloud-like as Possible" Data Science: Local MLOps with Docker Compose | dev.to | 2025-11-24

    Experiment management: MLflow tracks models, parameters, and results.

  4. best-of-ml-python

    🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.

    Project mention: A ranked list of machine learning Python libraries. Updated weekly | news.ycombinator.com | 2025-01-31
  5. ludwig

    Low-code framework for building custom LLMs, neural networks, and other AI models

  6. metaflow

    Build, Manage and Deploy AI/ML Systems

    Project mention: Metaflow: Build, Manage and Deploy AI/ML Systems | news.ycombinator.com | 2025-07-16

    Stay tuned! We have some cool new features coming soon to support agentic workloads (teaser: https://github.com/Netflix/metaflow/pull/2473)

    If you are curious, join the Metaflow Slack at http://slack.outerbounds.co and start a thread on #ask-metaflow

  7. CoreML-Models

    Largest list of models for Core ML (for iOS 11+)

  8. openllmetry

    Open-source observability for your GenAI or LLM application, based on OpenTelemetry

    Project mention: AI: Introduction to Ollama for local LLM launch | dev.to | 2025-07-20

    For monitoring, there are separate full-fledged monitoring solutions like Opik, PostHog, Langfuse or OpenLLMetry, maybe will try some next time.

  9. Stream

    Stream - Scalable APIs for Chat, Feeds, Moderation, & Video. Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.

    Stream logo
  10. feast

    The Open Source Feature Store for AI/ML

    Project mention: Transforming Your PDFs for RAG with Open Source Using Docling, Milvus, and Feast | news.ycombinator.com | 2025-04-22

    Hey folks!

    I recently gave a talk with the Milvus Community showing a demo of how to transform PDFs with Feast using Docling for RAG.

    The tutorial is available here: https://github.com/feast-dev/feast/tree/master/examples/rag-...

    And the video is available here: https://www.youtube.com/watch?v=DPPtr9Q6_qE

    The goal with having a feature store transform and retrieve your data for RAG is that (1) we make it easy to configure vector retrieval with just a boolean in the code declaration and (2) you can use existing tooling that data scientists / ml engineers are already familiar with.

    I'd love any feedback or ideas on how we could make things better or easier. The Feast maintainers have quite a lot in the pipeline (batch transformations, support for Ray, computer vision and more).

    Thanks a ton!

  11. aim

    Aim 💫 — An easy-to-use & supercharged open-source experiment tracker.

    Project mention: Aim: Supercharged open-source experiment tracker | news.ycombinator.com | 2025-03-31
  12. superduper

    Superduper: End-to-end framework for building custom AI applications and agents.

  13. zenml

    ZenML 🙏: One AI Platform from Pipelines to Agents. https://zenml.io.

    Project mention: Accelerating ML Development with DevPods and ModelKits | dev.to | 2025-01-28

    Seamless integration: Works with OCI-compliant registries (e.g., Docker Hub and Jozu Hub) and integrates with popular tools like HuggingFace, ZenML, and Git.

  14. awesome-mlops

    :sunglasses: A curated list of awesome MLOps tools (by kelvins)

  15. Kiln

    Easily build AI systems with Evals, RAG, Agents, fine-tuning, synthetic data, and more.

    Project mention: DeepFabric – Generate High-Quality Synthetic Datasets at Scale | news.ycombinator.com | 2025-09-26
  16. deepchecks

    Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML validation needs, enabling to thoroughly test your data and models from research to production.

  17. zvt

    modular quant framework.

  18. polyaxon

    MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycle

  19. hub

    A library for transfer learning by reusing parts of TensorFlow models. (by tensorflow)

  20. RasaGPT

    💬 RasaGPT is the first headless LLM chatbot platform built on top of Rasa and Langchain. Built w/ Rasa, FastAPI, Langchain, LlamaIndex, SQLModel, pgvector, ngrok, telegram

  21. plexe

    ✨ Build a machine learning model from a prompt

    Project mention: Launch HN: Plexe (YC X25) – Build production-grade ML models from prompts | news.ycombinator.com | 2025-11-04

    - Deploys the best model with monitoring and automatic retraining

    We did a Show HN for our open-source library five months ago (https://news.ycombinator.com/item?id=43906346). Since then, we've launched our commercial platform with interactive refinement, production-grade model evaluations, retraining pipeline, data connectors, analytics dashboards, and deployment for online and batch inference.

    We use a multi-agent architecture where specialized agents handle different pipeline stages. Each agent focuses on its domain: data analysis, feature engineering, model selection, deployment, and so on. The platform tracks all experiments and generates exportable Python code.

    Our open-source core (https://github.com/plexe-ai/plexe, Apache 2.0) remains free for local development. For the paid product, our pricing is usage-based, with a minimum top up of $10. Enterprises can self-host the entire platform. You can sign up on https://console.plexe.ai. Use promo code `LAUNCHDAY20` to get $20 to try out the platform.

    We’d love to hear your thoughts on the problem and feedback on the platform!

  22. nannyml

    nannyml: post-deployment data science in python

    Project mention: Personal Picks: Data Product News (June 11, 2025) | dev.to | 2025-06-10
  23. ScaledYOLOv4

    Scaled-YOLOv4: Scaling Cross Stage Partial Network

  24. Photonix

    A modern, web-based photo management server. Run it on your home server and it will let you find the right photo from your collection on any device. Smart filtering is made possible by object recognition, face recognition, location awareness, color analysis and other ML algorithms.

  25. GPflow

    Gaussian processes in TensorFlow

  26. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python ML discussion

Python ML related posts

  • Launch HN: Plexe (YC X25) – Build production-grade ML models from prompts

    1 project | news.ycombinator.com | 4 Nov 2025
  • DevOps, MLOps, or Platform Engineering, In 2025, who will own the pipeline?

    4 projects | dev.to | 20 Jun 2025
  • Personal Picks: Data Product News (June 11, 2025)

    1 project | dev.to | 10 Jun 2025
  • Show HN: System Prompt Learning – LLMs Learn Problem-Solving from Experience

    5 projects | news.ycombinator.com | 2 Jun 2025
  • A machine learning library

    1 project | dev.to | 8 May 2025
  • Show HN: Note now supports finding the best optimizer

    1 project | news.ycombinator.com | 6 Mar 2025
  • Show HN: Video processing pipeline for LLM – Python

    1 project | news.ycombinator.com | 28 Feb 2025
  • A note from our sponsor - InfluxDB
    www.influxdata.com | 22 Dec 2025
    InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now. Learn more →

Index

What are some of the best open-source ML projects in Python? This list will help you:

# Project Stars
1 yolov5 56,405
2 MLflow 23,403
3 best-of-ml-python 22,954
4 ludwig 11,636
5 metaflow 9,675
6 CoreML-Models 6,890
7 openllmetry 6,700
8 feast 6,548
9 aim 5,921
10 superduper 5,235
11 zenml 5,104
12 awesome-mlops 4,908
13 Kiln 4,485
14 deepchecks 3,955
15 zvt 3,874
16 polyaxon 3,688
17 hub 3,523
18 RasaGPT 2,447
19 plexe 2,284
20 nannyml 2,119
21 ScaledYOLOv4 2,031
22 Photonix 1,927
23 GPflow 1,895

Sponsored
InfluxDB – Built for High-Performance Time Series Workloads
InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
www.influxdata.com

Did you know that Python is
the 2nd most popular programming language
based on number of references?