Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure. Learn more →
Top 23 Python fine-tuning Projects
- Project mention: Llama-Factory: Unified, Efficient Fine-Tuning for 100 Open LLMs | news.ycombinator.com | 2025-09-18
-
Stream
Stream - Scalable APIs for Chat, Feeds, Moderation, & Video. Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.
-
unsloth
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.
Project mention: Unsloth – Train LLMs 2x faster with 70% less VRAM | news.ycombinator.com | 2025-12-10 - Project mention: How to Build a RAG Solution with Llama Index, ChromaDB, and Ollama | dev.to | 2025-11-04
Step 2: Set up LlamaIndex and Chroma DB
-
CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Project mention: CosyVoice 2025 Complete Guide: The Ultimate Multi-lingual Text-to-Speech Solution | dev.to | 2025-12-15git clone --recursive https://github.com/FunAudioLLM/CosyVoice.git cd CosyVoice # If submodule cloning fails due to network issues git submodule update --init --recursive
-
OpenLLM
Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.
Project mention: Your 2025 Roadmap to Becoming an AI Engineer for Free for Vue.js Developers | dev.to | 2025-08-06REST APIs to connect AI models to Vue.js apps (example 1, example 2).
-
-
h2o-llmstudio
H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/
-
InfluxDB
InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
- Project mention: DeepFabric – Generate High-Quality Synthetic Datasets at Scale | news.ycombinator.com | 2025-09-26
-
cognita
RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry
Project mention: Lists of open-source frameworks for building RAG applications | dev.to | 2025-01-02Ideal For: Enterprises seeking a robust framework for large-scale AI applications. GitHub Repository
-
- Project mention: OML 1.0 via Fingerprinting: Open, Monetizable, and Loyal AI | news.ycombinator.com | 2025-07-07
-
xTuring
Build, personalize and control your own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-source LLMs. Join our discord community: https://discord.gg/TgHXuSJEk6
-
Just use https://github.com/bghira/SimpleTuner
I was able to run this script to train a Lora myself without spending any time learning the underlying python libraries.
-
-
maestro
streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL (by roboflow)
-
-
dstack
dstack is an open-source control plane for running development, training, and inference jobs on GPUs—across hyperscalers, neoclouds, or on-prem.
Project mention: Orchestrating GPUs in data centers and private clouds | news.ycombinator.com | 2025-02-18Super excited to hear any feedback.
[1] https://github.com/dstackai/dstack/issues/2184
-
custom-diffusion
Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)
-
DB-GPT-Hub
A repository that contains models, datasets, and fine-tuning techniques for DB-GPT, with the purpose of enhancing model performance in Text-to-SQL
-
I thought this was pretty well known (at least in the JAX/XLA world). I've hit this many times and got batch variance explained to me before: https://github.com/google-deepmind/penzai/issues/82 and
-
- Project mention: Curator: Scalable data pre processing and curation toolkit for LLMs | news.ycombinator.com | 2025-08-20
-
LLM-Adapters
Code for our EMNLP 2023 Paper: "LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models"
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Python fine-tuning discussion
Python fine-tuning related posts
-
Unsloth – Train LLMs 2x faster with 70% less VRAM
-
DeepFabric – Generate High-Quality Synthetic Datasets at Scale
-
Llama-Factory: Unified, Efficient Fine-Tuning for 100 Open LLMs
-
Defeating Nondeterminism in LLM Inference
-
Show HN: Kiln – AI Boilerplate with Evals, Fine-Tuning, Synthetic Data, and Git
-
Qwen3-235B-A22B-Thinking-2507
-
One Input, Multiple AI Minds: Meet the New MultiMindSDK LLM Router
- A note from our sponsor - Stream getstream.io | 24 Dec 2025
Index
What are some of the best open-source fine-tuning projects in Python? This list will help you:
| # | Project | Stars |
|---|---|---|
| 1 | LLaMA-Factory | 64,310 |
| 2 | unsloth | 49,610 |
| 3 | llama_index | 45,989 |
| 4 | CosyVoice | 18,177 |
| 5 | OpenLLM | 12,013 |
| 6 | ludwig | 11,636 |
| 7 | h2o-llmstudio | 4,757 |
| 8 | Kiln | 4,485 |
| 9 | cognita | 4,303 |
| 10 | lorax | 3,569 |
| 11 | OML-1.0-Fingerprinting | 3,533 |
| 12 | xTuring | 2,664 |
| 13 | SimpleTuner | 2,673 |
| 14 | OneTrainer | 2,649 |
| 15 | maestro | 2,647 |
| 16 | YiVal | 2,114 |
| 17 | dstack | 1,982 |
| 18 | custom-diffusion | 1,969 |
| 19 | DB-GPT-Hub | 1,947 |
| 20 | penzai | 1,830 |
| 21 | LongWriter | 1,792 |
| 22 | Curator | 1,285 |
| 23 | LLM-Adapters | 1,217 |