Top 23 Python llm Projects

awesome-llm-apps

1 9 83,772 9.6 Python

Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.

Project mention: Show HN: AI Real Estate Agent Team (100% Open-Source with Free Tutorial) | news.ycombinator.com | 2025-08-06
InfluxDB

www.influxdata.com featured

InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
unsloth

2 29 49,610 9.9 Python

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Project mention: Unsloth – Train LLMs 2x faster with 70% less VRAM | news.ycombinator.com | 2025-12-10
crewAI

3 16 41,461 9.8 Python

Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.

Project mention: I added native time awareness to CrewAI to fix LLM date hallucinations | news.ycombinator.com | 2025-12-14
MindsDB

4 94 38,107 9.9 Python

Query Engine for AI - The only MCP Server you'll ever need

Project mention: “One Journey Ends, Another Begins — My Hacktoberfest 2025 Story” | dev.to | 2025-10-31

Just wrapped up my Hacktoberfest project using MindsDB and Streamlit — built a CRM Semantic Search AI app! 😄 If anyone’s into open source + AI, would love feedback on my PR: Hacktoberfest 2025 PR – Add CRM Semantic Search use case (MindsDB)
BettaFish

5 1 33,338 9.8 Python

微舆：人人可用的多Agent舆情分析助手，打破信息茧房，还原舆情原貌，预测未来走向，辅助决策！从0实现，不依赖任何框架。

Project mention: BettaFish – Public Opinion Sentiment Analysis Model | news.ycombinator.com | 2025-11-03
graphrag

6 25 29,803 8.6 Python

A modular graph-based Retrieval-Augmented Generation (RAG) system

Project mention: The AI-Native GraphDB + GraphRAG + Graph Memory Landscape & Market Catalog | dev.to | 2025-10-26

URL: https://microsoft.github.io/graphrag/ and https://github.com/microsoft/graphrag and https://github.com/Azure-Samples/graphrag-accelerator
kotaemon

7 10 24,766 8.8 Python

An open-source RAG-based tool for chatting with your documents.

Project mention: Kotaemon-papers: an open-source web app to chat with your academic papers | news.ycombinator.com | 2025-01-05

Hi HN,
Our team at https://github.com/Cinnamon/kotaemon/ has been working on a public demo to showcase the new advanced citation features in our RAG (retrieval-augmented generation) application.
We’re excited to share a web app that lets users explore top daily machine learning (ML) papers on Arxiv (via the HuggingFace API) and upload their own Arxiv papers to get LLM-assisted summaries, mind maps, and answers to questions based on the content.
Some notable features:
- Instant Summaries & Mind Maps: Generate concise summaries and visual mind maps for any Arxiv paper.
- Transparent Citations: Verify AI-generated answers with clear, evidence-backed citations. Citations are highlighted directly in the in-browser PDF viewer.
- Flexible Citation Options: Choose between highlights and inline citations. Plus, click on any sentence in the AI-generated response to see its supporting source from the original paper.
- Multi-Paper Analysis: Compare, contrast, and compose summaries from multiple papers simultaneously.
- Complex Question Solving: Use Chain-of-Thought (CoT) reasoning mode to break down and solve complex questions step-by-step.
- Customizable & Private Hosting: Easily self-host or customize your private app via HuggingFace Spaces. You can securely connect your LLM and upload your own document collections.
We’d love to hear your thoughts, feedback, and recommendations as we continue improving this tool.
Check out the demo here and happy hacking!
Stream

getstream.io featured

Stream - Scalable APIs for Chat, Feeds, Moderation, & Video. Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.
gpt-researcher

8 7 24,503 9.6 Python

An LLM agent that conducts deep research (local and web) on any given topic and generates a long report with citations.

Project mention: Configuring your own deep research tool (Using Nix Flakes) | dev.to | 2025-04-28

For demonstration purposes, I'll be using the gpt-researcher tool. Github link: https://github.com/assafelovic/gpt-researcher
fastmcp

9 12 21,322 9.9 Python

🚀 The fast, Pythonic way to build MCP servers and clients

Project mention: How to Build Your Own MCP Server with Python | dev.to | 2025-11-02

We’ll use a library called FastMCP that simplifies the process of building MCP servers. You can install it using pip:
graphiti

10 10 21,190 9.7 Python

Build Real-Time Knowledge Graphs for AI Agents

Project mention: I built an faster Notion in Rust | news.ycombinator.com | 2025-11-24
serena

11 11 17,340 9.9 Python

A powerful coding agent toolkit providing semantic retrieval and editing capabilities (MCP server & other integrations)

Project mention: Building a 400K-Line Full-Stack App Solo with Claude Code in 2025 | dev.to | 2025-12-22

Serena - Semantic code search and editing tool
outlines

12 46 13,141 9.5 Python

Structured Outputs

Project mention: PyCon 2025 Agentic App with Use of Pydantic AI | news.ycombinator.com | 2025-09-14

`outlines` (https://github.com/dottxt-ai/outlines) is very good and supported by vLLM as a backend structured output provider (https://docs.vllm.ai/en/v0.8.2/features/structured_outputs.h...) for both local and remote LLMs. vLLM is probably the best open source tooling for the inference side right now.
litgpt

13 2 13,034 9.1 Python

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Project mention: Lightning.ai – an enterprise managed inference platform for AI | news.ycombinator.com | 2025-10-09

After making model training simpler with PyTorch Lightning, Lightning.AI is now tackling the next bottleneck — inference. Their new managed service targets enterprises deploying LLMs and deep learning models at scale, emphasizing performance, cost-efficiency, and developer-friendly tooling.
Platform: https://lightning.ai/
LLMSurvey

14 3 12,014 7.3 Python

The official GitHub page for the survey paper "A Survey of Large Language Models".
cleanlab

15 70 11,208 6.9 Python

Cleanlab's open-source library is the standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
llm

16 91 10,554 9.5 Python

Access large language models from the command-line

Project mention: You Don't Need to Spend $100/Mo on Claude Code:Your Guide to Local Coding Models | news.ycombinator.com | 2025-12-21
mcp-agent

17 18 7,866 9.8 Python

Build effective agents using Model Context Protocol and simple workflow patterns

Project mention: Show HN: MCP-C – cloud platform for running MCP agents and apps | news.ycombinator.com | 2025-10-28
Upsonic

18 6 7,727 9.8 Python

Agent Framework For Fintech and Banks

Project mention: An AI agent framework used by fintechs | news.ycombinator.com | 2025-11-22
OpenNMT-py

19 6 6,981 0.0 Python

Open Source Neural Machine Translation and (Large) Language Models in PyTorch
Liger-Kernel

20 3 5,961 9.4 Python

Efficient Triton Kernels for LLM Training

Project mention: Why ML Needs a New Programming Language | news.ycombinator.com | 2025-09-05
atomic-agents

21 5 5,387 9.5 Python

Building AI agents, atomically
transformerlab-app

22 8 4,592 10.0 Python

Open Source Application for Advanced LLM + Diffusion Engineering: interact, train, fine-tune, and evaluate large language models on your own computer.

Project mention: Gemma 3 QAT Models: Bringing AI to Consumer GPUs | news.ycombinator.com | 2025-04-20
ag2

23 8 3,951 9.8 Python

AG2 (formerly AutoGen): The Open-Source AgentOS. Join us at: https://discord.gg/pAbnFJrkgZ

Project mention: Paper2Agent: Stanford Reimagining Research Papers as Interactive AI Agents | news.ycombinator.com | 2025-09-24

We built agents to test github repo quickstarts associated with arXiv papers a couple months before this paper was published, wrote about it publicly here: https://remyxai.substack.com/p/self-healing-repos
We've been pushing it farther to implement draft PRs in your target repo, published a month before this preprint: https://remyxai.substack.com/p/paperswithprs
To limit the attack surface we added PR#1929 to AG2 so we could pass API keys to the DockerCommandLineCodeExecutor but also use egress whitelisting to block the ability of an agent to reach a compromised server: https://github.com/ag2ai/ag2/pull/1929
Since then, we've been scaling this with k8s ray workers so we can run this in the cloud to build for the hundreds of papers published daily.
By running in Docker, constraining the network interface, deploying on the cloud, and ultimately keeping humans-in-the-loop through PR review, it's hard to see where the prompt-injection attack comes into play from testing the code.
Would love to get feedback from an expert on this, can you imagine an attack scenario, Simon?
I'll need to work out a check for the case where someone creates a paper with code instructing my agent to publish keys to a public HF repo for others to exfiltrate.
SaaSHub

www.saashub.com featured

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python llms discussion

Python llms related posts

You Don't Need to Spend $100/Mo on Claude Code:Your Guide to Local Coding Models

10 projects | news.ycombinator.com | 21 Dec 2025
Managing GitHub Copilot CLI MCP Server Configuration in Your Repository

2 projects | dev.to | 20 Dec 2025
I added native time awareness to CrewAI to fix LLM date hallucinations

1 project | news.ycombinator.com | 14 Dec 2025
OpenAI are quietly adopting skills, now available in ChatGPT and Codex CLI

7 projects | news.ycombinator.com | 12 Dec 2025
Unsloth – Train LLMs 2x faster with 70% less VRAM

1 project | news.ycombinator.com | 10 Dec 2025
Show HN: Memory System for Claude Code and Other CLIs

1 project | news.ycombinator.com | 5 Dec 2025
Agentic Entity Resolution for Messy Product Data

1 project | dev.to | 29 Nov 2025
A note from our sponsor - InfluxDB
www.influxdata.com | 22 Dec 2025

InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now. Learn more →

Index

What are some of the best open-source llm projects in Python? This list will help you:

#	Project	Stars
1	awesome-llm-apps	83,772
2	unsloth	49,610
3	crewAI	41,461
4	MindsDB	38,107
5	BettaFish	33,338
6	graphrag	29,803
7	kotaemon	24,766
8	gpt-researcher	24,503
9	fastmcp	21,322
10	graphiti	21,190
11	serena	17,340
12	outlines	13,141
13	litgpt	13,034
14	LLMSurvey	12,014
15	cleanlab	11,208
16	llm	10,554
17	mcp-agent	7,866
18	Upsonic	7,727
19	OpenNMT-py	6,981
20	Liger-Kernel	5,961
21	atomic-agents	5,387
22	transformerlab-app	4,592
23	ag2	3,951

Python llms

Top 23 Python llm Projects

Python llms discussion

Python llms related posts

You Don't Need to Spend $100/Mo on Claude Code:Your Guide to Local Coding Models

Managing GitHub Copilot CLI MCP Server Configuration in Your Repository

I added native time awareness to CrewAI to fix LLM date hallucinations

OpenAI are quietly adopting skills, now available in ChatGPT and Codex CLI

Unsloth – Train LLMs 2x faster with 70% less VRAM

Show HN: Memory System for Claude Code and Other CLIs

Agentic Entity Resolution for Messy Product Data

Index

Did you know that Python is the 2nd most popular programming language based on number of references?

Did you know that Python is
the 2nd most popular programming language
based on number of references?