Skip to content
Navigation Menu
Toggle navigation
Sign in
Appearance settings
Platform
GitHub Copilot
Write better code with AI
GitHub Spark
New
Build and deploy intelligent apps
GitHub Models
New
Manage and compare prompts
GitHub Advanced Security
Find and fix vulnerabilities
Actions
Automate any workflow
Codespaces
Instant dev environments
Issues
Plan and track work
Code Review
Manage code changes
Discussions
Collaborate outside of code
Code Search
Find more, search less
Explore
Why GitHub
Documentation
GitHub Skills
Blog
Integrations
GitHub Marketplace
MCP Registry
View all features
Solutions
By company size
Enterprises
Small and medium teams
Startups
Nonprofits
By use case
App Modernization
DevSecOps
DevOps
CI/CD
View all use cases
By industry
Healthcare
Financial services
Manufacturing
Government
View all industries
View all solutions
Resources
Topics
AI
DevOps
Security
Software Development
View all
Explore
Learning Pathways
Events & Webinars
Ebooks & Whitepapers
Customer Stories
Partners
Executive Insights
Open Source
GitHub Sponsors
Fund open source developers
The ReadME Project
GitHub community articles
Repositories
Topics
Trending
Collections
Enterprise
Enterprise platform
AI-powered developer platform
Available add-ons
GitHub Advanced Security
Enterprise-grade security features
Copilot for business
Enterprise-grade AI features
Premium Support
Enterprise-grade 24/7 support
Pricing
Search or jump to...
Search code, repositories, users, issues, pull requests...
Search syntax tips
Provide feedback
Saved searches
Use saved searches to filter your results more quickly
Sign in
Sign up
Appearance settings
Resetting focus
You signed in with another tab or window.
Reload
to refresh your session.
You signed out in another tab or window.
Reload
to refresh your session.
You switched accounts on another tab or window.
Reload
to refresh your session.
Dismiss alert
{{ message }}
vllm-project
/
vllm
Public
Uh oh!
There was an error while loading.
Please reload this page
.
Notifications
You must be signed in to change notification settings
Fork
10.6k
Star
59.6k
Code
Issues
1.8k
Pull requests
1.2k
Discussions
Actions
Projects
14
Security
Uh oh!
There was an error while loading.
Please reload this page
.
Insights
Additional navigation options
Code
Issues
Pull requests
Discussions
Actions
Projects
Security
Insights
vllm-project/vllm projects
Projects
Projects
Templates
Projects
Templates
Search results
14 open and 0 closed projects found.
Open
14
Closed
0
Sort
Recently updated
Newest
Oldest
Least recently updated
Name
Structured Output
#16 updated
Oct 7, 2025
Transformers backend
#28 updated
Oct 7, 2025
V0 Deprecation
#25 updated
Oct 7, 2025
Multi-modality Core
Main tasks for the multi-modality workstream (#4194)
#8 updated
Oct 7, 2025
torch.compile integration
torch.compile integration related
#12 updated
Oct 6, 2025
DeepSeek V3/R1
Template
2025-02-25: DeepSeek V3/R1 is supported with optimized block FP8 kernels, MLA, MTP spec decode, multi-node PP, EP, and W4A16 quantization
#5 updated
Oct 6, 2025
Onboarding Tasks
A list of onboarding tasks for first-time contributors to get started with vLLM.
#6 updated
Oct 6, 2025
[V1] Pipeline Parallelism
[Testing] Optimize V1 PP efficiency.
#1 updated
Oct 6, 2025
Llama Issues & Bugs
Tracker of known issues and bugs for serving Llama on vLLM
#14 updated
Oct 6, 2025
Batch-invariant Inference
#29 updated
Oct 4, 2025
Ray
Tracks Ray issues and pull requests in vLLM
#7 updated
Oct 2, 2025
Multi-modal Model Requests
Community requests for multi-modal models
#10 updated
Sep 29, 2025
Llama Features & Optimizations
Enhancement to Llama herd of models. See also https://github.com/vllm-project/vllm/issues/16114
#13 updated
Sep 21, 2025
[V1] Speculative Decoding
#2 updated
Aug 14, 2025
You can’t perform that action at this time.