triton-inference-server/tensorrtllm_backend
Commit History
Commits on Aug 15, 2025
TensorRT-LLM Backend v1.1.0rc0 release (#789)
Superjomn and kaiyux authored edb0cdf
Commits on Aug 7, 2025
TensorRT-LLM Backend v1.0.0rc6 release (#785)
Superjomn and kaiyux authored e6dfd73
Commits on Aug 4, 2025
TensorRT-LLM Backend v1.0.0rc5 release (#783)
nv-guomingz and kaiyux authored 1de3939
Commits on Aug 2, 2025
Update dockefile to keep up with latest version (#779)
mc-nv authored eb0588f
Commits on Jul 22, 2025
TensorRT-LLM Backend v1.0.0rc4 release (#777)
nv-guomingz and kaiyux authored 8231491
Commits on Jul 16, 2025
TensorRT-LLM Backend v1.0.0rc3 release (#776)
nv-guomingz and kaiyux authored 4575254
Commits on Jul 8, 2025
TensorRT-LLM Backend v1.0.0rc2 release (#774)
nv-guomingz and kaiyux authored 6d4a257
Commits on Jul 2, 2025
fix: use the correct max tokens parameter name in example (#773)
achartier authored 0edef02
Commits on Jul 1, 2025
doc: use trtllm-llmapi-launch for multi-node Triton (#772)
achartier authored 9237442
[TRTLLM-6104] add docs on request_perf_metrics to triton LLMAPI backen (#769)
xuanzic authored de54e7f
TensorRT-LLM Backend v1.0.0rc1 release (#771)
nv-guomingz and kaiyux authored 448706a
Commits on Jun 25, 2025
TensorRT-LLM Backend v1.0.0rc0 release (#768)
nv-guomingz and kaiyux authored 5b38a98
Commits on Jun 18, 2025
Update build.md (#764)
Tabrizian authored 205c2d9
chore: update tensorrt llm to 0.21.0rc2 (#765)
nv-guomingz authored 623ad0f
Commits on Jun 13, 2025
feat: update documentation for multi-node (#762)
achartier authored 08ed112
Commits on Jun 11, 2025
chore: update tensorrt llm to 0.21.0rc1 (#761)
nv-guomingz authored d07ec31
Commits on Jun 9, 2025
ci: fix the docker container build to include the /app folder (#759)
richardhuo-nv authored adc0cc0
Commits on Jun 4, 2025
Update submodule (#757)
Shixiaowei02 authored a701b41
Commits on May 21, 2025
TPRD-1536: Update Dockerfile.triton.trt_llm_backend (#752)
mc-nv authored 402e36e
Commits on May 20, 2025
Remove backend files and update documentation (#745)
Tabrizian authored 6a749f5
Commits on May 13, 2025
Update TensorRT-LLM backend (#749)
Shixiaowei02 and kaiyux authored c2e65b8
Commits on Apr 29, 2025
Update TensorRT-LLM backend (#744)
kaiyux authored b5fa472
Commits on Apr 23, 2025
Update TensorRT-LLM backend (#742)
kaiyux authored dbae4e8
Commits on Apr 16, 2025
Update TensorRT-LLM backend (#735)
kaiyux authored 689a553
Commits on Apr 8, 2025
Update TensorRT-LLM backend (#733)
kaiyux authored 1a9fa20
Commits on Apr 1, 2025
TensorRT-LLM backend update (#731)
kaiyux authored 15cb989
Commits on Mar 26, 2025
Update TensorRT-LLM backend (#729)
kaiyux authored 6c88297
Commits on Mar 18, 2025
Update TensorRT-LLM backend (#726)
kaiyux authored 89805c1
Commits on Mar 11, 2025
Update TensorRT-LLM backend (#722)
kaiyux authored 71ff7bc
Commits on Mar 4, 2025
Update TensorRT-LLM backend (#715)
kaiyux authored a315d2d
Commits on Feb 25, 2025
Update TensorRT-LLM backend (#713)
kaiyux authored 071ee5e
Commits on Feb 18, 2025
Update TensorRT-LLM backend (#708)
kaiyux authored f51ea9d
Commits on Feb 13, 2025
Update submodule (#703)
kaiyux authored b629484
Commits on Feb 12, 2025
Update TensorRT-LLM backend (#701)
kaiyux authored 565008d
Commits on Feb 11, 2025
Update submodule (#700)
kaiyux authored dc35794