Pull requests: triton-inference-server/tutorials
fix(pre-commit): update hooks versions
#145 by mc-nv was merged Nov 4, 2025, updated Nov 4, 2025
TPRD-1710: Update default branches post-25.09
#143 by mc-nv was merged Oct 6, 2025, updated Nov 4, 2025
fix(pre-commit): TRI-237: Address python version issue
#144 by mc-nv was closed Nov 3, 2025, updated Nov 3, 2025
Update README.md to comply with the latest version of TRT-LLM and outlines
#132 by kuleat was closed Jun 12, 2025, updated Jun 12, 2025
cherry-pick speculative decoding related PR #133 and #135
#136 by ziqifan617 was merged Mar 25, 2025, updated Mar 25, 2025
docs: move Constrained_Decoding and Function_Calling to Feature_Guide | rm AI_Agents_Guide folder
#135 by ziqifan617 was merged Mar 24, 2025, updated Mar 24, 2025
docs: Add EAGLE/SpS Speculative Decoding support with vLLM
#133 by ziqifan617 was merged Mar 21, 2025, updated Mar 21, 2025
Docs: add tutorials on EAGLE, MEDUSA, vanilla speculative decoding using TRT-LLM
#131 by ziqifan617 was merged Mar 4, 2025, updated Mar 4, 2025
update copyright for llama2 trtllm guide
#130 by ziqifan617 was merged Feb 12, 2025, updated Feb 12, 2025
update and polish llama2 trtllm_guide.md
#129 by ziqifan617 was merged Feb 12, 2025, updated Feb 12, 2025
Doc: Fix links to correct md files
#126 by statiraju was merged Jan 23, 2025, updated Jan 23, 2025
Md files need to have only one heading for rst files to
#125 by statiraju was merged Jan 9, 2025, updated Jan 9, 2025
Fix heading for vllm model inferencing
#123 by statiraju was merged Jan 8, 2025, updated Jan 8, 2025
Add basic testing for the tutorial
#90 by Tabrizian was closed Nov 28, 2024, updated Nov 28, 2024
Add onnxruntime genai example to the iterative scheduling tutorial
#93 by Tabrizian was closed Nov 28, 2024, updated Nov 28, 2024
Typo correction "enforcig" => "enforcing"
#122 by harryskim was merged Nov 22, 2024, updated Nov 22, 2024
docs: Add Semantic Caching Tutorial
#118 by oandreeva-nv was merged Oct 26, 2024, updated Oct 26, 2024
Update AutoScaling Blog to 24.07
#108 by indrajit96 was merged Oct 9, 2024, updated Oct 9, 2024
Multi-Node EKS Support mainline PR
#111 by indrajit96 was merged Oct 8, 2024, updated Oct 8, 2024
chore: Update Triton + Ray Serve Tutorial for Ray Summit 2024
#115 by nnshah1 was merged Sep 26, 2024, updated Sep 26, 2024
docs: Clarify Logits Processor and TRT-LLM examples
#116 by oandreeva-nv was merged Sep 24, 2024, updated Sep 24, 2024
Pin TensorRT Version in Stable Diffusion Tutorial
#103 by fpetrini15 was merged Aug 1, 2024, updated Sep 10, 2024