-
- Notifications
You must be signed in to change notification settings - Fork 8.5k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Bench] Add NVFP4 GEMM benchmark script perf-benchmarks performance Performance-related issues quantization
#20578 opened Jul 7, 2025 by mgoin Loading…
[Core][Model] PrithviMAE Enablement on vLLM v1 engine (with zero kv_cache_groups) documentation Improvements or additions to documentation multi-modality Related to multi-modality (#4194) needs-rebase v1
#20577 opened Jul 7, 2025 by christian-pinto Loading…
feat - add a new endpoint
get_tokenizer_info
to provide tokenizer/chat-template information frontend #20575 opened Jul 7, 2025 by m-misiura Loading…
[Misc] Improve logging for dynamic shape cache compilation
#20573 opened Jul 7, 2025 by kyolebu Loading…
1 of 4 tasks
[Config] Refactor mistral configs llama Related to Llama models
#20570 opened Jul 7, 2025 by patrickvonplaten Loading…
[Bugfix] Add checks to prevent blocks from being invalidly occupied.
#20569 opened Jul 7, 2025 by CLFutureX Loading…
[Misc] use --ep_config to set eplb param deepseek Related to DeepSeek models v1
#20562 opened Jul 7, 2025 by lengrongfu • Draft
2 of 4 tasks
[CI/Build][CPU] Fix CPU CI and remove all CPU V0 files ci/build v1
#20560 opened Jul 7, 2025 by bigPYJ1151 Loading…
1 of 4 tasks
[Hardware][PPC64LE] Enable V1 for ppc64le v1
#20554 opened Jul 7, 2025 by Akashcodes732 Loading…
4 tasks
Replace Improvements or additions to documentation frontend tool-calling
--expand-tools-even-if-tool-choice-none
with --exclude-tools-when-tool-choice-none
for v0.10.0 documentation #20544 opened Jul 7, 2025 by okdshin Loading…
[Model] Support VLMs with transformers backend ci/build documentation Improvements or additions to documentation multi-modality Related to multi-modality (#4194)
#20543 opened Jul 7, 2025 by zucchini-nlp Loading…
[Test] Remove docker build and docker clean from test. ci/build tpu Related to Google TPUs
#20542 opened Jul 7, 2025 by QiliangCui Loading…
3 tasks done
[CI/Build] Ensure compatability with Transformers v4.53 ci/build multi-modality Related to multi-modality (#4194) qwen Related to Qwen models ready ONLY add when PR is ready to merge/full CI is needed
#20541 opened Jul 7, 2025 by Isotr0py Loading…
1 of 4 tasks
DO NOT MERGE - debug needs-rebase performance Performance-related issues
#20535 opened Jul 7, 2025 by robertgshaw2-redhat • Draft
Refactor: Remove numpy dependency from LoggingStatLogger v1
#20529 opened Jul 6, 2025 by skyloevil Loading…
[Benchmark] Parameterization of streaming loading of multimodal datasets performance Performance-related issues
#20528 opened Jul 6, 2025 by Potabk Loading…
3 tasks done
[Third Party] Add a hook to the GPU Model Runner in Worker KV Connector start_load_kv() v1
#20524 opened Jul 6, 2025 by sammshen Loading…
[Benchmarks] Add memory tracking to serving benchmark ci/build performance Performance-related issues
#20519 opened Jul 6, 2025 by sfeng33 Loading…
[Bugfix] fix the block.prev_block reference - release problem
#20512 opened Jul 5, 2025 by CLFutureX Loading…
Add reproducible prefix-cache block hashing using SHA-256 + CBOR ci/build v1
#20511 opened Jul 5, 2025 by vMaroon Loading…
adds optional reasoning content field to ConversationMessage frontend
#20505 opened Jul 4, 2025 by arpitg1991 Loading…
4 tasks
feat: Add streaming support for Mistral v11 tool format frontend tool-calling
#20503 opened Jul 4, 2025 by sjuxax Loading…
Previous Next
ProTip! Type g i on any issue or pull request to go back to the issue listing page.