Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Tanuj/merge
#20579 opened Jul 7, 2025 by tanujtiwari1998 Loading…
4 tasks
[Bench] Add NVFP4 GEMM benchmark script perf-benchmarks performance Performance-related issues quantization
#20578 opened Jul 7, 2025 by mgoin Loading…
[Core][Model] PrithviMAE Enablement on vLLM v1 engine (with zero kv_cache_groups) documentation Improvements or additions to documentation multi-modality Related to multi-modality (#4194) needs-rebase v1
#20577 opened Jul 7, 2025 by christian-pinto Loading…
[Misc] Improve logging for dynamic shape cache compilation
#20573 opened Jul 7, 2025 by kyolebu Loading…
1 of 4 tasks
[Config] Refactor mistral configs llama Related to Llama models
#20570 opened Jul 7, 2025 by patrickvonplaten Loading…
[Misc] use --ep_config to set eplb param deepseek Related to DeepSeek models v1
#20562 opened Jul 7, 2025 by lengrongfu Draft
2 of 4 tasks
[Hardware][PPC64LE] Enable V1 for ppc64le v1
#20554 opened Jul 7, 2025 by Akashcodes732 Loading…
4 tasks
[Model] Support VLMs with transformers backend ci/build documentation Improvements or additions to documentation multi-modality Related to multi-modality (#4194)
#20543 opened Jul 7, 2025 by zucchini-nlp Loading…
[Test] Remove docker build and docker clean from test. ci/build tpu Related to Google TPUs
#20542 opened Jul 7, 2025 by QiliangCui Loading…
3 tasks done
[CI/Build] Ensure compatability with Transformers v4.53 ci/build multi-modality Related to multi-modality (#4194) qwen Related to Qwen models ready ONLY add when PR is ready to merge/full CI is needed
#20541 opened Jul 7, 2025 by Isotr0py Loading…
1 of 4 tasks
DO NOT MERGE - debug needs-rebase performance Performance-related issues
#20535 opened Jul 7, 2025 by robertgshaw2-redhat Draft
[Model] Add AutoWeightsLoader support for BERT
#20534 opened Jul 7, 2025 by panyuhe Loading…
[Benchmark] Parameterization of streaming loading of multimodal datasets performance Performance-related issues
#20528 opened Jul 6, 2025 by Potabk Loading…
3 tasks done
[Benchmarks] Add memory tracking to serving benchmark ci/build performance Performance-related issues
#20519 opened Jul 6, 2025 by sfeng33 Loading…
ProTip! Type g i on any issue or pull request to go back to the issue listing page.