vllm-project / vllm Public

Notifications You must be signed in to change notification settings
Fork 8.5k
Star 51.7k

Code
Issues 1.8k
Pull requests 773
Discussions
Actions
Projects 11
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Security
Insights

Pull requests: vllm-project/vllm

Labels 54 Milestones 1

New pull request New

773 Open 9,741 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

Tanuj/merge

#20579 opened Jul 7, 2025 by tanujtiwari1998

Loading…

4 tasks

[Bench] Add NVFP4 GEMM benchmark script perf-benchmarks performance

Performance-related issues

quantization

#20578 opened Jul 7, 2025 by mgoin

Loading…

[Core][Model] PrithviMAE Enablement on vLLM v1 engine (with zero kv_cache_groups) documentation

Improvements or additions to documentation

multi-modality

Related to multi-modality (#4194)

needs-rebase v1

#20577 opened Jul 7, 2025 by christian-pinto

Loading…

feat - add a new endpoint get_tokenizer_info to provide tokenizer/chat-template information frontend

#20575 opened Jul 7, 2025 by m-misiura

Loading…

[Misc] Improve logging for dynamic shape cache compilation

#20573 opened Jul 7, 2025 by kyolebu

Loading…

1 of 4 tasks

[Config] Refactor mistral configs llama

Related to Llama models

#20570 opened Jul 7, 2025 by patrickvonplaten

Loading…

[Bugfix] Add checks to prevent blocks from being invalidly occupied.

#20569 opened Jul 7, 2025 by CLFutureX

Loading…

[Misc] use --ep_config to set eplb param deepseek

Related to DeepSeek models

#20562 opened Jul 7, 2025 by lengrongfu • Draft

2 of 4 tasks

[CI/Build][CPU] Fix CPU CI and remove all CPU V0 files ci/build v1

#20560 opened Jul 7, 2025 by bigPYJ1151

Loading…

1 of 4 tasks

[Hardware][PPC64LE] Enable V1 for ppc64le v1

#20554 opened Jul 7, 2025 by Akashcodes732

Loading…

4 tasks

Replace --expand-tools-even-if-tool-choice-none with --exclude-tools-when-tool-choice-none for v0.10.0 documentation

Improvements or additions to documentation

frontend tool-calling

#20544 opened Jul 7, 2025 by okdshin

Loading…

[Model] Support VLMs with transformers backend ci/build documentation

Improvements or additions to documentation

multi-modality

Related to multi-modality (#4194)

#20543 opened Jul 7, 2025 by zucchini-nlp

Loading…

[Test] Remove docker build and docker clean from test. ci/build tpu

Related to Google TPUs

#20542 opened Jul 7, 2025 by QiliangCui

Loading…

3 tasks done

[CI/Build] Ensure compatability with Transformers v4.53 ci/build multi-modality

Related to multi-modality (#4194)

qwen

Related to Qwen models

ready

ONLY add when PR is ready to merge/full CI is needed

#20541 opened Jul 7, 2025 by Isotr0py

Loading…

1 of 4 tasks

[Model] The ForSequenceClassification model should be controlled by override_pooler_config.

#20538 opened Jul 7, 2025 by noooop • Draft

4 tasks

DO NOT MERGE - debug needs-rebase performance

Performance-related issues

#20535 opened Jul 7, 2025 by robertgshaw2-redhat • Draft

[Model] Add AutoWeightsLoader support for BERT

#20534 opened Jul 7, 2025 by panyuhe

Loading…

Refactor: Remove numpy dependency from LoggingStatLogger v1

#20529 opened Jul 6, 2025 by skyloevil

Loading…

[Benchmark] Parameterization of streaming loading of multimodal datasets performance

Performance-related issues

#20528 opened Jul 6, 2025 by Potabk

Loading…

3 tasks done

[Third Party] Add a hook to the GPU Model Runner in Worker KV Connector start_load_kv() v1

#20524 opened Jul 6, 2025 by sammshen

Loading…

[Benchmarks] Add memory tracking to serving benchmark ci/build performance

Performance-related issues

#20519 opened Jul 6, 2025 by sfeng33

Loading…

[Bugfix] fix the block.prev_block reference - release problem

#20512 opened Jul 5, 2025 by CLFutureX

Loading…

Add reproducible prefix-cache block hashing using SHA-256 + CBOR ci/build v1

#20511 opened Jul 5, 2025 by vMaroon

Loading…

adds optional reasoning content field to ConversationMessage frontend

#20505 opened Jul 4, 2025 by arpitg1991

Loading…

4 tasks

feat: Add streaming support for Mistral v11 tool format frontend tool-calling

#20503 opened Jul 4, 2025 by sjuxax

Loading…

Previous 1 2 3 4 5 … 30 31 Next

Previous Next

ProTip! Type g i on any issue or pull request to go back to the issue listing page.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!

Uh oh!