
@HaloWorld (Contributor) commented Dec 17, 2025

GPT-OSS models using the harmony format were ignoring the tool_choice="none" parameter and could still trigger tool calls when tools were provided in the request. This issue arose because the _make_request_with_harmony method only checked for the existence of request.tools, without accounting for the tool_choice setting or the exclude_tools_when_tool_choice_none flag.
This fix ensures that harmony models respect the exclude_tools_when_tool_choice_none flag, aligning their behavior with other model types and OpenAI API standards. When tool_choice="none" and the flag is enabled, tool definitions are no longer included in the system message.

Purpose

Modified _make_request_with_harmony to incorporate checks for tool_choice and exclude_tools_when_tool_choice_none.

Test Plan

  • Start the vLLM server with `--exclude-tools-when-tool-choice-none`:

```shell
vllm serve /aifs4su/yujiepu/models/openai/gpt-oss-20b/ \
  --served-model-name gpt-oss-20b \
  --exclude-tools-when-tool-choice-none \
  --max-model-len 8192 \
  --tool-call-parser openai \
  --enable-auto-tool-choice \
  --trust-remote-code \
  --max-cudagraph-capture-size 4
```
  • Send a tool-calling request with tool_choice="none":

```shell
curl -s -X POST "http://localhost:8000/v1/chat/completions" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-oss-20b",
    "messages": [
      {"role": "user", "content": "What is the weather in Dallas, TX?"}
    ],
    "tools": [
      {
        "type": "function",
        "function": {
          "name": "get_current_weather",
          "description": "Get the current weather in a given location",
          "parameters": {
            "type": "object",
            "properties": {
              "city": {"type": "string"},
              "state": {"type": "string"},
              "unit": {"type": "string", "enum": ["celsius", "fahrenheit"]}
            },
            "required": ["city", "state", "unit"]
          }
        }
      }
    ],
    "tool_choice": "none"
  }'
```

Test Result

  • Before: (screenshot)
  • After: (screenshot)

@gemini-code-assist (bot) left a comment


Code Review

This pull request addresses a bug where tool_choice="none" was being ignored for GPT-OSS/harmony models. The changes correctly modify _make_request_with_harmony to respect the exclude_tools_when_tool_choice_none flag, preventing tool definitions from being included in the prompt when tool_choice is set to "none". The logic is sound, and a new test case has been added to verify the fix. The changes appear correct and improve consistency. I have no major concerns.

@github-actions

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs do not trigger a full CI run by default. Instead, only the fastcheck CI runs, which covers a small, essential subset of CI tests to quickly catch errors.

You can ask your reviewers to trigger select CI tests on top of fastcheck CI.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either add the ready label to the PR or enable auto-merge.

If you have any questions, please reach out to us on Slack at https://slack.vllm.ai.

🚀

@chaunceyjiang (Collaborator) left a comment


The Responses API should also support this.

@chaunceyjiang chaunceyjiang self-assigned this Dec 18, 2025

mergify bot commented Dec 18, 2025

Hi @HaloWorld, the pre-commit checks have failed. Please run:

```shell
uv pip install pre-commit
pre-commit install
pre-commit run --all-files
```

Then, commit the changes and push to your branch.

For future commits, pre-commit will run automatically on changed files before each commit.

Tip

Is mypy or markdownlint failing?
mypy and markdownlint are run differently in CI. If the failure is related to either of these checks, please use the following commands to run them locally:
```shell
# For mypy (substitute "3.10" with the failing version if needed)
pre-commit run --hook-stage manual mypy-3.10
# For markdownlint
pre-commit run --hook-stage manual markdownlint
```
@HaloWorld (Contributor, Author)

> The Responses API should also support this.

Thank you for the feedback. You're absolutely right that the feature should ideally be consistent across our APIs.

Currently, the OpenAIServingResponses class does not have direct support for the exclude_tools_when_tool_choice_none parameter (unlike OpenAIServingChat). Furthermore, the existing Responses API implementation for Harmony currently only supports tool_choice='auto', as seen in serving_responses.py (L576-L584).

Given this context and to adhere to the principle of minimal changes for this PR, would it be acceptable to add support for the Responses API in a separate, follow-up PR? This would allow us to land the core logic for the Chat API promptly while giving due attention to properly extending the Responses API.
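For concreteness, the Responses-side restriction mentioned above amounts to a guard of roughly this shape. This is a hypothetical sketch with a made-up function name, not the actual serving_responses.py code:

```python
def validate_harmony_responses_tool_choice(tool_choice: str) -> None:
    # The harmony Responses path currently accepts only "auto".
    # Supporting tool_choice="none" (plus the exclude flag) would mean
    # relaxing this check and filtering tools before building the prompt.
    if tool_choice != "auto":
        raise ValueError(
            f"tool_choice={tool_choice!r} is not supported for harmony "
            "models via the Responses API"
        )
```

A follow-up PR extending the Responses API would replace this hard rejection with the same exclude-tools logic the Chat path now uses.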

Please let me know your thoughts. I'm happy to proceed either way.

@chaunceyjiang (Collaborator) left a comment


Thanks~

@github-project-automation github-project-automation bot moved this from To Triage to Ready in gpt-oss Issues & Enhancements Dec 18, 2025
@chaunceyjiang chaunceyjiang added the ready ONLY add when PR is ready to merge/full CI is needed label Dec 18, 2025
@HaloWorld (Contributor, Author)

@chaunceyjiang Hi, thank you for reviewing and approving the changes. I've noticed that the CI run has failed on one specific test case:

FAILED entrypoints/openai/test_response_api_with_harmony.py::test_code_interpreter[openai/gpt-oss-20b] - AssertionError: assert '5846' in '9648' 
(screenshot)

Interestingly, I wasn't able to reproduce this failure locally when running the same test, and it passes on my machine.

(screenshot)

I'd be happy to provide any additional logs or information that might help investigate the discrepancy between the CI environment and my local setup.

HaloWorld and others added 2 commits December 18, 2025 20:47
Signed-off-by: yujiepu <pyjapple@gmail.com>
Co-authored-by: Chauncey <chaunceyjiang@gmail.com>
Signed-off-by: PlatinumGod <pyjapple@gmail.com>
Signed-off-by: yujiepu <pyjapple@gmail.com>
Signed-off-by: yujiepu <pyjapple@gmail.com>
@chaunceyjiang chaunceyjiang merged commit 6a09612 into vllm-project:main Dec 19, 2025
47 checks passed
zRzRzRzRzRzRzR pushed a commit to zRzRzRzRzRzRzR/vllm that referenced this pull request Dec 19, 2025
…ls (vllm-project#30867) Signed-off-by: yujiepu <pyjapple@gmail.com> Signed-off-by: PlatinumGod <pyjapple@gmail.com> Co-authored-by: Chauncey <chaunceyjiang@gmail.com> Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>

Labels

  • frontend
  • gpt-oss — Related to GPT-OSS models
  • ready — ONLY add when PR is ready to merge/full CI is needed

2 participants