Conversation


@WoutDeRijck commented Oct 7, 2025

Purpose

Fix a serialization bug in streaming responses where Pydantic field aliases (e.g. the schema_ field's alias schema) were not preserved during .model_dump() calls.

This caused the "schema_" key to appear instead of "schema" in streamed response events for JSON schema output formats, breaking compatibility with the OpenAI SDK’s ResponseFormatTextJSONSchemaConfig parsing.

Related issue: vllm-project/vllm#26288

Root Cause

  • ResponsesResponse.from_request(...).model_dump() was called without by_alias=True at:
    • vllm/entrypoints/openai/serving_responses.py:1830
    • vllm/entrypoints/openai/serving_responses.py:1879
  • Without by_alias=True, Pydantic outputs internal field names (e.g. schema_) instead of their aliases (schema), causing validation errors downstream; see the sketch after this list.
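
For illustration, a minimal sketch of the Pydantic behavior at fault, using a hypothetical model rather than vLLM's actual class:

    from pydantic import BaseModel, ConfigDict, Field

    class JSONSchemaConfig(BaseModel):
        # "schema" shadows a BaseModel attribute, so the field is named
        # schema_ internally and exposed through the alias "schema".
        model_config = ConfigDict(populate_by_name=True)
        schema_: dict = Field(alias="schema")

    cfg = JSONSchemaConfig(schema={"type": "object"})
    print(cfg.model_dump())               # {'schema_': {'type': 'object'}}  <- the bug
    print(cfg.model_dump(by_alias=True))  # {'schema': {'type': 'object'}}   <- expected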

Fix

Add by_alias=True to both .model_dump() calls so serialized responses use the correct alias names consistent with OpenAI schema expectations.

    # Before
    initial_response = ResponsesResponse.from_request(...).model_dump()

    # After
    initial_response = ResponsesResponse.from_request(...).model_dump(by_alias=True)

and

response=final_response.model_dump(by_alias=True)

Test Plan

  1. Setup

    • vllm==0.11.0
    • openai==1.108.0
  2. Reproduce the Bug (before fix)

    stream = await client.responses.create(
        model=model,
        input=formatted_prompt,
        text={
            "format": {
                "name": "schema_ner",
                "schema": json_schema,
                "type": "json_schema",
                "strict": True,
            }
        },
        stream=True,
    )

    Observe that the first streamed event includes "schema_" instead of "schema" (a full repro sketch follows this test plan).

  3. Apply the Fix

    • Add by_alias=True in both .model_dump() calls.
    • Rebuild and rerun the same request.
  4. Expected Behavior

    • Streamed events now correctly include "schema" key.
    • No validation error occurs when parsing through OpenAI SDK or FastAPI’s Pydantic model.
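
Full repro sketch referenced in step 2 (the endpoint, model name, and json_schema below are placeholders; assumes a vLLM OpenAI-compatible server on localhost:8000):

    import asyncio
    from openai import AsyncOpenAI

    async def main():
        client = AsyncOpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")
        json_schema = {
            "type": "object",
            "properties": {"entities": {"type": "array", "items": {"type": "string"}}},
            "required": ["entities"],
        }
        stream = await client.responses.create(
            model="openai/gpt-oss-20b",  # placeholder model name
            input="Extract entities: vLLM streams structured output.",
            text={"format": {"name": "schema_ner", "schema": json_schema,
                             "type": "json_schema", "strict": True}},
            stream=True,
        )
        async for event in stream:
            # Before the fix, this first event carries "schema_" instead of "schema".
            print(event)
            break

    asyncio.run(main())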

Test Result

Before fix

  • Streaming response JSON contained "schema_"
  • Validation failed with missing "schema" field

After fix

  • Streaming response JSON correctly uses "schema"
  • Validation passes
  • Structured outputs parse successfully in both streaming and non-streaming modes

Example (after fix):

{ "text": { "format": { "name": "schema_ner", "schema": { ... }, "type": "json_schema", "strict": true } } }

The fix is effective; without it, the following errors persist:

    (APIServer pid=7) | response.text.format.ResponseFormatTextJSONSchemaConfig.schema
    (APIServer pid=7) |   Field required [type=missing, input_value={'name': 'schema_ner', 's...': None, 'strict': True}, input_type=dict]


Essential Elements of an Effective PR Description Checklist
  • Purpose of the PR
  • Test plan provided
  • Test results before and after
  • Links to related issue(s)
  • (Optional) Documentation update — not required
  • (Optional) Release notes update — internal behavioral fix only

BEFORE SUBMITTING: see vLLM contributing guide

Signed-off-by: WoutDeRijck <derijck.2001@icloud.com>
Contributor

@gemini-code-assist (bot) left a comment


Code Review

This pull request aims to fix a serialization bug in streaming responses where Pydantic field aliases were not being used. The provided change correctly addresses this for the response.created event by adding by_alias=True to the model_dump() call. However, the fix is incomplete. A similar issue persists for the response.completed event, as the final_response object is not serialized with the correct alias settings before being sent. I've left a critical comment detailing the necessary change to fully resolve the bug.

@WoutDeRijck
Author

Labels: structured output, streaming

Contributor

@qandrew left a comment


hi @WoutDeRijck , thanks for looking into this! Could you add a unit test in https://github.com/vllm-project/vllm/blob/main/tests/entrypoints/openai/test_response_api_with_harmony.py so we can prevent this behavior in the future?

Signed-off-by: WoutDeRijck <derijck.2001@icloud.com>
@WoutDeRijck
Author

Hi @qandrew, I've added the unit test!
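
For reference, a sketch of the kind of assertion such a test might make (the actual test in test_response_api_with_harmony.py may differ):

    def assert_schema_alias_preserved(event_payload: dict) -> None:
        # Streamed payloads should expose the alias "schema",
        # never the internal field name "schema_".
        fmt = event_payload["response"]["text"]["format"]
        assert "schema" in fmt
        assert "schema_" not in fmt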

type="response.completed",
sequence_number=-1,
response=final_response,
response=final_response.model_dump(by_alias=True),

@WoutDeRijck could you verify that this doesn't break serialization? I just added a PR to revert the model_dump :P not sure if by_alias will cause a different behavior? #26185
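
One way to sanity-check that by_alias=True doesn't break serialization is a round-trip (a hypothetical check, not part of this PR):

    # Pydantic accepts aliases on input by default, so re-validating the
    # alias-keyed dump should reproduce the original model.
    dumped = final_response.model_dump(by_alias=True)
    assert ResponsesResponse.model_validate(dumped) == final_response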


Labels

frontend, gpt-oss (Related to GPT-OSS models)
