Conversation

@SplittyDev (Contributor) commented Feb 4, 2025

What

This PR adjusts the streaming response parsing logic to gracefully deal with comments.
Supersedes #252.

Why

Server-Sent Events, the standard used by the OpenAI API and compatible APIs, supports comments by definition[^1], but the parsing code did not handle them gracefully. Instead, it raised a decoding error by trying to parse the comment as JSON.

Even though the official OpenAI API does not seem to use comments, as far as I can tell, other OpenAI-compatible APIs do use them (e.g. OpenRouter), and they are part of the standard, so they should probably be supported anyway.

Honestly, that part of the code probably needs a bigger overhaul in the future, because the parsing logic there is far from robust and far from correct. It's correct enough to handle the specific way OpenAI uses Server-Sent Events, but it ignores many parts of the spec, such as multi-line events, where multiple `data:` lines still belong to the same event.

The PR now includes a refactoring of the stream data processing code, which I think mostly addresses the concerns above. It could probably still use stricter compliance testing to make sure edge cases are handled well, but it's more than good enough for me in its current state.
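
For context, here's a minimal sketch of the line handling the spec asks for, i.e. ignoring `:` comment lines and joining multiple `data:` lines into a single event. The type and method names below are made up for illustration; this is not the actual StreamInterpreter code.

```swift
// Illustrative only: a minimal, spec-oriented SSE line handler (not the real StreamInterpreter).
struct SSEEvent {
    let data: String
}

final class MiniSSEParser {
    private var dataBuffer: [String] = []
    var onEvent: ((SSEEvent) -> Void)?

    /// Feed one decoded line of the event stream (without its trailing newline).
    func process(line: String) {
        if line.hasPrefix(":") {
            // Comment line (e.g. OpenRouter's keep-alives): ignored per the SSE spec.
            return
        }
        if line.isEmpty {
            // A blank line terminates the event: dispatch the accumulated data, if any.
            if !dataBuffer.isEmpty {
                onEvent?(SSEEvent(data: dataBuffer.joined(separator: "\n")))
                dataBuffer.removeAll()
            }
            return
        }
        if line.hasPrefix("data:") {
            // Multiple `data:` lines belong to the same event and are joined with "\n".
            var value = String(line.dropFirst("data:".count))
            if value.hasPrefix(" ") { value.removeFirst() }
            dataBuffer.append(value)
        }
        // Other SSE fields (`event:`, `id:`, `retry:`) are omitted from this sketch.
    }
}
```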

Affected Areas

Only the stream response parsing is affected.

Mentions

Huge thanks to @nezhyborets for contributing tests and refactoring!

[^1]: https://html.spec.whatwg.org/multipage/server-sent-events.html#parsing-an-event-stream

@SplittyDev (Contributor, Author) commented

@nezhyborets I've closed the other PR. I've rolled back my main to the commit you based your PR on and cherry-picked your commit. I'll have another quick look to see whether the tests have to be adjusted

@SplittyDev (Contributor, Author) commented

@nezhyborets Alright, I don't think I need to bring over any of the changes I made in my second commit. They only concern tests, and it seems you're already testing chunks and comments in your StreamInterpreter tests.

I'll bring this over to my app and make sure it properly handles some edge cases, and I'll be right back in a minute

@SplittyDev (Contributor, Author) commented

@nezhyborets Ready for review 👍 Works well in my testing with OpenRouter and OpenAI, and handles comments well in a real-world scenario.

@nezhyborets (Collaborator) left a comment

Great job, thanks!

@nezhyborets merged commit c1a8a0b into MacPaw:main Feb 4, 2025
5 checks passed
@SplittyDev (Contributor, Author) commented Feb 4, 2025

@nezhyborets Are you interested in adding further compatibility with other providers, or should this be a strict OpenAI-specific implementation?

This PR was totally in scope because it improves the implementation of the event standard used by OpenAI, but some other things I'm planning to add to my fork are not quite as clearly in scope for upstream contributions.

For example, in my fork I'm adding support for some more exotic features, such as Perplexity's citations. I can contribute that stuff upstream, but I don't know whether you want to stray that far from the OpenAI format.

Specifically, Perplexity adds a `citations` key to the response:

```jsonc
{ // example response
  "id": "410e6fb9-8325-1f25-adf8-73718ba6074d",
  "model": "sonar",
  "created": 1738684359,
  "citations": [
    "https://example.org/citation1",
    "https://example.org/citation2"
  ],
  "choices": [ /* ... */ ]
}
```

This is generally quite compatible with OpenAI, because the `citations` field can simply be optional:

```diff
  /// The model used for the chat completion.
  public let model: String
+ /// A list of citations for the completion.
+ public let citations: [String]?
```

But it might be a bit confusing for people who use the library strictly with OpenAI and expect the field to actually be populated in some cases; with OpenAI that will never happen, unless they decide to add citations in the future.
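
To illustrate the point, here's a self-contained sketch (the struct is a made-up stand-in, not the library's actual ChatResult, and the OpenAI-side values are placeholders) showing that an optional `citations` field decodes cleanly whether or not the provider sends it:

```swift
import Foundation

// Made-up stand-in type for illustration; not the library's ChatResult.
struct ChatResultSketch: Decodable {
    let id: String
    let model: String
    let citations: [String]?   // present for Perplexity, nil for providers that omit it
}

let perplexityJSON = Data("""
{"id": "410e6fb9-8325-1f25-adf8-73718ba6074d", "model": "sonar", "citations": ["https://example.org/citation1"]}
""".utf8)

// Placeholder values for an OpenAI-style response without citations.
let openAIJSON = Data("""
{"id": "chatcmpl-abc123", "model": "gpt-4o"}
""".utf8)

let withCitations = try JSONDecoder().decode(ChatResultSketch.self, from: perplexityJSON)
let withoutCitations = try JSONDecoder().decode(ChatResultSketch.self, from: openAIJSON)
// withCitations.citations == ["https://example.org/citation1"]
// withoutCitations.citations == nil
```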

Please let me know whether you're interested in adding support for stuff like that.

@nezhyborets (Collaborator) commented Feb 5, 2025

@SplittyDev thanks for the suggestion and for the willingness to contribute! The "strategy" @Krivoblotsky and I have discussed is to support different providers.

Just adding the field (with a synthesized CodingKey) might not be the best solution, since other providers may use a different key name for the same data. But I also don't see a better, more universal solution. Let's just add the field and see how it goes. You're right that adding an optional field doesn't seem to do any harm.

Would you do the PR?

@nezhyborets (Collaborator) commented Feb 5, 2025

@SplittyDev I'm also wondering whether we should mention our "support different providers" efforts in the README. Maybe something like:


> **Supporting different providers**
>
> This library supports different providers that offer an OpenAI-compatible API, like OpenRouter, Gemini, and Perplexity.
>
> We've added some provider-specific changes to provide better support:
>
> - Perplexity: `citations` field added to `ChatResult`.


It might be helpful not just for users, but also for us, to keep track of such small provider-specific additions, so that it's easier to review them later and maybe come up with something more universal.

@SplittyDev (Contributor, Author) commented

@nezhyborets Thanks for letting me know! I'll prepare some PRs as I add more features.

One thing to note is that compatibility with other providers generally isn't great. OpenRouter is well supported because it very closely mimics the OpenAI API (other than the comments in the event stream), but other providers come with challenges that have to be addressed in different ways.

Here are a few off the top of my head:

- Perplexity includes a `citations` field in responses
- Google AI Studio doesn't use `/v1`, but `/v1beta/openai`
- DeepSeek officially doesn't use `/v1`, but supports it for compatibility reasons
- Other providers might require additional headers, too

Anthropic is a whole different can of worms:

- Uses a slightly different request and response format
- Has the system prompt as a top-level string instead of a role
- Does not allow multiple assistant messages without a user message in between
- Requires special headers, without which it refuses to run
- ...and probably some more stuff I haven't even noticed yet

So far, in my fork I've added proper support for Perplexity citations, and for arbitrary headers.

Google AI Studio compatibility is a bit more complicated because its base path does not include `/v1`, which is currently hardcoded for all endpoints. Supporting this requires extensive refactoring and is possibly a breaking change, because people might already be using basePath without including `/v1`.
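
To make the idea concrete, here's a tiny sketch of the two composition styles (hypothetical names, not the library's actual URL-building code; the Google base URL is just the `/v1beta/openai` prefix mentioned above):

```swift
import Foundation

// Purely illustrative; hypothetical names, not the library's actual code.
// Today the "/v1" version segment is effectively hardcoded into each endpoint path:
let hardcodedEndpoint = "/v1/chat/completions"        // fits OpenAI and OpenRouter only

// If "/v1" moved out of the endpoint paths and into basePath, the same suffix
// could serve providers whose OpenAI-compatible API lives under a different prefix:
let endpoint = "/chat/completions"
let openAIBase = "https://api.openai.com/v1"
let googleBase = "https://generativelanguage.googleapis.com/v1beta/openai"

let openAIURL = URL(string: openAIBase + endpoint)!   // https://api.openai.com/v1/chat/completions
let googleURL = URL(string: googleBase + endpoint)!   // .../v1beta/openai/chat/completions
```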

Anthropic is even more complicated because of its different format. For Anthropic, the request and response types cannot be reused at all, and I'm not sure how that should even be addressed. I would personally opt for not supporting Anthropic, but I guess the other option is to have an Anthropic client that works similarly to the OpenAI one but uses different types. That would almost justify its own separate library, though, so I don't think it makes sense.

@nezhyborets (Collaborator) commented Feb 7, 2025

@SplittyDev I think we can go with the easiest one first, i.e. add citations for Perplexity.

As for `/v1`, you're right that it might be a breaking change. But we've actually only recently introduced basePath, so maybe it won't be that much of a problem if we move `/v1` from the endpoint paths to basePath now, before more people adopt it.

wangqi pushed a commit to wangqi/OpenAI that referenced this pull request Feb 18, 2025
…-comments-merge Support comments in streaming response
@SplittyDev deleted the feat/support-streaming-comments-merge branch February 23, 2025 14:42