[ML] Change format for Unified Chat error responses #121396

prwhelan · 2025-01-31T13:52:07Z

Unified Chat Completion error responses now forward code, type, and param to in the response payload. reason has been renamed to message.

Notes:

XContentFormattedException is a ChunkedToXContent so that the REST listener can call toXContentChunked to format the output structure. By default, the structure forwards to our existing ES exception structure.
UnifiedChatCompletionException will override the structure to match the new unified format.
The Rest, Transport, and Stream handlers all check the exception to verify it is a UnifiedChatCompletionException.
OpenAI response handler now reads all the fields in the error message and forwards them to the user.
In the event that a Throwable is a Error, we rethrow it on another thread so the JVM can catch and handle it. We also stop surfacing the JVM details to the user in the error message (but it's still logged for debugging purposes).

Unified Chat Completion error responses now forward code, type, and param to in the response payload. `reason` has been renamed to `message`.

elasticsearchmachine · 2025-01-31T13:52:55Z

Hi @prwhelan, I've created a changelog YAML for you.

elasticsearchmachine · 2025-01-31T16:50:14Z

Pinging @elastic/ml-core (Team:ML)

jonathan-buttner

Great changes Pat!

I think we'll also want to change this for EIS since it leverages the unified format as well:

https://github.com/elastic/elasticsearch/blob/18345c41ab707f2cdfcfe2fd3d942ae811f14803/x-pack/plugin/inference/src/main/java/org/elasticsearch/xpack/inference/external/http/sender/ElasticInferenceServiceUnifiedCompletionRequestManager.java

Let me know if we already have coverage for this but could we add a new integration test that spins up a mock web server to mock an openai error response? That way we can make a request to a live ES node and ensure that the response all the way back to the rest client is what we're expecting.

I'm thinking something like this: https://github.com/elastic/elasticsearch/blob/main/x-pack/plugin/inference/qa/inference-service-tests/src/javaRestTest/java/org/elasticsearch/xpack/inference/MockElasticInferenceServiceAuthorizationServer.java#L21

Which is used here: https://github.com/elastic/elasticsearch/blob/main/x-pack/plugin/inference/qa/inference-service-tests/src/javaRestTest/java/org/elasticsearch/xpack/inference/BaseMockEISAuthServerTest.java#L33

jonathan-buttner · 2025-01-31T17:08:50Z

...main/java/org/elasticsearch/xpack/core/inference/results/UnifiedChatCompletionException.java

+
+ public UnifiedChatCompletionException(RestStatus status, String message, String type, @Nullable String code, @Nullable String param) {
+ super(message, status);
+ this.message = message;


Should we add some Objects.requireNonNull for the values that are required?

jonathan-buttner · 2025-01-31T17:14:00Z

...main/java/org/elasticsearch/xpack/core/inference/results/UnifiedChatCompletionException.java

+ public static UnifiedChatCompletionException fromThrowable(Throwable t) {
+ if (t instanceof UnifiedChatCompletionException e) {
+ return e;
+ } else if (unwrapCause(t) instanceof UnifiedChatCompletionException e) {


Below we implement the unwrapCause(). It doesn't look like UnifiedChatCompletionException implements ElasticsearchWrapperException in the inheritance chain. Could we use ExceptionHelper.unwrapCause() and kind of like this:

public static UnifiedChatCompletionException fromThrowable2(Throwable t) { var unwrappedCause = ExceptionsHelper.unwrapCause(t); if (unwrappedCause instanceof UnifiedChatCompletionException e) { return e; } else { return maybeError(t).map(error -> { ... }); } }

Ah, I think this method was a holdover from when UnifiedChatCompletionException was a ElasticsearchWrapperException. We can remove it

jonathan-buttner

Thanks for the changes!

elasticsearchmachine · 2025-02-05T14:43:26Z

💔 Backport failed

Status	Branch	Result
❌	9.0	Commit could not be cherrypicked due to conflicts
❌	8.18	Commit could not be cherrypicked due to conflicts
❌	8.x	Commit could not be cherrypicked due to conflicts

You can use sqren/backport to manually backport by running backport --upstream elastic/elasticsearch --pr 121396

Unified Chat Completion error responses now forward code, type, and param to in the response payload. `reason` has been renamed to `message`. Notes: - `XContentFormattedException` is a `ChunkedToXContent` so that the REST listener can call `toXContentChunked` to format the output structure. By default, the structure forwards to our existing ES exception structure. - `UnifiedChatCompletionException` will override the structure to match the new unified format. - The Rest, Transport, and Stream handlers all check the exception to verify it is a UnifiedChatCompletionException. - OpenAI response handler now reads all the fields in the error message and forwards them to the user. - In the event that a `Throwable` is a `Error`, we rethrow it on another thread so the JVM can catch and handle it. We also stop surfacing the JVM details to the user in the error message (but it's still logged for debugging purposes).

[ML] Change format for Unified Chat

95099a7

Unified Chat Completion error responses now forward code, type, and param to in the response payload. `reason` has been renamed to `message`.

prwhelan added >bug :ml Machine learning Team:ML Meta label for the ML team auto-backport Automatically create backport pull requests when merged v9.0.0 v8.18.0 labels Jan 31, 2025

elasticsearchmachine added the v9.1.0 label Jan 31, 2025

Update docs/changelog/121396.yaml

477afbf

elasticsearchmachine and others added 2 commits January 31, 2025 13:58

[CI] Auto commit changes from spotless

564fd3c

Merge branch 'main' into inference/openai-error-2

924253a

prwhelan marked this pull request as ready for review January 31, 2025 16:49

jonathan-buttner added the v8.19.0 label Jan 31, 2025

jonathan-buttner reviewed Jan 31, 2025

View reviewed changes

jonathan-buttner changed the title ~~[ML] Change format for Unified Chat~~ [ML] Change format for Unified Chat error responses Jan 31, 2025

prwhelan added 2 commits February 3, 2025 17:37

address comments

8bdd23e

Merge branch 'main' into inference/openai-error-2

6afef65

jonathan-buttner approved these changes Feb 4, 2025

View reviewed changes

Merge branch 'main' into inference/openai-error-2

8a9ce38

prwhelan enabled auto-merge (squash) February 5, 2025 13:53

prwhelan merged commit ad00113 into elastic:main Feb 5, 2025
17 checks passed

elasticsearchmachine added the backport pending label Feb 5, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[ML] Change format for Unified Chat error responses #121396

[ML] Change format for Unified Chat error responses #121396

Uh oh!

prwhelan commented Jan 31, 2025

elasticsearchmachine commented Jan 31, 2025

elasticsearchmachine commented Jan 31, 2025

jonathan-buttner left a comment

jonathan-buttner Jan 31, 2025

jonathan-buttner Jan 31, 2025

prwhelan Feb 3, 2025

jonathan-buttner left a comment

Uh oh!

elasticsearchmachine commented Feb 5, 2025

Labels

3 participants

[ML] Change format for Unified Chat error responses #121396

[ML] Change format for Unified Chat error responses #121396

Uh oh!

Conversation

prwhelan commented Jan 31, 2025

elasticsearchmachine commented Jan 31, 2025

elasticsearchmachine commented Jan 31, 2025

jonathan-buttner left a comment

Choose a reason for hiding this comment

jonathan-buttner Jan 31, 2025

Choose a reason for hiding this comment

jonathan-buttner Jan 31, 2025

Choose a reason for hiding this comment

prwhelan Feb 3, 2025

Choose a reason for hiding this comment

jonathan-buttner left a comment

Choose a reason for hiding this comment

Uh oh!

elasticsearchmachine commented Feb 5, 2025

💔 Backport failed

Labels

3 participants