[ML] Partial fix for deployment disappearing #137216

jonathan-buttner · 2025-10-27T18:35:59Z

WIP

This PR is just to show the changes I made to be able to test the issue here: #137134

To make the reproduction faster I temporarily changed the code to allow the times to be shorter:

PUT /_cluster/settings { "persistent": { "xpack.ml.trained_models.adaptive_allocations.scale_to_zero_time": "10s", "xpack.ml.trained_models.adaptive_allocations.scale_up_cooldown_time": "10s", "logger.org.elasticsearch.xpack.ml.inference.assignment": "DEBUG" } }

Then we can follow the steps in the issue to reproduce, which are:

Create deployment via creating inference endpoint

PUT _inference/rerank/mytest-old { "service": "elasticsearch", "service_settings": { "num_threads": 1, "model_id": ".rerank-v1", "adaptive_allocations": { "enabled": true, "min_number_of_allocations": 0, "max_number_of_allocations": 2 } } }

Wait for mytest-old to scale to zero ~10 seconds

GET _ml/trained_models/_stats

Create a new deployment via inference endpoint, mytest-old should still exist, but it will have an allocation which is not intended.

PUT _inference/rerank/mytest-new3 { "service": "elasticsearch", "service_settings": { "num_threads": 1, "model_id": ".rerank-v1", "adaptive_allocations": { "enabled": true, "min_number_of_allocations": 0, "max_number_of_allocations": 2 } } }

GET _ml/trained_models/_stats

elasticsearchmachine · 2025-10-27T18:36:40Z

Hi @jonathan-buttner, I've created a changelog YAML for you.

Partial fix for deployment disappearing

0f14434

jonathan-buttner added >bug :ml Machine learning Team:ML Meta label for the ML team v9.3.0 labels Oct 27, 2025

jonathan-buttner mentioned this pull request Oct 27, 2025

[ML] Old trained model deployment got deleted unexpectedly after a new one is added through inference API #137134

Open

Update docs/changelog/137216.yaml

e1b519c

[CI] Auto commit changes from spotless

6f4c50b

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[ML] Partial fix for deployment disappearing #137216

[ML] Partial fix for deployment disappearing #137216

jonathan-buttner commented Oct 27, 2025 •

edited

Loading

elasticsearchmachine commented Oct 27, 2025

Labels

2 participants

Uh oh!

[ML] Partial fix for deployment disappearing #137216

Are you sure you want to change the base?

[ML] Partial fix for deployment disappearing #137216

Conversation

jonathan-buttner commented Oct 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

elasticsearchmachine commented Oct 27, 2025

Labels

2 participants

jonathan-buttner commented Oct 27, 2025 •

edited

Loading