Conversation

@kosabogi (Contributor) commented on Oct 13, 2025

This PR adds the new `long_document_strategy` and `max_chunks_per_doc` parameters to the `service_settings` object in the Create an Elasticsearch inference endpoint documentation.

It also updates the description of the `chunking_settings` object to clarify that this setting only applies to the `sparse_embedding` and `text_embedding` task types.
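
For illustration, an endpoint that opts into chunking might be created as sketched below (the model ID `.rerank-v1` is assumed here to be the Elastic reranker, and the numeric values are illustrative, not documented defaults):

```jsonc
// PUT _inference/rerank/my-rerank-endpoint
// Sketch of the request body; model_id and the numeric values are illustrative.
{
  "service": "elasticsearch",
  "service_settings": {
    "model_id": ".rerank-v1",          // assumed Elastic reranker model ID
    "num_allocations": 1,
    "num_threads": 1,
    "long_document_strategy": "chunk", // new parameter added by this PR
    "max_chunks_per_doc": 10           // new parameter added by this PR
  }
}
```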

Related issue: #5451

github-actions (bot) commented on Oct 13, 2025

Below you can find the validation changes against the target branch for the APIs.

| API | Status | Request | Response |
| --- | --- | --- | --- |
| index | 🟢 | 1445/1445 → 1443/1443 | 1447/1447 → 1445/1445 |
| indices.create | 🔴 | 1378/1402 → 1385/1409 | 1402/1402 → 1409/1409 |
| indices.refresh | 🟢 | 329/329 → 327/327 | 329/329 → 327/327 |
| ml.get_job_stats | 🟢 | 30/30 → 29/29 | 30/30 → 29/29 |
| ml.put_job | 🟢 | 65/65 → 64/64 | 65/65 → 64/64 |

You can validate these APIs yourself by using the `make validate` target.

@pquentin pquentin changed the title [9.2] Adds new parameters to the elasticsearch inference API for the rerank task type Adds new parameters to the elasticsearch inference API for the rerank task type Oct 14, 2025
```ts
   */
  num_threads: integer
  /**
   * Only for the `rerank` task type.
```
A Member commented:

A quick clarification: for 9.2, these two values are only configurable for rerank endpoints that use the Elastic reranker model.

```diff
 body: {
   /**
-   * The chunking configuration object.
+   * The chunking configuration object. For the `rerank` task type, you can enable chunking by setting the `long_document_strategy` parameter to `chunk` in the `service_settings` object.
```
A Member commented:

I'm not sure whether we need to be more specific about this anywhere, but with this new method of chunking the user cannot set `chunking_settings` the way they would for embeddings; we build the chunking settings for them. If we want to clarify somewhere how we build those settings, we can.

```ts
   *
   * Possible values:
   * - `truncate` (default): Processes only the beginning of each document.
   * - `chunk`: Splits long documents into smaller parts (chunks) before inference.
```
A Member commented:

I'm not sure where it's best to clarify this, but with chunking enabled we still return a single score per document (the same as for truncating), with that score corresponding to the highest score of any of the document's chunks. I just want to make it clear that the structure of the response will not change, only the rerank relevance scores.
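
To illustrate that point, a rerank response might look the same under either strategy (a sketch; the index values and scores are made up):

```jsonc
// Sketch of a rerank response; indices and scores are illustrative.
// With long_document_strategy set to chunk, each relevance_score is the
// highest score among that document's chunks, but the shape is unchanged.
{
  "rerank": [
    { "index": 2, "relevance_score": 0.87 },
    { "index": 0, "relevance_score": 0.35 },
    { "index": 1, "relevance_score": 0.12 }
  ]
}
```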

@kosabogi kosabogi requested a review from davidkyle October 16, 2025 13:28
@davidkyle (Member) left a comment:


LGTM
