Add IT for num_reduced_phases with batched query execution #134312

benchaplin · 2025-09-08T17:26:35Z

The 120_batch_reduce_size.yml test was skipped with a TODO when batched query execution was introduced in #121885. A key piece of the test was an assertion on num_reduce_phases in the search response. That assertion was removed because num_reduce_phases now depends on how shards are laid out in the cluster, which may change across test runs.

To know how many reductions occurred, we need to understand the shard layout:

- How many shards are on the coordinating node (these won't be batched)?
- How many shards are on each data node (these will be batched)?

We can do this in an IT that captures the batched transport requests then derives the layout from them. I've added this IT, and removed the skip on the YAML test. I'd hear an argument for removing the YAML test completely, but it still covers some validation on the batched_reduce_size query parameter so I'm leaning towards keeping it.
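As a rough illustration of the layout question above, the following minimal sketch splits shards into those on the coordinating node and those on each other data node. It is not the code from this PR: the class `ShardLayoutSketch`, its method names, and the representation of the layout as a list of node IDs (element i is the node holding shard i) are all assumptions made for the example.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical helper, not part of Elasticsearch: derives a simple shard layout summary. */
final class ShardLayoutSketch {

    /** Counts shards held by the coordinating node; per the description, these are not batched. */
    static long localShardCount(List<String> shardNodeIds, String coordinatingNodeId) {
        return shardNodeIds.stream().filter(coordinatingNodeId::equals).count();
    }

    /** Counts shards per non-coordinating data node; per the description, these are batched. */
    static Map<String, Integer> batchedShardsPerNode(List<String> shardNodeIds, String coordinatingNodeId) {
        Map<String, Integer> perNode = new LinkedHashMap<>();
        for (String nodeId : shardNodeIds) {
            if (!nodeId.equals(coordinatingNodeId)) {
                // Increment the count for this data node, starting at 1.
                perNode.merge(nodeId, 1, Integer::sum);
            }
        }
        return perNode;
    }

    public static void main(String[] args) {
        // Illustrative layout only: five shards spread over three made-up nodes.
        List<String> layout = List.of("node-a", "node-b", "node-b", "node-c", "node-a");
        System.out.println("local shards: " + localShardCount(layout, "node-a"));
        System.out.println("batched shards per node: " + batchedShardsPerNode(layout, "node-a"));
    }
}
```

How these counts translate into an expected num_reduce_phases is left to the IT itself; the sketch only shows the bookkeeping.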

elasticsearchmachine · 2025-09-08T17:26:59Z

Pinging @elastic/es-search-foundations (Team:Search Foundations)

drempapis · 2025-09-12T06:42:49Z

server/src/main/java/org/elasticsearch/action/search/SearchQueryThenFetchAsyncAction.java

 return searchRequest.indicesOptions();
 }
+
+ public List<ShardToQuery> shards() {


Consider limiting this to package-private scope or exposing only shards().size(), as the test doesn’t need the complete list.

javanna · 2025-09-15

++ Another way to do this would be to inspect the indices service in the test itself and extract how many nodes there are and how many shards each node holds from there, rather than from the request.

javanna · 2025-09-15

Yet another way to go about this test is to run it in a more controlled scenario, along the same lines as Dimi suggested above: decide upfront how many shards and nodes, and make the execution more predictable that way.

benchaplin · 2025-09-16

Good call, thanks @drempapis - reduced to package-private and just the size.

benchaplin · 2025-09-16

And thanks @javanna, I agree this is cleaner without the request-intercepting code. I've reworked the test a bit to deduce how many shards aren't batched from cluster state.

server/src/internalClusterTest/java/org/elasticsearch/action/search/BatchedQueryPhaseIT.java

javanna · 2025-09-15T10:48:44Z

rest-api-spec/src/yamlRestTest/resources/rest-api-spec/test/search/120_batch_reduce_size.yml

@@ -1,7 +1,4 @@
 setup:
- - skip:
- awaits_fix: "TODO fix this test, the response with batched execution is not deterministic enough for the available matchers"


Can we just get away with unskipping this? I was under the impression that it's going to fail. Or does it run in a controlled scenario where the result is predictable?

benchaplin · 2025-09-15

If you take a look at #121885, the line that would make this test fail was removed. Now it essentially just tests batched_reduce_size validation. I'm not sure if it's worth keeping, what do you think?

javanna · 2025-09-17

I see, thanks. I'd keep it. Can we have some simpler check on num_reduce_phases, like greater than some threshold that's easier to predict?

benchaplin · 2025-09-17

I don't think we can assume anything about num_reduce_phases now: if it's 1, it's left out of the response entirely, and it can be 1 if all shards are batched.
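To make the point about the omitted field concrete, here is a minimal sketch of how a caller might read num_reduce_phases while treating an absent field as 1. The class `ReducePhasesSketch` and the choice to model the parsed response as a plain `Map<String, Object>` are assumptions for illustration, not an Elasticsearch API.

```java
import java.util.Map;

/** Hypothetical helper, not part of Elasticsearch: reads num_reduce_phases from a parsed response. */
final class ReducePhasesSketch {

    /**
     * Returns the value of num_reduce_phases, or 1 when the field is absent,
     * since the thread notes the field is left out of the response when it is 1.
     */
    static int numReducePhases(Map<String, Object> response) {
        Object value = response.get("num_reduce_phases");
        return value == null ? 1 : ((Number) value).intValue();
    }

    public static void main(String[] args) {
        // Made-up response bodies, reduced to the fields relevant here.
        System.out.println(numReducePhases(Map.of("took", 3)));
        System.out.println(numReducePhases(Map.of("took", 3, "num_reduce_phases", 4)));
    }
}
```

This is why a "greater than some threshold" matcher is awkward in the YAML test: the field may simply not be there to match against.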

javanna

nice work, thanks!

benchaplin added 2 commits September 8, 2025 13:16

Add IT for num_reduce_phases coverage in batched

ddbe410

Merge branch 'main' into batched_add_it_num_reduce_phases

dfe508e

benchaplin added the >test, Team:Search Foundations, and :Search Foundations/Search labels Sep 8, 2025

benchaplin mentioned this pull request Sep 5, 2025

[Meta] Batched Query Phase Follow-up Tasks #125788

Open

6 tasks

elasticsearchmachine added the v9.2.0 label Sep 8, 2025

drempapis reviewed Sep 12, 2025

server/src/internalClusterTest/java/org/elasticsearch/action/search/BatchedQueryPhaseIT.java (resolved)

javanna reviewed Sep 15, 2025

benchaplin and others added 7 commits September 16, 2025 08:19

Correct assertion, reduce scope of field exposed in NodeQueryRequest

c7a564a

Merge branch 'main' into batched_add_it_num_reduce_phases

c95621c

Deduce batched shards from cluster state

15f2c13

Remove new getter - no longer needed

722f620

[CI] Auto commit changes from spotless

452884e

Typo

eb5028a

Typo

4f2abf8

javanna approved these changes Sep 17, 2025

benchaplin added 3 commits September 17, 2025 12:29

Add assertion that num_reduce_phases >= 1

6afe28a

Revert "Add assertion that num_reduce_phases >= 1"

eccb29d

This reverts commit 6afe28a.

Merge branch 'main' into batched_add_it_num_reduce_phases

74a4f50

benchaplin merged commit 81d63bc into elastic:main Sep 17, 2025
34 checks passed

mridula-s109 pushed a commit to mridula-s109/elasticsearch that referenced this pull request Sep 17, 2025

Add IT for num_reduced_phases with batched query execution (elastic#1…

a92841f

…34312)

gmjehovich pushed a commit to gmjehovich/elasticsearch that referenced this pull request Sep 18, 2025

Add IT for num_reduced_phases with batched query execution (elastic#1…

9b9d5c5

…34312)
