Fix bug of scan aggregate index returning empty non-end continuation #3397

pengpeng-lu · 2025-06-13T03:22:22Z

Fixes issue: #3206

Instead of returning the innerContinuation byte array, serializes into a protobuf, with an isEnd flag.

There will be 2 steps:

still serializes to the old format, being able to deserialize from both old and new format;
serializes to the new format.

This fixes #3206

pengpeng-lu · 2025-06-17T03:49:52Z

yaml-tests/src/test/resources/aggregate-index-tests-count.yamsql

 - query: select count(col1) from t2
 - explain: "ISCAN(MV5 <,>) | MAP (_ AS _0) | AGG (count(_._0.COL1) AS _0) | ON EMPTY NULL | MAP (coalesce_long(_._0._0, promote(0l AS LONG)) AS _0)"
- # Cannot run with FORCE_CONTINUATIONS due to: https://github.com/FoundationDB/fdb-record-layer/issues/3206
- - maxRows: 0


These 2 were also mis-categorized to issue 3206.

fdb-record-layer-core/src/main/java/com/apple/foundationdb/record/ExecuteProperties.java

alecgrieser

Sorry for the delay in getting to reviewing this!

fdb-record-layer-core/src/main/java/com/apple/foundationdb/record/ExecuteProperties.java

...re/src/main/java/com/apple/foundationdb/record/provider/foundationdb/KeyValueCursorBase.java

fdb-record-layer-core/src/main/proto/record_cursor.proto

...-core/src/main/java/com/apple/foundationdb/record/query/plan/plans/RecordQueryIndexPlan.java

...re/src/test/java/com/apple/foundationdb/record/provider/foundationdb/KeyValueCursorTest.java

...m/apple/foundationdb/record/provider/foundationdb/indexes/MultidimensionalIndexTestBase.java

...ava/com/apple/foundationdb/record/query/plan/plans/RecordQueryIndexPlanWithOverScanTest.java

...al-core/src/main/java/com/apple/foundationdb/relational/recordlayer/RecordLayerIterator.java

...-core/src/main/java/com/apple/foundationdb/record/query/plan/plans/RecordQueryIndexPlan.java

alecgrieser

We had a discussion offline, and I think we've converged on a strategy here.

As @pengpeng-lu points out, there are a few too many places where the key value cursor is created directly. We could theoretically limit the fix to just the two plan types, and have them manage wrapping and unwrapping the cursor, but then all users of the KeyValueCursor would have to worry about scans over single key ranges leading to an empty continuation on the first element.

For that reason, we're thinking that we have to go with an approach where the KeyValueCursor is the one that can accept either an old or new style continuation, and it just has to parse it to see if it fits. However, this is dangerous as some old continuations could be parsed as a new continuation. To avoid that, we want to insert a magic number into the new protobuf continuation, and then we will validate that the number exactly matches. Any continuation without the magic number must then be from an older instance even if it parses correctly.

If we take this approach, we can "just" put this into the KeyValueCursor, and then remove the changes to the plan serialization. Step one will put out a version that can read the new continuations. Then the next version, we make the new continuations by default. A future version can then remove the compatibility mode.

This is actually pretty close to the original role out plan, but we think that having the extra magic number check makes us less likely to misinterpret a continuation, which can lead to reading incorrect data.

This adds test cases to cover situations where we might have a byte array that could plausibly be either a `Tuple` or a `Protobuf` message. It enumerates through the `Tuple` types, and in each case, it either identifies why the `Tuple` code could not be a valid message, or it constructs a case that could. The upshot is that it's clear that parsing as Protobuf without an error is not enough to say that something was definitely a Protobuf, though the stars do kind of need to align for this to happen. Part of my hope here is that we can use this reference whenever this kind of situation comes up. For example, in our current thinking for FoundationDB#3397, we'll want to employ a strategy of guessing the origin of a byte string. If we want to test what happens if we start with a `Tuple` and then try to parse it as a Protobuf, this could be used to inform such a test case.

...re/src/main/java/com/apple/foundationdb/record/provider/foundationdb/KeyValueCursorBase.java

...-core/src/main/java/com/apple/foundationdb/record/query/plan/plans/RecordQueryIndexPlan.java

...r-core/src/main/java/com/apple/foundationdb/record/query/plan/plans/RecordQueryScanPlan.java

...re/src/main/java/com/apple/foundationdb/record/provider/foundationdb/KeyValueCursorBase.java

yaml-tests/src/test/resources/disabled-planner-rewrites/aggregate-index-tests.yamsql

...re/src/test/java/com/apple/foundationdb/record/provider/foundationdb/KeyValueCursorTest.java

...re/src/main/java/com/apple/foundationdb/record/provider/foundationdb/KeyValueCursorBase.java

...-core/src/main/java/com/apple/foundationdb/record/query/plan/plans/RecordQueryIndexPlan.java

alecgrieser

Okay, LGTM. I left one comment about how I still think it would be good to remove PrefixRemovingContinuation and just use KeyValueCursorBase.Continuation, but I'm also fine with how it is in the PR now if we want to hold off on that. The only remaining thing is that as far as I can tell, there's still one test case that was removed from disabled-planner-rewrites/aggregate-index-tests.yamsql, and it seems like we should add it back

yaml-tests/src/test/resources/disabled-planner-rewrites/aggregate-index-tests.yamsql

pengpeng-lu added 15 commits May 21, 2025 10:57

save

387a648

save

0fd847e

save

3ce0259

save

78bea95

save

aefa82c

save

3fe7ba0

save

1336975

clean

44a5cb9

save

cb40b88

Merge branch 'main' into keyvalue_cursor

d82d31c

add planner configuration

8d523c1

save

d015132

save

22a6e85

revert PlannerConfiguration change:

f30c3e3

add test

cc9567b

pengpeng-lu added the bug fix Change that fixes a bug label Jun 17, 2025

pengpeng-lu changed the title ~~Keyvalue cursor~~ Fix bug of scan aggregate index returning empty non-end continuation Jun 17, 2025

pengpeng-lu commented Jun 17, 2025

View reviewed changes

pengpeng-lu added 2 commits June 16, 2025 20:53

style

09907fe

checkstyle

8343661

pengpeng-lu marked this pull request as ready for review June 17, 2025 05:29

pengpeng-lu requested a review from alecgrieser June 17, 2025 05:29

pengpeng-lu commented Jun 17, 2025

View reviewed changes

fdb-record-layer-core/src/main/java/com/apple/foundationdb/record/ExecuteProperties.java Outdated Show resolved Hide resolved

alecgrieser requested changes Jul 8, 2025

View reviewed changes

pengpeng-lu added 6 commits July 10, 2025 15:24

small things

30cf435

implementation comments

1f27b7e

save

61ef12b

save

51e3746

save

c0c8e72

style

b151ab4

pengpeng-lu added 2 commits August 25, 2025 20:26

throw ex when error parsing

5866b52

serialize mode in plans

bc2f064

pengpeng-lu requested a review from alecgrieser August 28, 2025 07:48

alecgrieser requested changes Sep 9, 2025

View reviewed changes

...-core/src/main/java/com/apple/foundationdb/record/query/plan/plans/RecordQueryIndexPlan.java Show resolved Hide resolved

alecgrieser mentioned this pull request Sep 9, 2025

Add new KeySpacePath.exportAllData #3566

Merged

alecgrieser requested changes Sep 12, 2025

View reviewed changes

pengpeng-lu added 3 commits September 12, 2025 14:01

remove serialization in plans

68fe061

merge main

0f91a76

style

4c19031

pengpeng-lu requested a review from alecgrieser September 12, 2025 23:40

alecgrieser mentioned this pull request Sep 15, 2025

Add serialization tests for ambiguous Tuple/Protobuf cases #3597

Open

alecgrieser requested changes Sep 16, 2025

View reviewed changes

...re/src/main/java/com/apple/foundationdb/record/provider/foundationdb/KeyValueCursorBase.java Outdated Show resolved Hide resolved

...re/src/main/java/com/apple/foundationdb/record/provider/foundationdb/KeyValueCursorBase.java Outdated Show resolved Hide resolved

magic number

05890a0

alecgrieser requested changes Sep 17, 2025

View reviewed changes

pengpeng-lu added 3 commits September 17, 2025 17:04

comments

2e7c809

fix test and style

4280bdd

more tests

04b34a9

pengpeng-lu requested a review from alecgrieser September 18, 2025 06:29

alecgrieser requested changes Sep 18, 2025

View reviewed changes

comments

3f89e3d

pengpeng-lu requested a review from alecgrieser September 18, 2025 16:45

pengpeng-lu added 2 commits September 18, 2025 23:07

nit

2909eb9

Merge branch 'main' into keyvalue_cursor

425073d

alecgrieser requested changes Sep 19, 2025

View reviewed changes

yaml-tests/src/test/resources/disabled-planner-rewrites/aggregate-index-tests.yamsql Show resolved Hide resolved

add test back

db08407

alecgrieser approved these changes Sep 23, 2025

View reviewed changes

alecgrieser merged commit 53409f4 into FoundationDB:main Sep 23, 2025
8 checks passed

Fix bug of scan aggregate index returning empty non-end continuation #3397

Fix bug of scan aggregate index returning empty non-end continuation #3397

Uh oh!

Conversation

pengpeng-lu commented Jun 13, 2025 • edited by alecgrieser Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

pengpeng-lu Jun 17, 2025

Choose a reason for hiding this comment

Uh oh!

alecgrieser left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

alecgrieser left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

alecgrieser left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Labels

2 participants

pengpeng-lu commented Jun 13, 2025 •

edited by alecgrieser

Loading

alecgrieser left a comment •

edited

Loading