Allows SparseFileTracker to progressively execute listeners during Gap processing #58477

tlrx · 2020-06-24T08:10:55Z

Today SparseFileTracker allows to wait for a range to become available before executing a given listener. In the case of searchable snapshot, we'd like to be able to wait for a large range to be filled (ie, downloaded and written to disk) while being able to execute the listener as soon as a smaller range is available.

This pull request is an extract from #58164 which introduces a ProgressListenableActionFuture that is used internally by SparseFileTracker. The progressive listenable future allows to register listeners attached to SparseFileTracker.Gap so that they are executed once the Gap is completed (with success or failure) or as soon as the Gap progress reaches a given progress value. This progress value is defined when the tracker.waitForRange() method is called; this method has been modified to accept a range and another listener's range to operate on.

This pull request does not modify how CacheFile requests ranges from the SparseFileTracker, this should be done in another pull request. Therefore CacheFile uses a listener's range that is equal to the range to be written and a //TODO has been added.

elasticmachine · 2020-06-24T08:10:58Z

Pinging @elastic/es-distributed (:Distributed/Snapshot/Restore)

DaveCTurner

Looks good, I suggested mainly a few simplifications to the API.

DaveCTurner · 2020-06-24T10:04:33Z

...pshots/src/main/java/org/elasticsearch/index/store/cache/ProgressListenableActionFuture.java

+ * potentially triggers the execution of one or more listeners that are waiting for the progress
+ * to reach a value lower than the one just updated.
+ *
+ * @param value the new progress value


I think there might be an off-by-one error lurking here. IMO we should report progress of value when the range ⟨start, value⟩ is available, noting that our ranges are inclusive at the start and exclusive at the end. This means I think we can start with progress == start (not null) indicating that the available range is empty, and require start < value below.

Probably a good idea to document this here too.

I agree; I've been a bit back and forth here.

Should we rename the parameter progress to align it with the {@code progress} in the paragraph above?

DaveCTurner · 2020-06-24T10:07:21Z

...earchable-snapshots/src/main/java/org/elasticsearch/index/store/cache/SparseFileTracker.java

- public List<Gap> waitForRange(final long start, final long end, final ActionListener<Void> listener) {
+ public List<Gap> waitForRange(
+ final Tuple<Long, Long> range,
+ @Nullable final Tuple<Long, Long> subRange,


Let's require the subrange to be non-null, it's only null in tests AFAICT.

DaveCTurner · 2020-06-24T10:09:51Z

...earchable-snapshots/src/main/java/org/elasticsearch/index/store/cache/SparseFileTracker.java

- final PlainListenableActionFuture<Void> completionListener;
+ final ProgressListenableActionFuture completionListener;
+
+ Range(long start, long end) {


I expected this constructor to make a completed range whereas in fact we use the other one and pass null as the listener. There's only a couple of call-sites, I'd prefer to inline this to avoid that confusion.

DaveCTurner · 2020-06-24T10:11:31Z

...earchable-snapshots/src/main/java/org/elasticsearch/index/store/cache/SparseFileTracker.java

- @Override
+ public void onProgress(long value) {
+ if (value < start || end < value) {
+ throw new IllegalArgumentException("Cannot update progress [" + value + "] for gap [" + start + '-' + end + ']');


I think this shouldn't happen, so should be an assertion. Maybe just remove it since we assert it in onGapProgress anyway.

DaveCTurner · 2020-06-24T10:14:52Z

...earchable-snapshots/src/main/java/org/elasticsearch/index/store/cache/SparseFileTracker.java

+ assert invariant();
+
+ final Range range = new Range(start, end, null);
+ final SortedSet<Range> existingRanges = ranges.tailSet(range);


Seems strange to have to look up the range corresponding with the gap here, maybe Gap should keep hold of the corresponding Range so it can call the listener directly.

The onSuccess and onFailure methods would also benefit from keeping a reference to the range; the look up should not be necessary there too as a pending Range should not be completed or failed outside of the corresponding Gap. I'm tempted to address this in a follow up PR.

Sure, a followup is fine. The difference with onSuccess and onFailure is that they also adjust ranges, but they do indeed start with the same kind of lookup as we do here.

I opened #58587 for this.

DaveCTurner · 2020-06-24T10:16:55Z

...pshots/src/main/java/org/elasticsearch/index/store/cache/ProgressListenableActionFuture.java

+ * @param listener the {@link ActionListener} to add
+ */
+ @Override
+ public void addListener(final ActionListener<Long> listener) {


Can we drop this (and the implements ListenableActionFuture<Long> that requires it)? I think callers should always specify their target endpoint.

DaveCTurner · 2020-06-24T10:22:08Z

...able-snapshots/src/test/java/org/elasticsearch/index/store/cache/SparseFileTrackerTests.java

+ assertThat(fileContents[Math.toIntExact(i)], equalTo(UNAVAILABLE));
+ fileContents[Math.toIntExact(i)] = AVAILABLE;
+ assertTrue(wasNotified.get());
+ gap.onProgress(i);


Here's the off-by-one error: when fileContents[i] is available we should do this:

Suggested change

gap.onProgress(i);

gap.onProgress(i + 1);

Thanks David! It becomes a tradition (sadly)

tlrx · 2020-06-25T11:31:33Z

@DaveCTurner Thanks for your review. I've updated the code, let me know what you think please.

DaveCTurner

Production code looks good; I suggested some extra tests.

DaveCTurner · 2020-06-25T11:37:52Z

...pshots/src/main/java/org/elasticsearch/index/store/cache/ProgressListenableActionFuture.java

+ * potentially triggers the execution of one or more listeners that are waiting for the progress
+ * to reach a value lower than the one just updated.
+ *
+ * @param value the new progress value


Should we rename the parameter progress to align it with the {@code progress} in the paragraph above?

DaveCTurner · 2020-06-25T11:38:58Z

...pshots/src/main/java/org/elasticsearch/index/store/cache/ProgressListenableActionFuture.java

+ assert completed == false || listeners == null;
+ assert start <= progress : start + " <= " + progress;
+ assert progress <= end : progress + " <= " + end;
+ assert listeners == null || listeners.stream().allMatch(listener -> progress < listener.v1());


Should we require completed == false || progress == end too?

I don't think so: completed indicates that the future is done, either with success or failure. In case of failure it could be completed before the progress reached end.

Ah yes so it does. Can we assert that successful completion only happens with progress == end?

Can we assert that successful completion only happens with progress == end?

It means to change the done() method to pass around the completion state (success or failure/cancel) of the future so that we can later compare the successful completion plus the progress/end values. It also means that the progress must be updated in a more strictly manner before completing the future. I gave it a try in a5b29d5, let me know what you think.

DaveCTurner · 2020-06-25T11:52:15Z