Avoid NPE in DiskThresholdMonitor #93699

pxsalehi · 2023-02-10T12:38:41Z

Process new ClusterInfo only when the cluster state is recovered, since we need to know the indices/nodes in the cluster for this.
Avoid missing node IDs that could result from a REPLACE shutdown metadata.

Both of these could lead to NPEs which is not fatal in 8.7/8.8, but could lead to unresolved listeners on 7.17.

elasticsearchmachine · 2023-02-10T12:39:54Z

Hi @pxsalehi, I've created a changelog YAML for you.

elasticsearchmachine · 2023-02-10T13:05:48Z

Pinging @elastic/es-distributed (Team:Distributed)

DaveCTurner

Fixes look good but I think we should have some tests too - see DiskThresholdMonitorTests

…pxsalehi/elasticsearch into ps230210-avoidNPEInDiskThresholdMonitor

…DiskThresholdMonitor

DaveCTurner

LGTM, just a couple of very small nits

DaveCTurner · 2023-02-13T10:38:28Z

...er/src/test/java/org/elasticsearch/cluster/routing/allocation/DiskThresholdMonitorTests.java

+ assertEquals(Set.of("test"), result.v2());
+
+ final ClusterState blockedClusterState = ClusterState.builder(clusterState)
+ .blocks(ClusterBlocks.builder().addGlobalBlock(GatewayService.STATE_NOT_RECOVERED_BLOCK).build())


nit: we should also have an empty routing table in this state (the routing table is initialised when removing GatewayService.STATE_NOT_RECOVERED_BLOCK)

DaveCTurner · 2023-02-13T10:38:39Z

...er/src/test/java/org/elasticsearch/cluster/routing/allocation/DiskThresholdMonitorTests.java

+ DiscoveryNodes.Builder discoveryNodes = DiscoveryNodes.builder()
+ .add(newNormalNode("node1", "node1"))
+ .add(newNormalNode("node2", "node2"));
+ // node3 which is to replace node1 may or may not bee in the cluster


typo :)

Suggested change

// node3 which is to replace node1 may or may not bee in the cluster

// node3 which is to replace node1 may or may not be in the cluster

- Process new ClusterInfo only when the cluster state is recovered, since we need to know the indices/nodes in the cluster for this. - Avoid missing node IDs that could result from a REPLACE shutdown metadata. Both of these could lead to NPEs which is not fatal in 8.7/8.8, but could lead to unresolved listeners on 7.17.

elasticsearchmachine · 2023-02-13T11:47:16Z

💔 Backport failed

Status	Branch	Result
❌	7.17	Commit could not be cherrypicked due to conflicts
✅	8.7

You can use sqren/backport to manually backport by running backport --upstream elastic/elasticsearch --pr 93699

- Process new ClusterInfo only when the cluster state is recovered, since we need to know the indices/nodes in the cluster for this. - Avoid missing node IDs that could result from a REPLACE shutdown metadata. Both of these could lead to NPEs which is not fatal in 8.7/8.8, but could lead to unresolved listeners on 7.17.

* Fix Gradle project evaluation when runtime java home is unset * Avoid NPE in DiskThresholdMonitor (#93699) - Process new ClusterInfo only when the cluster state is recovered, since we need to know the indices/nodes in the cluster for this. - Avoid missing node IDs that could result from a REPLACE shutdown metadata. Both of these could lead to NPEs which is not fatal in 8.7/8.8, but could lead to unresolved listeners on 7.17. --------- Co-authored-by: Mark Vieira <portugee@gmail.com> Co-authored-by: Elastic Machine <elasticmachine@users.noreply.github.com>

- Process new ClusterInfo only when the cluster state is recovered, since we need to know the indices/nodes in the cluster for this. - Avoid missing node IDs that could result from a REPLACE shutdown metadata. Both of these could lead to NPEs which is not fatal in 8.7/8.8, but could lead to unresolved listeners on 7.17.

avoid NPE in DiskThresholdMonitor

0db2d3a

pxsalehi added >bug :Distributed Coordination/Allocation All issues relating to the decision making around placing a shard (both master logic & on the nodes) labels Feb 10, 2023

elasticsearchmachine added the v8.8.0 label Feb 10, 2023

pxsalehi added v7.17.10 v8.7.1 labels Feb 10, 2023

Update docs/changelog/93699.yaml

721744f

pxsalehi marked this pull request as ready for review February 10, 2023 13:05

pxsalehi requested a review from DaveCTurner February 10, 2023 13:05

elasticsearchmachine added the Team:Distributed (Obsolete) Meta label for distributed team (obsolete). Replaced by Distributed Indexing/Coordination. label Feb 10, 2023

pxsalehi added auto-backport-and-merge and removed auto-backport-and-merge labels Feb 10, 2023

DaveCTurner reviewed Feb 10, 2023

View reviewed changes

pxsalehi added 4 commits February 13, 2023 11:22

add test

4024cf0

Merge branch 'ps230210-avoidNPEInDiskThresholdMonitor' of github.com:…

e961a1f

…pxsalehi/elasticsearch into ps230210-avoidNPEInDiskThresholdMonitor

reword changelog

0727bb7

Merge remote-tracking branch 'upstream/main' into ps230210-avoidNPEIn…

bbe644b

…DiskThresholdMonitor

pxsalehi requested a review from DaveCTurner February 13, 2023 10:26

pxsalehi self-assigned this Feb 13, 2023

DaveCTurner approved these changes Feb 13, 2023

View reviewed changes

address nits

c8cad2f

pxsalehi added the auto-backport-and-merge label Feb 13, 2023

pxsalehi merged commit b4712a5 into elastic:main Feb 13, 2023

pxsalehi mentioned this pull request Feb 13, 2023

[8.7] Avoid NPE in DiskThresholdMonitor (#93699) #93737

Merged

elasticsearchmachine added the backport pending label Feb 13, 2023

pxsalehi mentioned this pull request Feb 13, 2023

[7.17] Avoid NPE in DiskThresholdMonitor (#93699) #93743

Merged

bpintea removed the backport pending label Apr 21, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Avoid NPE in DiskThresholdMonitor #93699

Avoid NPE in DiskThresholdMonitor #93699

Uh oh!

pxsalehi commented Feb 10, 2023 •

edited

Loading

elasticsearchmachine commented Feb 10, 2023

elasticsearchmachine commented Feb 10, 2023

DaveCTurner left a comment

DaveCTurner left a comment

DaveCTurner Feb 13, 2023

DaveCTurner Feb 13, 2023

pxsalehi Feb 13, 2023

elasticsearchmachine commented Feb 13, 2023

Labels

4 participants

	// node3 which is to replace node1 may or may not bee in the cluster
	// node3 which is to replace node1 may or may not be in the cluster

Uh oh!

Avoid NPE in DiskThresholdMonitor #93699

Avoid NPE in DiskThresholdMonitor #93699

Uh oh!

Conversation

pxsalehi commented Feb 10, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

elasticsearchmachine commented Feb 10, 2023

elasticsearchmachine commented Feb 10, 2023

DaveCTurner left a comment

Choose a reason for hiding this comment

DaveCTurner left a comment

Choose a reason for hiding this comment

DaveCTurner Feb 13, 2023

Choose a reason for hiding this comment

DaveCTurner Feb 13, 2023

Choose a reason for hiding this comment

pxsalehi Feb 13, 2023

Choose a reason for hiding this comment

elasticsearchmachine commented Feb 13, 2023

💔 Backport failed

Labels

4 participants

pxsalehi commented Feb 10, 2023 •

edited

Loading