Allow adjustment of transport TLS handshake timeout #130909

DaveCTurner · 2025-07-09T12:02:12Z

The default 10s TLS handshake timeout may be too short if there is some
bug causing event-loop latency, and this has more serious consequences
than the underlying performance issue (e.g. it prevents the cluster from
scaling up to work around the problem). With this commit we expose a
setting that allows the timeout to be configured, providing a workaround
in such cases.

The default 10s TLS handshake timeout may be too short if there is some bug causing event-loop latency, and this has more serious consequences than the underlying performance issue (e.g. it prevents the cluster from scaling up to work around the problem). With this commit we expose a setting that allows the timeout to be configured, providing a workaround in such cases.

elasticsearchmachine · 2025-07-09T12:02:37Z

Pinging @elastic/es-distributed-coordination (Team:Distributed Coordination)

elasticsearchmachine · 2025-07-09T12:02:37Z

Hi @DaveCTurner, I've created a changelog YAML for you.

github-actions · 2025-07-09T12:05:33Z

🔍 Preview links for changed docs

docs/reference/elasticsearch/configuration-reference/security-settings.md

mhl-b

lgtm

...ain/java/org/elasticsearch/xpack/core/security/transport/netty4/SecurityNetty4Transport.java

ywangd · 2025-07-10T05:28:19Z

x-pack/plugin/core/src/main/java/org/elasticsearch/xpack/core/ssl/SSLService.java

+ private static final Setting<TimeValue> TRANSPORT_TLS_HANDSHAKE_TIMEOUT_SETTING = Setting.positiveTimeSetting(
+ "xpack.security.transport.ssl.handshake_timeout",
+ TimeValue.timeValueSeconds(10),
+ Setting.Property.NodeScope
+ );


IIUC, this will affect more than transport connections. It should at least also apply to RCS 2.0 remote cluster client and likely security realms that initiate outbound TLS connections, e.g. OIDC realm.

Most existing SSL settings are affix settings that apply to different contexts. The transport is one of the contexts. Defining these settings is a somewhat involved process via SSLConfigurationSettings to support contexts.

I think we should either:

Support this new setting for different contexts as well.

Dropping the transport part from the setting name, i.e. xpack.security.ssl.handshake_timeout, as well as updating the docs to indicate it applies more broadly.

What do you think?

I think it only affects transport connections, i.e. those which go via SecurityNetty4Transport. That does indeed include remote-cluster connections, but not other outbound TLS connections like the HTTPS ones involved in OIDC. I hadn't noticed that we count RCS2.0 transport connections as distinct from other transport connections in terms of this kind of configuration.

It's a bit tricky tho, I don't really want to have to add support for this setting to all the different contexts in which we do TLS handshakes. At least not today: progress over perfection and all that. If we called it xpack.security.ssl.handshake_timeout then that'd imply it worked everywhere. I'd rather keep it transport-specific, but I think I can see a way to add this to the RCS2.0 settings too.

Ok see dc3a9ac

Yeah you are right about this does not apply to realms.

ywangd

LGTM

The default 10s TLS handshake timeout may be too short if there is some bug causing event-loop latency, and this has more serious consequences than the underlying performance issue (e.g. it prevents the cluster from scaling up to work around the problem). With this commit we expose a setting that allows the timeout to be configured, providing a workaround in such cases.

DaveCTurner requested a review from mhl-b July 9, 2025 12:02

DaveCTurner added >enhancement :Distributed Coordination/Network Http and internode communication implementations v9.2.0 labels Jul 9, 2025

elasticsearchmachine added the Team:Distributed Coordination Meta label for Distributed Coordination team label Jul 9, 2025

DaveCTurner added 2 commits July 9, 2025 13:02

Update docs/changelog/130909.yaml

9b0aa6e

Reinstate more specific exception type

39f6560

DaveCTurner added 2 commits July 9, 2025 17:47

Merge branch 'main' into 2025/07/09/tls-handshake-timeout-setting

3a2df38

Merge branch 'main' into 2025/07/09/tls-handshake-timeout-setting

dd47923

mhl-b approved these changes Jul 9, 2025

View reviewed changes

Merge branch 'main' into 2025/07/09/tls-handshake-timeout-setting

578c19e

DaveCTurner added the auto-merge-without-approval Automatically merge pull request when CI checks pass (NB doesn't wait for reviews!) label Jul 9, 2025

ywangd reviewed Jul 10, 2025

View reviewed changes

...ain/java/org/elasticsearch/xpack/core/security/transport/netty4/SecurityNetty4Transport.java Outdated Show resolved Hide resolved

ywangd reviewed Jul 10, 2025

View reviewed changes

DaveCTurner added 2 commits July 10, 2025 08:32

Merge branch 'main' into 2025/07/09/tls-handshake-timeout-setting

499d94d

Make settings context-specific (and distinct for RCS2.0)

dc3a9ac

ywangd approved these changes Jul 10, 2025

View reviewed changes

DaveCTurner merged commit e57a0d0 into elastic:main Jul 10, 2025
34 checks passed

DaveCTurner deleted the 2025/07/09/tls-handshake-timeout-setting branch July 10, 2025 15:50

shainaraskas mentioned this pull request Jul 23, 2025

add availability information for ssl handshake timeout settings #131786

Merged

shainaraskas added the docs-missing-applies-tags PRs that are missing docs applies_to tags for an upcoming release. label Jul 23, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Allow adjustment of transport TLS handshake timeout #130909

Allow adjustment of transport TLS handshake timeout #130909

Uh oh!

DaveCTurner commented Jul 9, 2025

elasticsearchmachine commented Jul 9, 2025

elasticsearchmachine commented Jul 9, 2025

github-actions bot commented Jul 9, 2025 •

edited

Loading

mhl-b left a comment

Uh oh!

ywangd Jul 10, 2025

DaveCTurner Jul 10, 2025

DaveCTurner Jul 10, 2025

ywangd Jul 10, 2025

ywangd left a comment

Uh oh!

Labels

5 participants

Allow adjustment of transport TLS handshake timeout #130909

Allow adjustment of transport TLS handshake timeout #130909

Uh oh!

Conversation

DaveCTurner commented Jul 9, 2025

elasticsearchmachine commented Jul 9, 2025

elasticsearchmachine commented Jul 9, 2025

github-actions bot commented Jul 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔍 Preview links for changed docs

mhl-b left a comment

Choose a reason for hiding this comment

Uh oh!

ywangd Jul 10, 2025

Choose a reason for hiding this comment

DaveCTurner Jul 10, 2025

Choose a reason for hiding this comment

DaveCTurner Jul 10, 2025

Choose a reason for hiding this comment

ywangd Jul 10, 2025

Choose a reason for hiding this comment

ywangd left a comment

Choose a reason for hiding this comment

Uh oh!

Labels

5 participants

github-actions bot commented Jul 9, 2025 •

edited

Loading