Add _metric_names_hash field to OTel metric mappings #120952

felixbarny · 2025-01-27T18:28:18Z

A short-term workaround for #99123

If metrics that have the same timestamp and dimensions aren't grouped into the same document, ES will consider them to be duplicates. The _metric_names_hash field will be set by the OTel ES exporter (see open-telemetry/opentelemetry-collector-contrib#37511). Because it's mapped as a time_series_dimension, documents with different sets of metrics get a different _tsid. The tradeoff is that if the composition of a metrics grouping changes over time, a different _tsid will be created, which has an impact on the rate aggregation for counters.
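A minimal sketch of the collision this describes, under stated assumptions: the document identity is modelled as "timestamp plus dimension values", and `metric_names_hash` is a hypothetical stand-in that uses CRC32 to get an 8-digit hex string. The hash function the exporter actually uses is not stated in this thread, and the real _tsid is computed by Elasticsearch, not by code like this.

```python
import zlib


def metric_names_hash(metric_names):
    """Hypothetical stand-in: 8-hex-digit hash over the sorted metric names."""
    joined = "\x00".join(sorted(metric_names)).encode("utf-8")
    return format(zlib.crc32(joined) & 0xFFFFFFFF, "08x")


def series_key(doc, use_hash):
    """Simplified identity of a document: timestamp plus dimension values.

    This only models the idea; Elasticsearch derives the real _tsid itself.
    """
    dims = dict(doc["dimensions"])
    if use_hash:
        dims["_metric_names_hash"] = metric_names_hash(doc["metrics"].keys())
    return (doc["@timestamp"], tuple(sorted(dims.items())))


# Two documents with the same timestamp and dimensions but different metrics,
# i.e. metrics that were not grouped into one document.
doc_a = {
    "@timestamp": "2025-01-27T18:28:18Z",
    "dimensions": {"host.name": "web-1"},
    "metrics": {"system.cpu.time": 12.5},
}
doc_b = {
    "@timestamp": "2025-01-27T18:28:18Z",
    "dimensions": {"host.name": "web-1"},
    "metrics": {"system.memory.usage": 2048},
}

collides_without_hash = series_key(doc_a, False) == series_key(doc_b, False)
collides_with_hash = series_key(doc_a, True) == series_key(doc_b, True)
print(collides_without_hash, collides_with_hash)
```

Without the extra dimension the two keys are identical, so the second document would look like a duplicate; with it, the keys differ because the metric sets differ.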

elasticsearchmachine · 2025-01-27T18:28:43Z

Pinging @elastic/es-data-management (Team:Data Management)

elasticsearchmachine · 2025-01-27T18:28:58Z

Hi @felixbarny, I've created a changelog YAML for you.

carsonip · 2025-01-27T21:09:51Z

x-pack/plugin/otel-data/src/main/resources/component-templates/metrics-otel@mappings.yaml

 priority: 10
+ # workaround for https://github.com/elastic/elasticsearch/issues/99123
+ _metric_names_hash:
+   type: keyword


Q: Would a number be more lightweight, since you're using an 8-digit hex anyway?

felixbarny · 2025-01-28

At the moment, numbers can't leverage run-length encoding, so it's actually lighter to use a keyword here. All dimensions are incorporated into the _tsid, which we sort by. Therefore, all values for the same _tsid are equal and can be compressed very efficiently.
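To illustrate why sorting matters for run-length encoding, here is a generic sketch. It is not Elasticsearch's or Lucene's codec, the hash values are made up, and sorting the column by value stands in for sorting documents by _tsid.

```python
from itertools import groupby


def run_length_encode(values):
    """Collapse consecutive equal values into (value, run_length) pairs."""
    return [(value, sum(1 for _ in group)) for value, group in groupby(values)]


# Hypothetical _metric_names_hash values from two interleaved time series.
unsorted_column = [
    "a1b2c3d4", "0f9e8d7c", "a1b2c3d4",
    "0f9e8d7c", "a1b2c3d4", "0f9e8d7c",
]

# Once equal values sit next to each other, each series becomes a single run.
sorted_column = sorted(unsorted_column)

runs_unsorted = run_length_encode(unsorted_column)
runs_sorted = run_length_encode(sorted_column)
print(len(runs_unsorted), len(runs_sorted))
```

The interleaved column needs one run per value, while the sorted column needs one run per distinct hash.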

felixbarny · 2025-02-17T12:56:19Z

I had a discussion with @martijnvg about this last week. The conclusion was that this change makes the consequences of imperfect grouping much less severe, so we should move forward with it. It's not a replacement for improving the grouping logic, but it's much better to end up with a different time series than to drop metrics. Debugging why the rate aggregation isn't working properly in some cases is a much less stressful situation than debugging a data loss scenario. Longer-term, it seems like we'll go down the one-metric-per-doc route, where grouping of metrics isn't required anymore.
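The effect on counters can be sketched with a deliberately simplified model. It assumes the increase is computed per time series from consecutive samples; `counter_increase` is a hypothetical helper and not how the Elasticsearch rate aggregation is implemented.

```python
def counter_increase(samples):
    """Sum of increases between consecutive samples of one time series.

    Negative deltas are ignored here as a crude stand-in for counter resets.
    """
    return sum(
        max(later - earlier, 0)
        for earlier, later in zip(samples, samples[1:])
    )


# One monotonically increasing counter, sampled four times.
samples = [100, 110, 120, 130]

# All samples in one time series: every delta is observed.
increase_single_series = counter_increase(samples)

# The grouping changes after the second sample, so the last two samples
# land in a new time series. The delta across the boundary is never seen.
increase_split_series = counter_increase(samples[:2]) + counter_increase(samples[2:])

unobserved_increase = increase_single_series - increase_split_series
print(increase_single_series, increase_split_series, unobserved_increase)
```

In this model no sample is dropped; only the increase that spans the two series goes uncounted, which matches the "wrong rate rather than data loss" tradeoff described above.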

martijnvg · 2025-02-17

LGTM

elasticsearchmachine · 2025-02-18T17:32:51Z

💚 Backport successful

Status	Branch	Result
✅	9.0
✅	8.x
✅	8.16
✅	8.17

…tions (#37511)

If metrics that have the same timestamp and dimensions aren't grouped into the same document, ES will consider them to be a duplicate. This adds a hash of the metric names that will be mapped as a dimension in Elasticsearch. The tradeoff is that if the composition of the metrics grouping changes over time, a new time series will be created. That has an impact on the rate aggregation for counters.

ES mapping changes: elastic/elasticsearch#120952

Co-authored-by: Carson Ip <carsonip@users.noreply.github.com>

carsonip · 2025-04-15T13:30:25Z

💚 All backports created successfully

Status	Branch	Result
✅	8.18

Questions? Please refer to the Backport tool documentation.

…elastic#126850) Bump otel-data plugin version as elastic#120952 missed the bump. (cherry picked from commit 5860ccb) # Conflicts: # x-pack/plugin/otel-data/src/main/resources/resources.yaml

felixbarny added the >bug, :Data Management/Data streams, auto-backport, v9.0.0, v8.18.0, v8.17.2, and v8.16.4 labels Jan 27, 2025

felixbarny requested a review from a team January 27, 2025 18:28

felixbarny requested a review from a team as a code owner January 27, 2025 18:28

felixbarny mentioned this pull request Jan 27, 2025

[exporter/elasticsearch] Add _metric_names_hash to avoid metric rejections open-telemetry/opentelemetry-collector-contrib#37511

Merged

elasticsearchmachine added the Team:Data Management label Jan 27, 2025

elasticsearchmachine added the external-contributor label Jan 27, 2025

Update docs/changelog/120952.yaml

64f168c

carsonip reviewed Jan 27, 2025

carsonip approved these changes Jan 28, 2025

elasticsearchmachine added v8.19.0 v9.1.0 v8.17.3 v8.16.5 and removed v8.18.0 v9.0.0 v8.17.2 v8.16.4 labels Jan 30, 2025

Merge branch 'main' into otel-metric-names-hash

eda60af

felixbarny requested a review from martijnvg February 17, 2025 12:56

martijnvg approved these changes Feb 17, 2025

felixbarny added the v9.0.0 label Feb 18, 2025

felixbarny merged commit 5e8865d into elastic:main Feb 18, 2025
17 checks passed

This was referenced Feb 18, 2025

[9.0] Add _metric_names_hash field to OTel metric mappings (#120952) #122879

Merged

[8.x] Add _metric_names_hash field to OTel metric mappings (#120952) #122880

Merged

This was referenced Feb 18, 2025

[8.16] Add _metric_names_hash field to OTel metric mappings (#120952) #122881

Merged

[8.17] Add _metric_names_hash field to OTel metric mappings (#120952) #122882

Merged

carsonip mentioned this pull request Apr 15, 2025

[8.18] Add _metric_names_hash field to OTel metric mappings (#120952) #126848

Merged

carsonip mentioned this pull request Apr 15, 2025

[otel-data] Bump plugin version to release _metric_names_hash changes #126850

Merged

carsonip added a commit that referenced this pull request Apr 16, 2025

[otel-data] Bump plugin version to release _metric_names_hash changes (…

5860ccb

…#126850) Bump otel-data plugin version as #120952 missed the bump.

carsonip added a commit to carsonip/elasticsearch that referenced this pull request Apr 16, 2025

[otel-data] Bump plugin version to release _metric_names_hash changes (…

05027c7

…elastic#126850) Bump otel-data plugin version as elastic#120952 missed the bump.

carsonip added a commit to carsonip/elasticsearch that referenced this pull request Apr 16, 2025

[otel-data] Bump plugin version to release _metric_names_hash changes (…

198c31c

…elastic#126850) Bump otel-data plugin version as elastic#120952 missed the bump.

carsonip added a commit to carsonip/elasticsearch that referenced this pull request Apr 16, 2025

[otel-data] Bump plugin version to release _metric_names_hash changes (…

d8f742c

…elastic#126850) Bump otel-data plugin version as elastic#120952 missed the bump.

elasticsearchmachine pushed a commit that referenced this pull request Apr 16, 2025

[otel-data] Bump plugin version to release _metric_names_hash changes (…

5cddb3a

…#126850) (#126899) Bump otel-data plugin version as #120952 missed the bump.

carsonip added a commit that referenced this pull request Apr 17, 2025

[otel-data] Bump plugin version to release _metric_names_hash changes (…

c8760c3

…#126850) (#126900) Bump otel-data plugin version as #120952 missed the bump. (cherry picked from commit 5860ccb)

elasticsearchmachine pushed a commit that referenced this pull request Apr 17, 2025

[otel-data] Bump plugin version to release _metric_names_hash changes (…

88d2f00

…#126850) (#126898) Bump otel-data plugin version as #120952 missed the bump.

elasticsearchmachine pushed a commit that referenced this pull request Apr 17, 2025

[otel-data] Bump plugin version to release _metric_names_hash changes (…

642d26e

…#126850) (#126896) Bump otel-data plugin version as #120952 missed the bump.
