Skip to content

Conversation

bric3
Copy link
Contributor

@bric3 bric3 commented Jun 30, 2025

What Does This Do

Check then notify if the release do not meet expected thresholds.

Screenshot 2025-07-11 at 16 37 58 Screenshot 2025-07-11 at 16 13 18

Motivation

Ensure performance thresholds.

Additional Notes

Contributor Checklist

Jira ticket: [PROJ-IDENT]

@bric3 bric3 requested a review from a team as a code owner June 30, 2025 16:59
@bric3 bric3 requested review from colin-higgins and removed request for a team June 30, 2025 16:59
@bric3 bric3 added tag: no release notes Changes to exclude from release notes comp: tooling Build & Tooling labels Jun 30, 2025
@bric3 bric3 marked this pull request as draft June 30, 2025 16:59
@bric3 bric3 changed the title chore(ci): Basic slo breach prototype Perf release gate Jun 30, 2025
@pr-commenter
Copy link

pr-commenter bot commented Jun 30, 2025

Benchmarks

Startup

Parameters

Baseline Candidate
baseline_or_candidate baseline candidate
git_branch master bdu/r-gate
git_commit_date 1752761628 1752763864
git_commit_sha d1794c1 df915f0
release_version 1.52.0-SNAPSHOT~d1794c17af 1.52.0-SNAPSHOT~df915f0a03
See matching parameters
Baseline Candidate
application insecure-bank insecure-bank
ci_job_date 1752765625 1752765625
ci_job_id 1034135474 1034135474
ci_pipeline_id 70870999 70870999
cpu_model Intel(R) Xeon(R) Platinum 8259CL CPU @ 2.50GHz Intel(R) Xeon(R) Platinum 8259CL CPU @ 2.50GHz
kernel_version Linux runner-zfyrx7zua-project-304-concurrent-1-j5jmyhp9 6.8.0-1031-aws #33~22.04.1-Ubuntu SMP Thu Jun 26 14:22:30 UTC 2025 x86_64 x86_64 x86_64 GNU/Linux Linux runner-zfyrx7zua-project-304-concurrent-1-j5jmyhp9 6.8.0-1031-aws #33~22.04.1-Ubuntu SMP Thu Jun 26 14:22:30 UTC 2025 x86_64 x86_64 x86_64 GNU/Linux
module Agent Agent
parent None None

Summary

Found 0 performance improvements and 0 performance regressions! Performance is the same for 49 metrics, 10 unstable metrics.

Startup time reports for petclinic
gantt title petclinic - global startup overhead: candidate=1.52.0-SNAPSHOT~df915f0a03, baseline=1.52.0-SNAPSHOT~d1794c17af dateFormat X axisFormat %s section tracing Agent [baseline] (1.043 s) : 0, 1043390 Total [baseline] (10.692 s) : 0, 10692369 Agent [candidate] (1.04 s) : 0, 1039928 Total [candidate] (10.702 s) : 0, 10702188 section appsec Agent [baseline] (1.214 s) : 0, 1213686 Total [baseline] (10.824 s) : 0, 10823993 Agent [candidate] (1.216 s) : 0, 1216395 Total [candidate] (10.77 s) : 0, 10770023 section iast Agent [baseline] (1.186 s) : 0, 1185650 Total [baseline] (10.897 s) : 0, 10896987 Agent [candidate] (1.171 s) : 0, 1171480 Total [candidate] (10.982 s) : 0, 10981897 section profiling Agent [baseline] (1.184 s) : 0, 1183973 Total [baseline] (10.93 s) : 0, 10929792 Agent [candidate] (1.184 s) : 0, 1183632 Total [candidate] (10.937 s) : 0, 10936649 
Loading
  • baseline results
Module Variant Duration Δ tracing
Agent tracing 1.043 s -
Agent appsec 1.214 s 170.296 ms (16.3%)
Agent iast 1.186 s 142.26 ms (13.6%)
Agent profiling 1.184 s 140.582 ms (13.5%)
Total tracing 10.692 s -
Total appsec 10.824 s 131.623 ms (1.2%)
Total iast 10.897 s 204.618 ms (1.9%)
Total profiling 10.93 s 237.423 ms (2.2%)
  • candidate results
Module Variant Duration Δ tracing
Agent tracing 1.04 s -
Agent appsec 1.216 s 176.467 ms (17.0%)
Agent iast 1.171 s 131.551 ms (12.7%)
Agent profiling 1.184 s 143.703 ms (13.8%)
Total tracing 10.702 s -
Total appsec 10.77 s 67.836 ms (0.6%)
Total iast 10.982 s 279.709 ms (2.6%)
Total profiling 10.937 s 234.461 ms (2.2%)
gantt title petclinic - break down per module: candidate=1.52.0-SNAPSHOT~df915f0a03, baseline=1.52.0-SNAPSHOT~d1794c17af dateFormat X axisFormat %s section tracing crashtracking [baseline] (1.438 ms) : 0, 1438 crashtracking [candidate] (1.428 ms) : 0, 1428 BytebuddyAgent [baseline] (733.131 ms) : 0, 733131 BytebuddyAgent [candidate] (730.932 ms) : 0, 730932 GlobalTracer [baseline] (242.267 ms) : 0, 242267 GlobalTracer [candidate] (241.265 ms) : 0, 241265 AppSec [baseline] (30.577 ms) : 0, 30577 AppSec [candidate] (30.506 ms) : 0, 30506 Debugger [baseline] (6.031 ms) : 0, 6031 Debugger [candidate] (6.016 ms) : 0, 6016 Remote Config [baseline] (662.859 µs) : 0, 663 Remote Config [candidate] (649.928 µs) : 0, 650 Telemetry [baseline] (8.247 ms) : 0, 8247 Telemetry [candidate] (8.192 ms) : 0, 8192 section appsec crashtracking [baseline] (1.439 ms) : 0, 1439 crashtracking [candidate] (1.434 ms) : 0, 1434 BytebuddyAgent [baseline] (748.779 ms) : 0, 748779 BytebuddyAgent [candidate] (750.853 ms) : 0, 750853 GlobalTracer [baseline] (233.218 ms) : 0, 233218 GlobalTracer [candidate] (234.472 ms) : 0, 234472 AppSec [baseline] (166.829 ms) : 0, 166829 AppSec [candidate] (168.234 ms) : 0, 168234 Debugger [baseline] (9.45 ms) : 0, 9450 Debugger [candidate] (7.164 ms) : 0, 7164 Remote Config [baseline] (612.387 µs) : 0, 612 Remote Config [candidate] (607.646 µs) : 0, 608 Telemetry [baseline] (8.895 ms) : 0, 8895 Telemetry [candidate] (8.794 ms) : 0, 8794 IAST [baseline] (23.3 ms) : 0, 23300 IAST [candidate] (23.7 ms) : 0, 23700 section iast crashtracking [baseline] (1.452 ms) : 0, 1452 crashtracking [candidate] (1.427 ms) : 0, 1427 BytebuddyAgent [baseline] (856.305 ms) : 0, 856305 BytebuddyAgent [candidate] (845.804 ms) : 0, 845804 GlobalTracer [baseline] (234.801 ms) : 0, 234801 GlobalTracer [candidate] (230.774 ms) : 0, 230774 AppSec [baseline] (28.988 ms) : 0, 28988 AppSec [candidate] (27.562 ms) : 0, 27562 Debugger [baseline] (6.633 ms) : 0, 6633 Debugger [candidate] (6.737 ms) : 0, 6737 Remote Config [baseline] (594.273 µs) : 0, 594 Remote Config [candidate] (597.083 µs) : 0, 597 Telemetry [baseline] (8.025 ms) : 0, 8025 Telemetry [candidate] (7.921 ms) : 0, 7921 IAST [baseline] (27.734 ms) : 0, 27734 IAST [candidate] (29.827 ms) : 0, 29827 section profiling crashtracking [baseline] (1.428 ms) : 0, 1428 crashtracking [candidate] (1.409 ms) : 0, 1409 BytebuddyAgent [baseline] (759.06 ms) : 0, 759060 BytebuddyAgent [candidate] (759.498 ms) : 0, 759498 GlobalTracer [baseline] (220.833 ms) : 0, 220833 GlobalTracer [candidate] (220.32 ms) : 0, 220320 AppSec [baseline] (30.4 ms) : 0, 30400 AppSec [candidate] (30.339 ms) : 0, 30339 Debugger [baseline] (6.293 ms) : 0, 6293 Debugger [candidate] (6.322 ms) : 0, 6322 Remote Config [baseline] (700.313 µs) : 0, 700 Remote Config [candidate] (675.378 µs) : 0, 675 Telemetry [baseline] (10.974 ms) : 0, 10974 Telemetry [candidate] (12.394 ms) : 0, 12394 ProfilingAgent [baseline] (106.018 ms) : 0, 106018 ProfilingAgent [candidate] (104.396 ms) : 0, 104396 Profiling [baseline] (106.04 ms) : 0, 106040 Profiling [candidate] (104.418 ms) : 0, 104418 
Loading
Startup time reports for insecure-bank
gantt title insecure-bank - global startup overhead: candidate=1.52.0-SNAPSHOT~df915f0a03, baseline=1.52.0-SNAPSHOT~d1794c17af dateFormat X axisFormat %s section tracing Agent [baseline] (1.042 s) : 0, 1041547 Total [baseline] (8.636 s) : 0, 8635628 Agent [candidate] (1.037 s) : 0, 1037324 Total [candidate] (8.602 s) : 0, 8601976 section iast Agent [baseline] (1.171 s) : 0, 1171232 Total [baseline] (9.282 s) : 0, 9282471 Agent [candidate] (1.19 s) : 0, 1190343 Total [candidate] (9.35 s) : 0, 9349969 
Loading
  • baseline results
Module Variant Duration Δ tracing
Agent tracing 1.042 s -
Agent iast 1.171 s 129.685 ms (12.5%)
Total tracing 8.636 s -
Total iast 9.282 s 646.843 ms (7.5%)
  • candidate results
Module Variant Duration Δ tracing
Agent tracing 1.037 s -
Agent iast 1.19 s 153.02 ms (14.8%)
Total tracing 8.602 s -
Total iast 9.35 s 747.993 ms (8.7%)
gantt title insecure-bank - break down per module: candidate=1.52.0-SNAPSHOT~df915f0a03, baseline=1.52.0-SNAPSHOT~d1794c17af dateFormat X axisFormat %s section tracing crashtracking [baseline] (1.43 ms) : 0, 1430 crashtracking [candidate] (1.432 ms) : 0, 1432 BytebuddyAgent [baseline] (731.047 ms) : 0, 731047 BytebuddyAgent [candidate] (728.97 ms) : 0, 728970 GlobalTracer [baseline] (242.523 ms) : 0, 242523 GlobalTracer [candidate] (240.887 ms) : 0, 240887 AppSec [baseline] (30.641 ms) : 0, 30641 AppSec [candidate] (30.305 ms) : 0, 30305 Debugger [baseline] (6.037 ms) : 0, 6037 Debugger [candidate] (5.989 ms) : 0, 5989 Remote Config [baseline] (657.752 µs) : 0, 658 Remote Config [candidate] (654.164 µs) : 0, 654 Telemetry [baseline] (8.3 ms) : 0, 8300 Telemetry [candidate] (8.183 ms) : 0, 8183 section iast crashtracking [baseline] (1.425 ms) : 0, 1425 crashtracking [candidate] (1.453 ms) : 0, 1453 BytebuddyAgent [baseline] (845.701 ms) : 0, 845701 BytebuddyAgent [candidate] (861.146 ms) : 0, 861146 GlobalTracer [baseline] (230.673 ms) : 0, 230673 GlobalTracer [candidate] (233.319 ms) : 0, 233319 AppSec [baseline] (28.144 ms) : 0, 28144 AppSec [candidate] (27.675 ms) : 0, 27675 Debugger [baseline] (8.328 ms) : 0, 8328 Debugger [candidate] (6.75 ms) : 0, 6750 Remote Config [baseline] (589.455 µs) : 0, 589 Remote Config [candidate] (615.116 µs) : 0, 615 Telemetry [baseline] (7.962 ms) : 0, 7962 Telemetry [candidate] (8.071 ms) : 0, 8071 IAST [baseline] (27.497 ms) : 0, 27497 IAST [candidate] (30.198 ms) : 0, 30198 
Loading

Load

Parameters

Baseline Candidate
baseline_or_candidate baseline candidate
git_branch master bdu/r-gate
git_commit_date 1752761628 1752763864
git_commit_sha d1794c1 df915f0
release_version 1.52.0-SNAPSHOT~d1794c17af 1.52.0-SNAPSHOT~df915f0a03
See matching parameters
Baseline Candidate
application insecure-bank insecure-bank
ci_job_date 1752765370 1752765370
ci_job_id 1034135475 1034135475
ci_pipeline_id 70870999 70870999
cpu_model Intel(R) Xeon(R) Platinum 8259CL CPU @ 2.50GHz Intel(R) Xeon(R) Platinum 8259CL CPU @ 2.50GHz
kernel_version Linux runner-zfyrx7zua-project-304-concurrent-0-hrg2yowu 6.8.0-1031-aws #33~22.04.1-Ubuntu SMP Thu Jun 26 14:22:30 UTC 2025 x86_64 x86_64 x86_64 GNU/Linux Linux runner-zfyrx7zua-project-304-concurrent-0-hrg2yowu 6.8.0-1031-aws #33~22.04.1-Ubuntu SMP Thu Jun 26 14:22:30 UTC 2025 x86_64 x86_64 x86_64 GNU/Linux

Summary

Found 1 performance improvements and 2 performance regressions! Performance is the same for 9 metrics, 12 unstable metrics.

scenario Δ mean http_req_duration Δ mean throughput candidate mean http_req_duration candidate mean throughput baseline mean http_req_duration baseline mean throughput
scenario:load:insecure-bank:tracing:high_load worse
[+334.443µs; +587.842µs] or [+4.381%; +7.700%]
unstable
[-109.476op/s; +41.038op/s] or [-18.043%; +6.764%]
8.095ms 572.531op/s 7.634ms 606.750op/s
scenario:load:insecure-bank:iast:high_load worse
[+304.279µs; +637.376µs] or [+3.341%; +6.998%]
unstable
[-83.520op/s; +33.833op/s] or [-16.392%; +6.640%]
9.579ms 484.688op/s 9.109ms 509.531op/s
scenario:load:petclinic:appsec:high_load better
[-4.214ms; -3.277ms] or [-8.415%; -6.545%]
unstable
[+0.324op/s; +14.776op/s] or [+0.347%; +15.801%]
46.325ms 101.062op/s 50.070ms 93.513op/s
Request duration reports for insecure-bank
gantt title insecure-bank - request duration [CI 0.99] : candidate=1.52.0-SNAPSHOT~df915f0a03, baseline=1.52.0-SNAPSHOT~d1794c17af dateFormat X axisFormat %s section baseline no_agent (4.452 ms) : 4391, 4512 . : milestone, 4452, iast (9.109 ms) : 8960, 9257 . : milestone, 9109, iast_FULL (14.038 ms) : 13761, 14315 . : milestone, 14038, iast_GLOBAL (10.033 ms) : 9840, 10227 . : milestone, 10033, profiling (8.733 ms) : 8583, 8884 . : milestone, 8733, tracing (7.634 ms) : 7518, 7750 . : milestone, 7634, section candidate no_agent (4.462 ms) : 4412, 4512 . : milestone, 4462, iast (9.579 ms) : 9419, 9740 . : milestone, 9579, iast_FULL (13.78 ms) : 13505, 14055 . : milestone, 13780, iast_GLOBAL (10.197 ms) : 10015, 10379 . : milestone, 10197, profiling (9.002 ms) : 8862, 9143 . : milestone, 9002, tracing (8.095 ms) : 7976, 8215 . : milestone, 8095, 
Loading
  • baseline results
Variant Request duration [CI 0.99] Δ no_agent
no_agent 4.452 ms [4.391 ms, 4.512 ms] -
iast 9.109 ms [8.96 ms, 9.257 ms] 4.657 ms (104.6%)
iast_FULL 14.038 ms [13.761 ms, 14.315 ms] 9.587 ms (215.4%)
iast_GLOBAL 10.033 ms [9.84 ms, 10.227 ms] 5.582 ms (125.4%)
profiling 8.733 ms [8.583 ms, 8.884 ms] 4.282 ms (96.2%)
tracing 7.634 ms [7.518 ms, 7.75 ms] 3.183 ms (71.5%)
  • candidate results
Variant Request duration [CI 0.99] Δ no_agent
no_agent 4.462 ms [4.412 ms, 4.512 ms] -
iast 9.579 ms [9.419 ms, 9.74 ms] 5.118 ms (114.7%)
iast_FULL 13.78 ms [13.505 ms, 14.055 ms] 9.318 ms (208.8%)
iast_GLOBAL 10.197 ms [10.015 ms, 10.379 ms] 5.735 ms (128.5%)
profiling 9.002 ms [8.862 ms, 9.143 ms] 4.541 ms (101.8%)
tracing 8.095 ms [7.976 ms, 8.215 ms] 3.633 ms (81.4%)
Request duration reports for petclinic
gantt title petclinic - request duration [CI 0.99] : candidate=1.52.0-SNAPSHOT~df915f0a03, baseline=1.52.0-SNAPSHOT~d1794c17af dateFormat X axisFormat %s section baseline no_agent (37.981 ms) : 37667, 38296 . : milestone, 37981, appsec (50.07 ms) : 49607, 50534 . : milestone, 50070, code_origins (44.599 ms) : 44209, 44989 . : milestone, 44599, iast (44.459 ms) : 44079, 44839 . : milestone, 44459, profiling (47.137 ms) : 46689, 47586 . : milestone, 47137, tracing (44.719 ms) : 44338, 45100 . : milestone, 44719, section candidate no_agent (37.54 ms) : 37237, 37844 . : milestone, 37540, appsec (46.325 ms) : 45920, 46730 . : milestone, 46325, code_origins (44.838 ms) : 44464, 45213 . : milestone, 44838, iast (44.708 ms) : 44315, 45101 . : milestone, 44708, profiling (47.536 ms) : 47083, 47990 . : milestone, 47536, tracing (44.137 ms) : 43765, 44508 . : milestone, 44137, 
Loading
  • baseline results
Variant Request duration [CI 0.99] Δ no_agent
no_agent 37.981 ms [37.667 ms, 38.296 ms] -
appsec 50.07 ms [49.607 ms, 50.534 ms] 12.089 ms (31.8%)
code_origins 44.599 ms [44.209 ms, 44.989 ms] 6.618 ms (17.4%)
iast 44.459 ms [44.079 ms, 44.839 ms] 6.477 ms (17.1%)
profiling 47.137 ms [46.689 ms, 47.586 ms] 9.156 ms (24.1%)
tracing 44.719 ms [44.338 ms, 45.1 ms] 6.738 ms (17.7%)
  • candidate results
Variant Request duration [CI 0.99] Δ no_agent
no_agent 37.54 ms [37.237 ms, 37.844 ms] -
appsec 46.325 ms [45.92 ms, 46.73 ms] 8.785 ms (23.4%)
code_origins 44.838 ms [44.464 ms, 45.213 ms] 7.298 ms (19.4%)
iast 44.708 ms [44.315 ms, 45.101 ms] 7.168 ms (19.1%)
profiling 47.536 ms [47.083 ms, 47.99 ms] 9.996 ms (26.6%)
tracing 44.137 ms [43.765 ms, 44.508 ms] 6.597 ms (17.6%)

Dacapo

Parameters

Baseline Candidate
baseline_or_candidate baseline candidate
git_branch master bdu/r-gate
git_commit_date 1752761628 1752763864
git_commit_sha d1794c1 df915f0
release_version 1.52.0-SNAPSHOT~d1794c17af 1.52.0-SNAPSHOT~df915f0a03
See matching parameters
Baseline Candidate
application biojava biojava
ci_job_date 1752765790 1752765790
ci_job_id 1034135476 1034135476
ci_pipeline_id 70870999 70870999
cpu_model Intel(R) Xeon(R) Platinum 8259CL CPU @ 2.50GHz Intel(R) Xeon(R) Platinum 8259CL CPU @ 2.50GHz
kernel_version Linux runner-zfyrx7zua-project-304-concurrent-2-c35tsyqg 6.8.0-1031-aws #33~22.04.1-Ubuntu SMP Thu Jun 26 14:22:30 UTC 2025 x86_64 x86_64 x86_64 GNU/Linux Linux runner-zfyrx7zua-project-304-concurrent-2-c35tsyqg 6.8.0-1031-aws #33~22.04.1-Ubuntu SMP Thu Jun 26 14:22:30 UTC 2025 x86_64 x86_64 x86_64 GNU/Linux

Summary

Found 0 performance improvements and 0 performance regressions! Performance is the same for 11 metrics, 1 unstable metrics.

Execution time for tomcat
gantt title tomcat - execution time [CI 0.99] : candidate=1.52.0-SNAPSHOT~df915f0a03, baseline=1.52.0-SNAPSHOT~d1794c17af dateFormat X axisFormat %s section baseline no_agent (1.476 ms) : 1464, 1487 . : milestone, 1476, appsec (3.652 ms) : 3434, 3871 . : milestone, 3652, iast (2.202 ms) : 2139, 2264 . : milestone, 2202, iast_GLOBAL (2.236 ms) : 2174, 2299 . : milestone, 2236, profiling (2.035 ms) : 1985, 2085 . : milestone, 2035, tracing (2.012 ms) : 1963, 2061 . : milestone, 2012, section candidate no_agent (1.476 ms) : 1465, 1488 . : milestone, 1476, appsec (3.573 ms) : 3361, 3786 . : milestone, 3573, iast (2.209 ms) : 2146, 2272 . : milestone, 2209, iast_GLOBAL (2.244 ms) : 2181, 2308 . : milestone, 2244, profiling (2.044 ms) : 1994, 2095 . : milestone, 2044, tracing (2.02 ms) : 1971, 2068 . : milestone, 2020, 
Loading
  • baseline results
Variant Execution Time [CI 0.99] Δ no_agent
no_agent 1.476 ms [1.464 ms, 1.487 ms] -
appsec 3.652 ms [3.434 ms, 3.871 ms] 2.177 ms (147.5%)
iast 2.202 ms [2.139 ms, 2.264 ms] 725.646 µs (49.2%)
iast_GLOBAL 2.236 ms [2.174 ms, 2.299 ms] 760.395 µs (51.5%)
profiling 2.035 ms [1.985 ms, 2.085 ms] 558.867 µs (37.9%)
tracing 2.012 ms [1.963 ms, 2.061 ms] 536.156 µs (36.3%)
  • candidate results
Variant Execution Time [CI 0.99] Δ no_agent
no_agent 1.476 ms [1.465 ms, 1.488 ms] -
appsec 3.573 ms [3.361 ms, 3.786 ms] 2.097 ms (142.1%)
iast 2.209 ms [2.146 ms, 2.272 ms] 732.582 µs (49.6%)
iast_GLOBAL 2.244 ms [2.181 ms, 2.308 ms] 768.271 µs (52.0%)
profiling 2.044 ms [1.994 ms, 2.095 ms] 568.107 µs (38.5%)
tracing 2.02 ms [1.971 ms, 2.068 ms] 543.416 µs (36.8%)
Execution time for biojava
gantt title biojava - execution time [CI 0.99] : candidate=1.52.0-SNAPSHOT~df915f0a03, baseline=1.52.0-SNAPSHOT~d1794c17af dateFormat X axisFormat %s section baseline no_agent (14.758 s) : 14758000, 14758000 . : milestone, 14758000, appsec (14.983 s) : 14983000, 14983000 . : milestone, 14983000, iast (18.089 s) : 18089000, 18089000 . : milestone, 18089000, iast_GLOBAL (18.002 s) : 18002000, 18002000 . : milestone, 18002000, profiling (16.055 s) : 16055000, 16055000 . : milestone, 16055000, tracing (14.959 s) : 14959000, 14959000 . : milestone, 14959000, section candidate no_agent (15.672 s) : 15672000, 15672000 . : milestone, 15672000, appsec (14.891 s) : 14891000, 14891000 . : milestone, 14891000, iast (18.632 s) : 18632000, 18632000 . : milestone, 18632000, iast_GLOBAL (18.276 s) : 18276000, 18276000 . : milestone, 18276000, profiling (15.21 s) : 15210000, 15210000 . : milestone, 15210000, tracing (15.015 s) : 15015000, 15015000 . : milestone, 15015000, 
Loading
  • baseline results
Variant Execution Time [CI 0.99] Δ no_agent
no_agent 14.758 s [14.758 s, 14.758 s] -
appsec 14.983 s [14.983 s, 14.983 s] 225.0 ms (1.5%)
iast 18.089 s [18.089 s, 18.089 s] 3.331 s (22.6%)
iast_GLOBAL 18.002 s [18.002 s, 18.002 s] 3.244 s (22.0%)
profiling 16.055 s [16.055 s, 16.055 s] 1.297 s (8.8%)
tracing 14.959 s [14.959 s, 14.959 s] 201.0 ms (1.4%)
  • candidate results
Variant Execution Time [CI 0.99] Δ no_agent
no_agent 15.672 s [15.672 s, 15.672 s] -
appsec 14.891 s [14.891 s, 14.891 s] -781.0 ms (-5.0%)
iast 18.632 s [18.632 s, 18.632 s] 2.96 s (18.9%)
iast_GLOBAL 18.276 s [18.276 s, 18.276 s] 2.604 s (16.6%)
profiling 15.21 s [15.21 s, 15.21 s] -462.0 ms (-2.9%)
tracing 15.015 s [15.015 s, 15.015 s] -657.0 ms (-4.2%)
Copy link
Contributor

@ddyurchenko ddyurchenko left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good from my side! 🎉
Need also @igoragoli review for the final approval.

@ddyurchenko ddyurchenko requested a review from igoragoli July 1, 2025 13:43
@bric3 bric3 requested a review from a team July 1, 2025 14:07
Copy link
Contributor

@igoragoli igoragoli left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @bric3! 🙌

There are just some points regarding startup:petclinic.* thresholds that need to be addressed before merging.

@bric3 bric3 requested a review from a team July 2, 2025 15:24
@bric3 bric3 force-pushed the bdu/r-gate branch 7 times, most recently from 7e61d82 to feb60ef Compare July 10, 2025 12:48
@ddyurchenko ddyurchenko self-requested a review July 11, 2025 14:39
@bric3 bric3 marked this pull request as ready for review July 11, 2025 15:45
Copy link
Contributor

@igoragoli igoragoli left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good, thanks Brice!

I think it's a good idea to include the source for the SLOs in the thresholds file, nice.

when: always
- when: manual
allow_failure: true
- when: on_success # TODO: PLEASE revert before merging the PR
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

todo:

  • To revert before merging
@ddyurchenko
Copy link
Contributor

Still see some yellow in results, so while job won't block releases, it will send warning messages in Java guild channel.
Augusto practically finalized the changes to reporting (so now confidence intervals and thresholds are clearly displayed, incl. warning threshold). I propose to rebuild Java image, so the changes are included, and update the SLOs once more for p50 latency, p99 latency and startup time mean value (execution_time metric), so they are no longer in yellow zone.

SLO breach check | SLO breach check | #### high_load--only-tracing SLO breach check | SLO breach check | - 🟩 `throughput` 1[250](https://gitlab.ddbuild.io/DataDog/apm-reliability/dd-trace-java/-/jobs/1025486085#L250).55 op/s > 1100.00 op/s SLO breach check | SLO breach check | #### high_load--otel-latest SLO breach check | SLO breach check | - 🟩 `throughput` 1245.11 op/s > 1100.00 op/s SLO breach check | SLO breach check | #### normal_operation--only-tracing SLO breach check | SLO breach check | - 🟩 `agg_http_req_duration_p50` 2.12 ms < 2.36 ms SLO breach check | - 🟨 `agg_http_req_duration_p99` 7.10 ms < 7.89 ms SLO breach check | SLO breach check | #### normal_operation--otel-latest SLO breach check | SLO breach check | - 🟨 `agg_http_req_duration_p50` 2.12 ms < 2.34 ms SLO breach check | - 🟨 `agg_http_req_duration_p99` 8.75 ms < 9.50 ms SLO breach check | SLO breach check | #### startup:petclinic:appsec:GlobalTracer SLO breach check | SLO breach check | - 🟨 `execution_time` 235.01 ms < [260](https://gitlab.ddbuild.io/DataDog/apm-reliability/dd-trace-java/-/jobs/1025486085#L260).00 ms SLO breach check | SLO breach check | #### startup:petclinic:iast:GlobalTracer SLO breach check | SLO breach check | - 🟩 `execution_time` 231.70 ms < 260.00 ms SLO breach check | SLO breach check | #### startup:petclinic:profiling:GlobalTracer SLO breach check | SLO breach check | - 🟨 `execution_time` 361.27 ms < 368.00 ms SLO breach check | SLO breach check | #### startup:petclinic:tracing:GlobalTracer SLO breach check | SLO breach check | - 🟨 `execution_time` 243.13 ms < 260.00 ms SLO breach check | SLO breach check | --- SLO breach check | SLO breach check | Legend: SLO breach check | - 🟩 pass SLO breach check | - 🟥 breach SLO breach check | - 🟨 warning SLO breach check | - (unstable) unstable 
@igoragoli
Copy link
Contributor

igoragoli commented Jul 16, 2025

Augusto practically finalized the changes to reporting

Changes for reporting done! 🙂

I'm updating the registry.ddbuild.io/images/benchmarking-platform-tools-ubuntu:latest image (the one used in the check-slo-breaches job across the board) here: https://gitlab.ddbuild.io/DataDog/benchmarking-platform-tools/-/jobs/1031270236

@ddyurchenko
Copy link
Contributor

Thanks @igoragoli ! I reran the job https://gitlab.ddbuild.io/DataDog/apm-reliability/dd-trace-java/-/jobs/1031460223, based on its results, will provide suggestions to @bric3 .

# https://datadoghq.atlassian.net/wiki/x/LgI1LgE#Specification
# Measurements
# https://benchmarking.us1.prod.dog/trends?projectId=4&branch=master&trendsTab=per_scenario
scenarios:
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Based on the results of https://gitlab.ddbuild.io/DataDog/apm-reliability/dd-trace-java/-/jobs/1031460223, I suggest to update the SLOs to the following values:

 # Standard macrobenchmarks # https://benchmarking.us1.prod.dog/trends?projectId=4&branch=master&trendsTab=per_scenario&scenario=normal_operation%2Fonly-tracing&trendsType=scenario - name: normal_operation/only-tracing thresholds: - agg_http_req_duration_p50 < 2.36 ms - agg_http_req_duration_p99 < 7.89 ms # https://benchmarking.us1.prod.dog/trends?projectId=4&branch=master&trendsTab=per_scenario&scenario=normal_operation%2Fotel-latest&trendsType=scenario - name: normal_operation/otel-latest thresholds: - agg_http_req_duration_p50 < 2.5 ms - agg_http_req_duration_p99 < 10 ms # https://benchmarking.us1.prod.dog/trends?projectId=4&branch=master&trendsTab=per_scenario&scenario=high_load%2Fonly-tracing&trendsType=scenario - name: high_load/only-tracing thresholds: - throughput > 1100.0 op/s # https://benchmarking.us1.prod.dog/trends?projectId=4&branch=master&trendsTab=per_scenario&scenario=high_load%2Fotel-latest&trendsType=scenario - name: high_load/otel-latest thresholds: - throughput > 1100.0 op/s # Startup macrobenchmarks # https://benchmarking.us1.prod.dog/trends?projectId=4&branch=master&trendsTab=per_scenario&scenario=startup%3Apetclinic%3Atracing%3AGlobalTracer&trendsType=scenario # https://benchmarking.us1.prod.dog/trends?projectId=4&branch=master&trendsTab=per_scenario&scenario=startup%3Apetclinic%3Aappsec%3AGlobalTracer&trendsType=scenario # https://benchmarking.us1.prod.dog/trends?projectId=4&branch=master&trendsTab=per_scenario&scenario=startup%3Apetclinic%3Aiast%3AGlobalTracer&trendsType=scenario - name: "startup:petclinic:(tracing|appsec|iast):GlobalTracer" thresholds: - execution_time < 280 ms # https://benchmarking.us1.prod.dog/trends?projectId=4&branch=master&trendsTab=per_scenario&scenario=startup%3Apetclinic%3Aprofiling%3AGlobalTracer&trendsType=scenario - name: "startup:petclinic:profiling:GlobalTracer" thresholds: - execution_time < 420 ms 
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Another tweak
image

Copy link
Contributor

@ddyurchenko ddyurchenko left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Need to update some of SLO thresholds before merge.

@bric3 bric3 enabled auto-merge (squash) July 22, 2025 10:08
@bric3 bric3 merged commit 4e4c286 into master Jul 22, 2025
503 checks passed
@bric3 bric3 deleted the bdu/r-gate branch July 22, 2025 10:33
@github-actions github-actions bot added this to the 1.52.0 milestone Jul 22, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

comp: tooling Build & Tooling tag: no release notes Changes to exclude from release notes

3 participants