Do not run mir opts for targets with convergent ops and add convergent attribute #149637
GPU targets have convergent operations that require careful handling
when running optimizations; for example, they must not be duplicated.
A typical convergent operation is a barrier (syncthreads).
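As a minimal sketch of why duplication is a problem, consider a reduction step that branches on the thread index before hitting a barrier. The `sync_threads` stub below stands in for a real GPU barrier intrinsic (e.g. CUDA's `__syncthreads` / nvptx `bar.sync`); it is illustrative only, not an API from this PR:

```rust
/// Stub standing in for a GPU barrier intrinsic. On a real GPU target
/// this call would be a convergent operation.
fn sync_threads() { /* all threads of the block wait here */ }

/// One step of a shared-memory reduction.
fn reduce_step(shared: &mut [f32; 256], tid: usize) {
    if tid < 128 {
        shared[tid] += shared[tid + 128];
    }
    // Every thread must reach the *same* barrier call. If an optimization
    // duplicated (e.g. sank) the barrier into both arms of the branch
    // above, different threads could wait at different barrier instances
    // and the kernel would deadlock or read stale data.
    sync_threads();
}

fn main() {
    let mut shared = [1.0_f32; 256];
    // On the CPU we simulate a single "thread" to keep the sketch runnable.
    reduce_step(&mut shared, 0);
    assert_eq!(shared[0], 2.0);
}
```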
We do not want to deal with convergent operations in MIR optimizations,
so for such targets we set the MIR optimization level to 0 and skip all
MIR optimizations.
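A hedged sketch of what that gate might look like (the function and parameter names here are assumptions for illustration, not the PR's actual rustc code):

```rust
/// Illustrative only: clamp the effective MIR optimization level to 0
/// when the target declares convergent operations, so no MIR opts run.
fn effective_mir_opt_level(requested: usize, target_has_convergent_ops: bool) -> usize {
    if target_has_convergent_ops {
        // Convergent ops (barriers etc.) must not be moved, duplicated,
        // or deleted by MIR transforms, so skip MIR opts entirely.
        0
    } else {
        requested
    }
}

fn main() {
    // A GPU-like target ignores the requested level.
    assert_eq!(effective_mir_opt_level(2, true), 0);
    // Other targets keep the requested level.
    assert_eq!(effective_mir_opt_level(2, false), 2);
}
```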
On targets with convergent operations, we need to add the convergent
attribute to all functions that perform convergent operations. Following
clang, we conservatively apply the attribute to all functions when
compiling for such a target and rely on LLVM removing the attribute
where it is not needed.
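A simplified sketch of that conservative policy (the types and names are placeholders, not the codegen code from this PR):

```rust
/// Placeholder for the set of LLVM function attributes we would emit.
#[derive(Debug, Default, PartialEq)]
struct FnAttrs {
    convergent: bool,
}

/// Mirror clang: on targets with convergent operations, mark every
/// function `convergent`. LLVM later drops the attribute from functions
/// it can prove contain no convergent operations.
fn fn_attrs_for_target(target_has_convergent_ops: bool) -> FnAttrs {
    FnAttrs { convergent: target_has_convergent_ops }
}

fn main() {
    assert!(fn_attrs_for_target(true).convergent);
    assert!(!fn_attrs_for_target(false).convergent);
}
```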
The amdgpu and nvptx targets are marked as having convergent operations.
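For illustration, a sketch of how a target could carry such a flag (the struct and field names are assumptions, not necessarily what the actual target spec uses):

```rust
/// Illustrative target-options struct; only the flag relevant here is shown.
struct GpuTargetOptions {
    /// True for GPU-like targets (amdgpu, nvptx) whose ISAs include
    /// convergent operations such as barriers.
    has_convergent_ops: bool,
}

fn nvptx_like_options() -> GpuTargetOptions {
    GpuTargetOptions { has_convergent_ops: true }
}
```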
Fixes #137086; see that issue for details.
Tracking issue: #135024
cc @RDambrosio016 @kjetilkjeka for nvptx
cc @ZuseZ4