[AIE2p] Use multi-slot pseudo for const COPY with unique def #454

krishnamtibrewala · 2025-05-02T00:19:41Z

This will help in Liner code. Since reg-to-reg copy goes only on one slot by using multi-slot in case of const copy we can bundle them or use different slot for better packing.
PS : Very small optimization

krishnamtibrewala · 2025-05-02T00:23:29Z

Note : The best place to do this would be after register coalescing, that is where const COPY are created, and we should do this before RA

llvm/lib/Target/AIE/aie2p/AIE2PInstrInfo.cpp

martien-de-jong · 2025-05-02T13:31:58Z

How is this different from constant rematerialization by RA?

krishnamtibrewala · 2025-05-02T17:27:58Z

How is this different from constant rematerialization by RA?

Hi @martien-de-jong, i do not see that happening in RA, additionally the idea here is to covert the Scalar COPY instruction into PseudoImm move so that we can be packed in a same VLIW bundle.

martien-de-jong · 2025-05-05T09:25:25Z

What is stopping you from implementing this before RA? Also, I think that register coalescing removes copies rather than creating them. I think that PHI elimination is the biggest creator of copies.

krishnamtibrewala · 2025-05-05T16:55:09Z

What is stopping you from implementing this before RA? Also, I think that register coalescing removes copies rather than creating them. I think that PHI elimination is the biggest creator of copies.

Hi @martien-de-jong you are right PHI is the biggest creator of COPY, the register coalescing pass helps to clean up these copies to a certain extend leading to IR like. (Note: When it comes to COPY from a unmatching sub-reg to sub-reg, coalescing pass does not do a great job for us)

bb.0
%1 = mov_imm_pseudo 0
%2 = COPY %1
%3 = ADD %1, %x

bb.1
%4 = COPY %1

The only motivation to implement it before RA is the live range of %1 might reduce, aiding it in RA (both are big IFs)
The current implementation was more from ease of implementation & show a working PoC by using copyPhysReg(...) to pick a mov_imm_pseudo rather than mov_scl when possible, helping scheduler to do better bundling.

I saw this helpful in Conv2D_bfp16_* test cases.

martien-de-jong · 2025-05-06T07:43:25Z

@krishnamtibrewala yes, everything is related. There are more liverange considerations around REQ_SEQ and subreg use, especially across PHI nodes. I have a feeling that a combined PHI-elimination + constant materialization + register coalescing might be quite powerful. (although rematerialization might be reserved as a repair mechanism in core RA. It might influence coalescing decisions though.)

[AutoBump] Merge with 8a9921f (Oct 23) (17)

krishnamtibrewala requested review from F-Stuckmann, SagarMaheshwari99, abhinay-anubola, abnikant, andcarminati, gbossu, katerynamuts, khallouh, konstantinschwarz, martien-de-jong, niwinanto and stephenneuendorffer as code owners May 2, 2025 00:19

krishnamtibrewala commented May 2, 2025

View reviewed changes

llvm/lib/Target/AIE/aie2p/AIE2PInstrInfo.cpp Outdated Show resolved Hide resolved

martien-de-jong reviewed May 2, 2025

View reviewed changes

llvm/lib/Target/AIE/aie2p/AIE2PInstrInfo.cpp Outdated Show resolved Hide resolved

martien-de-jong reviewed May 2, 2025

View reviewed changes

llvm/lib/Target/AIE/aie2p/AIE2PInstrInfo.cpp Outdated Show resolved Hide resolved

[AIE2p] Use multi-slot pseudo for const COPY with unique def

b49e586

krishnamtibrewala force-pushed the aie2p-mov-const-opt branch from ece36b2 to b49e586 Compare May 2, 2025 17:20

mgehre-amd pushed a commit that referenced this pull request Aug 21, 2025

Merge pull request #454 from Xilinx/bump_to_8a9921f5

cc2a236

[AutoBump] Merge with 8a9921f (Oct 23) (17)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[AIE2p] Use multi-slot pseudo for const COPY with unique def #454

[AIE2p] Use multi-slot pseudo for const COPY with unique def #454

krishnamtibrewala commented May 2, 2025

krishnamtibrewala commented May 2, 2025

Uh oh!

Uh oh!

Uh oh!

martien-de-jong commented May 2, 2025

krishnamtibrewala commented May 2, 2025

martien-de-jong commented May 5, 2025

krishnamtibrewala commented May 5, 2025

martien-de-jong commented May 6, 2025

Labels

3 participants

[AIE2p] Use multi-slot pseudo for const COPY with unique def #454

Are you sure you want to change the base?

[AIE2p] Use multi-slot pseudo for const COPY with unique def #454

Conversation

krishnamtibrewala commented May 2, 2025

krishnamtibrewala commented May 2, 2025

Uh oh!

Uh oh!

Uh oh!

martien-de-jong commented May 2, 2025

krishnamtibrewala commented May 2, 2025

martien-de-jong commented May 5, 2025

krishnamtibrewala commented May 5, 2025

martien-de-jong commented May 6, 2025

Labels

3 participants