[FRONTEND] Complete rewrite of the runtime #644

ptillet · 2022-09-11T23:48:18Z

This PR completely rewrites the runtime of Triton to be more lean and clearly separate the compilation step from the just-in-time cache logic. This should substantially remove launch overhead, and also pave the way for even lower overhead in the future as support for type annotations is added, and users start explicitly leveraging the <500ns C entry point that triton.compile now provides when specialization hints are known.

Young768 · 2022-10-12T21:53:58Z

@ptillet Hi regarding lowering the cpu overhead, I wonder if this PR can handle those ops with dynamic shapes? And how?

ptillet · 2022-10-12T21:57:46Z

I am not sure I understand the question. Triton kernels recompile int arguments when they are equal to 1 or a multiple of 16. If a frontend maps shapes to int arguments then things won't get recompiled everytime the shapes change

Young768 · 2022-10-12T22:06:30Z

Do you mean that you actually cache the result for every seen shapes? If there is a new shape, triton still needs to do the compilation? My question is related to some language models. Some of kernels could have variable length of inputs.

ptillet · 2022-10-12T22:08:33Z

This is not what I said. Triton maintains only three versions of each int arguments: any value, multiple of 16, and equal to 1.

Young768 · 2022-10-12T22:19:59Z

what if they are not equal to 1 or a multiple of 16?

ptillet · 2022-10-12T22:27:28Z

Then it's it's the third version, they're unannotated int32 arguments.

This PR completely rewrites the runtime of Triton to be more lean and clearly separate the compilation step from the just-in-time caching logic. This should substantially reduce launch overhead.

ptillet added 30 commits September 10, 2022 19:21

some work

089e839

Merge branch 'master' into phil/new-runtime

5462d32

seems to work

9282bb3

more work

88c87aa

more work

0683209

more work

d67e29c

more progress

1d9ee1e

test-core passes

03be2ec

.

ba6151a

.

60c2f64

.

f0d7a2d

.

fb5ef0a

.

f8020e9

.

f9fd4bc

.

7ea2217

.

dec0add

.

c96f102

.

b74c06a

.

ac763a9

.

5f90262

.

a8a7f2a

.

87ede81

.

e2d4e3f

debug

1c3e741

.

ac18830

.

d4e1c96

.

8c91efd

.

c3ed8db

.

542092a

.

ec7e7d9

ptillet added 5 commits September 14, 2022 14:53

Merge branch 'master' into phil/new-runtime

5a6bdbb

Now using current_device and set_device again

7c90059

Fixed initialization issue

0d9ed9a

Fix more bugs

f2075d7

Merge branch 'master' into HEAD

adc65de

ptillet force-pushed the phil/new-runtime branch from 7d4c3b4 to adc65de Compare September 17, 2022 01:04

ptillet added 3 commits September 16, 2022 18:38

.

43dddc3

style

25bbd59

fixup hook

df4d57b

ptillet force-pushed the phil/new-runtime branch from 912d9cd to df4d57b Compare September 18, 2022 03:14

ptillet added 3 commits September 17, 2022 20:26

some cleaning

b157f03

.

5ed921c

style

ee772da

ptillet force-pushed the phil/new-runtime branch from b8a249d to ee772da Compare September 18, 2022 03:59

ptillet added 2 commits September 17, 2022 21:24

fixup

3f3cc8c

Merge branch 'master' into phil/new-runtime

cfa8d18

ptillet merged commit 4a77dfb into master Sep 18, 2022

ptillet deleted the phil/new-runtime branch September 18, 2022 15:51

jansel mentioned this pull request Sep 24, 2022

Use new Triton runtime pytorch/torchdynamo#1338

Merged

pommedeterresautee mentioned this pull request Oct 16, 2022

Add new annotation based on triton.compile to replace the use of jit ELS-RD/kernl#106

Open

iclementine mentioned this pull request May 15, 2024

Why not allow JITFunction as parameter to another JITFunction(high-order jit function)? #3918

Open

ZzEeKkAa pushed a commit to ZzEeKkAa/triton that referenced this pull request Aug 5, 2024

Update PyTorch pin to fix triton-lang#640 (triton-lang#644)

5970464

simonidaa mentioned this pull request Dec 30, 2024

[WIP] Optimize Autotuner with Parallel Compilation #5436

Open

7 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[FRONTEND] Complete rewrite of the runtime #644

[FRONTEND] Complete rewrite of the runtime #644

Uh oh!

ptillet commented Sep 11, 2022

Young768 commented Oct 12, 2022

ptillet commented Oct 12, 2022

Young768 commented Oct 12, 2022

ptillet commented Oct 12, 2022

Young768 commented Oct 12, 2022

ptillet commented Oct 12, 2022

Labels

3 participants

[FRONTEND] Complete rewrite of the runtime #644

[FRONTEND] Complete rewrite of the runtime #644

Uh oh!

Conversation

ptillet commented Sep 11, 2022

Young768 commented Oct 12, 2022

ptillet commented Oct 12, 2022

Young768 commented Oct 12, 2022

ptillet commented Oct 12, 2022

Young768 commented Oct 12, 2022

ptillet commented Oct 12, 2022

Labels

3 participants