
Conversation

@alanwaketan
Collaborator

Summary:
This pull request introduces a helper for gmm_backward. I'm still debating whether we need to make gmm an autograd.Function, given that we will do manual back-propagation in Mixtral.

Test Plan:
python test/test_gmm.py
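For context, here is a minimal sketch of what promoting gmm to an autograd.Function could look like. The module path and the helper signatures gmm(lhs, rhs, group_sizes) and gmm_backward(grad, lhs, rhs, group_sizes) -> (grad_lhs, grad_rhs) are assumptions for illustration, not the final API:

```python
# Hypothetical sketch only; the import path and helper signatures are assumed.
import torch
from torch_xla.experimental.custom_kernel import gmm, gmm_backward


class Gmm(torch.autograd.Function):

  @staticmethod
  def forward(ctx, lhs, rhs, group_sizes):
    # Save the inputs needed to recompute gradients in backward.
    ctx.save_for_backward(lhs, rhs, group_sizes)
    return gmm(lhs, rhs, group_sizes)

  @staticmethod
  def backward(ctx, grad_output):
    lhs, rhs, group_sizes = ctx.saved_tensors
    grad_lhs, grad_rhs = gmm_backward(grad_output, lhs, rhs, group_sizes)
    # group_sizes is integer metadata, so it receives no gradient.
    return grad_lhs, grad_rhs, None
```

With such a wrapper, Gmm.apply(lhs, rhs, group_sizes) would flow gradients through autograd automatically; calling gmm_backward directly, as planned for Mixtral, skips the wrapper entirely.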

@alanwaketan alanwaketan self-assigned this May 30, 2024

```python
@unittest.skipIf(xr.device_type() != 'TPU', "This test only works on TPU.")
def test_gmm_backward(self):
  self._init_test_cases()
```
Collaborator

nit, you might need a met.clear_all() here.
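For reference, a sketch of where that could go, assuming the test module imports torch_xla.debug.metrics as met like the other kernel tests:

```python
@unittest.skipIf(xr.device_type() != 'TPU', "This test only works on TPU.")
def test_gmm_backward(self):
  met.clear_all()  # reset metric counters so earlier tests don't leak into later checks
  self._init_test_cases()
```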

Collaborator Author

Let me fix it in the next PR. Don't want to waste CI cycles.

@JackCaoG (Collaborator) left a comment

What do you mean by "manual backprop in mixtral"?

@alanwaketan
Collaborator Author

Skip GPU tests to move fast.

@alanwaketan
Collaborator Author

> What do you mean by "manual backprop in mixtral"?

We just need to override the MoE backward pass to accommodate gmm backward and manual sharding. You will know what that means once the code is ready. Thanks for approving this change.
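Roughly, a hypothetical illustration (none of these tensor names are from this PR): in a hand-written MoE backward pass, the helper is called directly instead of going through autograd:

```python
# Hypothetical: grad_out, hidden, expert_w, group_sizes are placeholder names
# for the incoming gradient, expert inputs, expert weights, and group sizes.
grad_hidden, grad_expert_w = gmm_backward(grad_out, hidden, expert_w, group_sizes)
```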

@alanwaketan alanwaketan merged commit c96c95a into master May 30, 2024
@alanwaketan alanwaketan deleted the alanwaktan/tgmm3 branch May 30, 2024 03:48
