@ysiraichi
Collaborator

Fix: #7084

This PR adds data-type promotion to the gelu_backward operation. Previously there was none, so the kernel implicitly expected its arguments to have the same data type. That might not hold when using AMP (automatic mixed precision).
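To illustrate the idea, here is a minimal pure-Python sketch of promoting both arguments to a common type before computing the GELU gradient. It is not the code changed in this PR: the `promote_types` helper, the string dtype names, and the small promotion table are assumptions made only for this example, and the gradient uses the exact (erf-based) GELU formula.

```python
import math

# Hypothetical promotion ranks for a few floating-point types (illustration only).
_RANK = {"float16": 1, "bfloat16": 1, "float32": 2, "float64": 3}


def promote_types(a: str, b: str) -> str:
    """Return a common dtype that both inputs can be cast to."""
    if a == b:
        return a
    if _RANK[a] == _RANK[b]:
        # float16 vs bfloat16: neither can represent the other, so widen.
        return "float32"
    return a if _RANK[a] > _RANK[b] else b


def gelu_backward(grad: float, grad_dtype: str, x: float, x_dtype: str):
    """Compute grad * d/dx GELU(x) and report the promoted result dtype."""
    out_dtype = promote_types(grad_dtype, x_dtype)
    cdf = 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))         # Phi(x)
    pdf = math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)  # phi(x)
    return grad * (cdf + x * pdf), out_dtype
```

With mixed inputs such as a float32 gradient and a bfloat16 activation, the sketch reports float32 as the result type instead of assuming both arguments already match.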

cc @miladm @JackCaoG

@ysiraichi ysiraichi requested a review from JackCaoG May 21, 2024 23:52
@vanbasten23
Collaborator

Curious: how did you find out it was gelu_backward from the error message in #7084 (comment)? I don't see any hint of gelu_backward there.

@ysiraichi
Collaborator Author

Since it was a non-dynamo bug, I found it by running with XLA_USE_EAGER_DEBUG_MODE=1.
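For anyone wanting to try the same approach, a minimal sketch of enabling the flag for a child process follows. The `eager_debug_env` helper and the `repro.py` script name are hypothetical; only the environment variable name comes from the comment above. The assumption is that, with each op executed on its own, a failure points at the op that triggered it.

```python
import os


def eager_debug_env(base=None):
    """Return a copy of the environment with XLA eager debug mode enabled."""
    env = dict(os.environ if base is None else base)
    env["XLA_USE_EAGER_DEBUG_MODE"] = "1"
    return env


# Usage sketch (repro.py is a hypothetical script that reproduces the error):
#   import subprocess, sys
#   subprocess.run([sys.executable, "repro.py"], env=eager_debug_env())
```

Passing the environment to a child process keeps the flag out of the current shell session, so later runs are unaffected.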
