Skip to content

Conversation

@TR666
Copy link
Contributor

@TR666 TR666 commented Apr 3, 2024

PR Category

Performance Optimization

PR Types

Performance

Description

  1. add cross_attention_xpu_fuse_pass
  2. add cross_attention_xpu_kernel
  3. speedup detect pattern of decoder_attention_xpu_fuse_pass
@paddle-bot
Copy link

paddle-bot bot commented Apr 3, 2024

你的PR提交成功,感谢你对开源项目的贡献!
请关注后续CI自动化测试结果,详情请参考Paddle-CI手册
Your PR has been submitted. Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

@paddle-bot
Copy link

paddle-bot bot commented Apr 3, 2024

❌ The PR is not created using PR's template. You can refer to this Demo.
Please use PR's template, it helps save our maintainers' time so that more developers get helped.

@@ -0,0 +1,666 @@
// Copyright (c) 2023 PaddlePaddle Authors. All Rights Reserved.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

2024

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done

@TR666 TR666 force-pushed the shuziren-ca branch 3 times, most recently from 371e691 to 2277525 Compare April 7, 2024 11:38
Copy link
Contributor

@zhupengyang zhupengyang left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@zhupengyang zhupengyang merged commit 22fb3b1 into PaddlePaddle:develop Apr 12, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

4 participants