Skip to content

Conversation

@ggggxm
Copy link
Contributor

@ggggxm ggggxm commented Jul 16, 2025

PR Category

Operator Mechanism

PR Types

Bug fixes

Description

variable_length_memory_efficient_attention在默认情况下生成且调用的是对齐的kernel,但实际情况下输入qkv的维度可能不能对齐,为保障正确性,在SM>80时也支持生成非对齐kernel。同时添加value_head_size的对齐检查。

@paddle-bot
Copy link

paddle-bot bot commented Jul 16, 2025

你的PR提交成功,感谢你对开源项目的贡献!
请关注后续CI自动化测试结果,详情请参考Paddle-CI手册
Your PR has been submitted. Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

@wanghuancoder wanghuancoder merged commit 7a8b039 into PaddlePaddle:develop Jul 21, 2025
73 of 74 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

3 participants