@lshpku
Contributor

@lshpku lshpku commented Apr 11, 2025

PR Category

CINN

PR Types

Bug fixes

Description

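Explicitly set min_blocks_per_sm for grid reduce kernels, so that the compiler does not allocate so many registers that the kernel fails to launch.

Note: nvrtc and nvcc behave differently here. With nvcc the kernel works even without this setting, but nvrtc reports an error, so we set it explicitly.

TODO: Avoid hard-coding constants such as 1024. Changing this means reworking everything from the initial tile config onward, which is a large change, so it will be addressed systematically later.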
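
As a rough illustration of why a launch bound limits register allocation, the sketch below estimates the per-thread register budget implied by a block size and a minimum number of resident blocks per SM (the two arguments of CUDA's `__launch_bounds__`). It is not the code from this PR. The helper name `RegisterBudgetPerThread`, the 65536-register file size, and the 255-register per-thread cap are assumptions chosen for illustration, and the estimate ignores register allocation granularity.

```cpp
#include <cassert>

// Assumed hardware cap on registers per thread (illustrative value).
constexpr int kMaxRegsPerThreadHw = 255;

// Hypothetical helper: estimates how many 32-bit registers each thread may
// use so that `min_blocks_per_sm` blocks of `max_threads_per_block` threads
// fit into an SM register file of `regs_per_sm` registers.
// Ignores allocation granularity, so real compilers may pick a lower value.
inline int RegisterBudgetPerThread(int regs_per_sm,
                                   int max_threads_per_block,
                                   int min_blocks_per_sm) {
  assert(regs_per_sm > 0 && max_threads_per_block > 0 &&
         min_blocks_per_sm > 0);
  const int resident_threads = max_threads_per_block * min_blocks_per_sm;
  const int budget = regs_per_sm / resident_threads;
  // The budget can never exceed the per-thread hardware cap.
  return budget < kMaxRegsPerThreadHw ? budget : kMaxRegsPerThreadHw;
}
```

Under these assumptions, `RegisterBudgetPerThread(65536, 1024, 1)` gives 64: a kernel compiled to use more than 64 registers per thread could not be launched with 1024 threads per block, which matches the kind of failure described above.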

Pcard-85711

@paddle-bot
paddle-bot bot commented Apr 11, 2025

Your PR has been submitted. Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

@lshpku lshpku merged commit 207e34a into PaddlePaddle:develop Apr 11, 2025
33 of 34 checks passed
YqGe585 pushed a commit to YqGe585/Paddle that referenced this pull request May 7, 2025