@cangtianhuang
Contributor

PR Category

Operator Mechanism

PR Types

Bug fixes

Description

Error description:

After #73194 made the bigtensor error explicit, conv_transpose tests still report CUDA error 9:
[image]

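Root cause:

Investigation showed that the CUDNN_ENFORCE_TENSOR_SIZE_SUPPORTED check was placed too late: the error already occurs while the cuDNN descriptor ConvArgs is being constructed. The check in both the forward and backward passes is therefore moved up, ahead of GPUDNNDataLayout.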
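The ordering issue can be illustrated with a minimal, self-contained sketch. This is not the Paddle kernel code: `FakeConvArgs`, `EnforceTensorSizeSupported`, and `PrepareConvTranspose` are hypothetical stand-ins, and the `INT32_MAX` element limit is an assumption made for illustration only. The point is just that the size check has to run before anything that builds descriptors from the shape.

```cpp
#include <cstdint>
#include <limits>
#include <stdexcept>
#include <string>
#include <vector>

// Hypothetical stand-in for a descriptor bundle such as ConvArgs.
// Its constructor narrows each dimension to int, so it is only safe
// to build once the shape has been validated.
struct FakeConvArgs {
  std::vector<int> dims;
  explicit FakeConvArgs(const std::vector<int64_t>& shape) {
    for (int64_t d : shape) dims.push_back(static_cast<int>(d));
  }
};

// Element count computed in 64-bit arithmetic.
int64_t Numel(const std::vector<int64_t>& shape) {
  int64_t n = 1;
  for (int64_t d : shape) n *= d;
  return n;
}

// Hypothetical analogue of the size check. The INT32_MAX limit is an
// assumption for this sketch, not the real macro's definition.
void EnforceTensorSizeSupported(const std::vector<int64_t>& shape) {
  if (Numel(shape) > std::numeric_limits<int32_t>::max()) {
    throw std::invalid_argument("tensor too large: numel = " +
                                std::to_string(Numel(shape)));
  }
}

// The check runs first, so an oversized tensor is rejected with a clear
// error before any descriptor is constructed.
FakeConvArgs PrepareConvTranspose(const std::vector<int64_t>& shape) {
  EnforceTensorSizeSupported(shape);
  return FakeConvArgs(shape);
}
```

With this ordering, a call such as `PrepareConvTranspose({70000, 70000})` throws `std::invalid_argument` instead of reaching the descriptor constructor, while a small shape such as `{2, 3, 4}` is passed through unchanged.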

After the change, the test passes (already let through by paddle_error_dismiss):
[image]

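Other notes:

  1. This problem was not observed for conv, but the bigtensor test configurations contain [numpy error] cases, incorrect shapes, and similar issues that need to be fixed in APITest.
  2. Once the configurations are fixed, a full regression test of conv and conv_transpose will be run.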

Pcard-85711

@paddle-bot

paddle-bot bot commented Jul 17, 2025

Your PR has been submitted. Thanks for your contribution!
Please wait for the CI results first. See the Paddle CI Manual for details.

@wanghuancoder wanghuancoder merged commit b8b4c24 into PaddlePaddle:develop Jul 18, 2025
55 checks passed
co63oc pushed a commit to co63oc/Paddle that referenced this pull request Jul 18, 2025
@cangtianhuang cangtianhuang deleted the fix-conv branch July 26, 2025 14:30