Skip to content

Conversation

@hxzd5568
Copy link
Contributor

PR Category

Operator Mechanism

PR Types

Bug fixes

Description

Pcard-67164

  1. cublas 的SCAL, VCOPY 函数调用了封装的 CUBLASAPI cublasStatus_t CUBLASWINAPI cublasScalEx(...) 等, 而这些函数中表示下标的类型为int, 因此对大tensor采用了paddle自定义的scal函数
  2. 修改后和torch 一致,torch也是使用了自定义函数来处理vcopy , scale
    (torch: pytorch/tools/autograd/derivatives.yaml:258)
@paddle-bot
Copy link

paddle-bot bot commented Jul 29, 2025

你的PR提交成功,感谢你对开源项目的贡献!
请关注后续CI自动化测试结果,详情请参考Paddle-CI手册
Your PR has been submitted. Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

@hxzd5568
Copy link
Contributor Author

/re-run all-failed

Copy link
Contributor

@wanghuancoder wanghuancoder left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@lshpku lshpku merged commit a6f0874 into PaddlePaddle:develop Jul 30, 2025
69 of 71 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

3 participants