[BIG tensor]Fix int32 overflow for l1_loss #74211

Difers · 2025-07-24T03:35:51Z

PR Category

Operator Mechanism

PR Types

Bug fixes

Description

修复paddle.nn.functional.l1_loss底层Reduce Kernel int32溢出问题

Pcard-73145

测例

import paddle input = paddle.rand([1, 2147483646], dtype="float16") //接近但小于int32边界 label = paddle.rand([1, 2147483646], dtype="float16") //接近但小于int32边界 l1_loss = paddle.nn.functional.l1_loss(input, label) print(l1_loss)

触发原因
input.numel() 小于int32 边界 -> 走ReduceAnyKernel indextype int32分支 ->
for (; input_idx + block_size < bound; input_idx += REDUCE_VEC_SIZE * stride)
input_idx在循环最后一次可能int32溢出

paddle-bot · 2025-07-24T03:35:55Z

你的PR提交成功，感谢你对开源项目的贡献!
请关注后续CI自动化测试结果，详情请参考Paddle-CI手册。
Your PR has been submitted. Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

fix reduce int32 overflow

80a07c3

Difers changed the title ~~[PHI] Fix int32 overflow for l1_loss~~ [BIG tensor]Fix int32 overflow for l1_loss Jul 24, 2025

Difers mentioned this pull request Jul 24, 2025

[Big tensor] Refine some configurations that may cause OOM for l1_loss PFCCLab/PaddleAPITest#406

Merged

lshpku approved these changes Jul 29, 2025

View reviewed changes

lshpku merged commit 0a94741 into PaddlePaddle:develop Jul 29, 2025
55 of 56 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[BIG tensor]Fix int32 overflow for l1_loss #74211

[BIG tensor]Fix int32 overflow for l1_loss #74211

Difers commented Jul 24, 2025 •

edited

Loading

paddle-bot bot commented Jul 24, 2025

Uh oh!

Labels

2 participants

[BIG tensor]Fix int32 overflow for l1_loss #74211

[BIG tensor]Fix int32 overflow for l1_loss #74211

Conversation

Difers commented Jul 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PR Category

PR Types

Description

修复paddle.nn.functional.l1_loss底层Reduce Kernel int32溢出问题

paddle-bot bot commented Jul 24, 2025

Uh oh!

Labels

2 participants

Difers commented Jul 24, 2025 •

edited

Loading