【pir&npu】fix fetch op with memcpy_d2h use Npu indentity kernel. #72149

xiaoguoguo626807 · 2025-04-09T08:40:32Z

PR Category

Execute Infrastructure

PR Types

Bug fixes

Description

pcard-67164

x_data = paddle.static.data(shape=self.shape, name="data", dtype="float32")
output = paddle.incubate._npu_identity(x=x_data, format=self.format)

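The network above produces incorrect output under PIR. The cause is that npu_identity is not just a memory copy: it also changes the memory layout according to format, so copying from NPU to CPU also requires a dedicated kernel.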
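To illustrate why a plain device-to-host copy is not enough once the layout has changed, here is a minimal pure-Python sketch. It is a toy model only: the column-major "device format" and all three helper functions are hypothetical stand-ins, not Paddle or NPU APIs, and the real NPU formats are more involved.

```python
# Toy model only: these helpers are hypothetical and are not Paddle or NPU APIs.

def to_device_format(rows):
    """Store a row-major 2-D list as a column-major flat buffer (the stand-in "device format")."""
    n_rows, n_cols = len(rows), len(rows[0])
    return [rows[r][c] for c in range(n_cols) for r in range(n_rows)]


def plain_memcpy_d2h(buffer, n_rows, n_cols):
    """Copy the buffer unchanged and reinterpret it as row-major, ignoring the device format."""
    flat = list(buffer)
    return [flat[r * n_cols:(r + 1) * n_cols] for r in range(n_rows)]


def format_aware_d2h(buffer, n_rows, n_cols):
    """Undo the column-major layout while copying, restoring the logical values."""
    return [[buffer[c * n_rows + r] for c in range(n_cols)] for r in range(n_rows)]


logical = [[1, 2, 3], [4, 5, 6]]
device_buffer = to_device_format(logical)

wrong = plain_memcpy_d2h(device_buffer, 2, 3)   # values come back scrambled
right = format_aware_d2h(device_buffer, 2, 3)   # values match the logical tensor
```

In this sketch the plain copy returns the right number of elements but in the wrong positions, which matches the symptom described above: the fetch succeeds, yet the output is incorrect.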
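PIR's memcpy_d2h kernel selection logic picks the backend from src_place.type(), so every custom place selects the custom backend and the kernel that NPU registered for itself is never chosen. This PR changes the kernel selection logic: if the place is a CustomPlace, it constructs a fake tensor from value.place, prepares the kernel key set, and selects the NPU kernel by priority.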
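The difference between the two selection strategies can be sketched as follows. This is an illustrative toy model under stated assumptions, not Paddle's actual implementation: the `Place` class, the `KERNELS` registry, the backend names, and both selection functions are hypothetical.

```python
# Toy model only: the registry, backend names, and helpers are hypothetical.
from dataclasses import dataclass


@dataclass(frozen=True)
class Place:
    kind: str              # "cpu", "gpu", or "custom"
    device_type: str = ""  # e.g. "npu" when kind == "custom"


# Hypothetical kernel registry keyed by (op name, backend).
KERNELS = {
    ("memcpy_d2h", "gpu"): "gpu_memcpy_d2h",
    ("memcpy_d2h", "custom"): "generic_custom_memcpy_d2h",
    ("memcpy_d2h", "npu"): "npu_memcpy_d2h",
}


def select_by_place_type(op, place):
    """Old behaviour as described: the backend is derived from the place type only."""
    return KERNELS[(op, place.kind)]


def select_by_key_set(op, place):
    """New behaviour as described: for a custom place, build candidate keys from the
    concrete place and return the highest-priority kernel that is registered."""
    if place.kind != "custom":
        return KERNELS[(op, place.kind)]
    candidates = [place.device_type, "custom"]  # device-specific backend first
    for backend in candidates:
        kernel = KERNELS.get((op, backend))
        if kernel is not None:
            return kernel
    raise LookupError(f"no kernel registered for {op} on {place}")


npu = Place("custom", "npu")
old_choice = select_by_place_type("memcpy_d2h", npu)
new_choice = select_by_key_set("memcpy_d2h", npu)
```

Putting the device-specific backend ahead of the generic one means a custom device without its own kernel still falls back to the generic custom kernel.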

paddle-bot · 2025-04-09T08:40:37Z

Your PR has been submitted. Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

changeyoung98

LGTM

xiaoguoguo626807 added 9 commits March 31, 2025 14:47

modify place error

3d48607

modify place error

1c07752

modify RemoveRedundantMemcpyAfterShadowFeed

f40b2e1

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into develop

07d1c4d

change input place by next op need

80836ff

recover

2385cea

Merge commit 'refs/pull/72094/head' of https://github.com/PaddlePaddle/Paddle into develop

6feddd5

add npu memcpy_d2h choose

c615de8

add npu memcpy_d2h choose

e5d059d

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into npu_indentity

64eae78

changeyoung98 approved these changes Apr 10, 2025

xiaoguoguo626807 merged commit 7e2c9ed into PaddlePaddle:develop Apr 10, 2025
33 of 34 checks passed

xiaoguoguo626807 deleted the npu_indentity branch April 10, 2025 07:19
