[0-size Tensor Job2 No.29、28] Add 0-size Tensor support for paddle.incubate.nn.functional.fused_multi_head_attention [fluid_ops] #74006
+261 −15
Add this suggestion to a batch that can be applied as a single commit. This suggestion is invalid because no changes were made to the code. Suggestions cannot be applied while the pull request is closed. Suggestions cannot be applied while viewing a subset of changes. Only one suggestion per line can be applied in a batch. Add this suggestion to a batch that can be applied as a single commit. Applying suggestions on deleted lines is not supported. You must change the existing code in this line in order to create a valid suggestion. Outdated suggestions cannot be applied. This suggestion has been applied or marked resolved. Suggestions cannot be applied from pending reviews. Suggestions cannot be applied on multi-line comments. Suggestions cannot be applied while the pull request is queued to merge. Suggestion cannot be applied right now. Please check back later.
PR Category
Operator Mechanism
PR Types
Bug fixes
Description
29 paddle.incubate.nn.functional.fused_multi_head_attention
修改前向和反向,反向填充0,PaddleAPITest中为自定义规则
https://github.com/PFCCLab/PaddleAPITest/blob/d782af6c860fdf2f20a346f56be2f4baad2433c7/tester/paddle_to_torch/rules.py#L1995
LayerNorm增加0-size判断
增加单测

PaddleAPITest测试通过
28 paddle.incubate.nn.functional.fused_linear_activation
原有代码已有修改,只是 x_grad 没有分配内存,修改增加x_grad分配内存
增加单测
PaddleAPITest 测试通过
