@lshpku
Contributor

@lshpku lshpku commented Jul 29, 2025

PR Category

Communication Library

PR Types

Performance

Description

Updates the underlying intranode & internode kernels to the upstream commit deepseek-ai/DeepEP@079c5a4 (July 14).
That commit already includes the TMA optimization for internode performance.

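Changes in this PR

intranode.cu, internode.cu, configs.cuh, and ibgda_device.cuh are copied over directly.

launch.cuh and utils.cuh are copied over, but the deprecated functions that low_latency still depends on are kept (low_latency is maintained by the inference team and is left unchanged).

runtime.cu and layout.cu are copied over and merged into a single runtime.cu (they were merged the same way before).

The intranode & internode parts of api.cuh are copied over.

The member variables of Buffer in deep_ep.hpp get minor changes.

In deep_ep.cpp, the Buffer constructor, the sync method, and the intranode & internode call sites are modified to set the new member variables correctly and to match the new CUDA-layer interface.

A helper method is added to types.h.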

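Correctness tests

Unit tests with test_intranode.py and test_internode.py (2, 4, and 8 machines) all pass.

End-to-end convergence tests with DeepseekV3 under multiple PP and EP configurations all pass.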

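Performance changes

The advantage of the new version is that it reaches the same communication bandwidth with fewer SMs, which leaves more SMs for computation.

For example, on DeepseekV3: deepep SMs 20 -> 14, deepgemm SMs 112 -> 118, for an end-to-end improvement of 1-2%.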
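The SM split in the example above can be checked with a little arithmetic. This is only an illustrative sketch: the helper `compute_sms` and the constant `TOTAL_SMS` are hypothetical names, not part of Paddle or DeepEP, and the total of 132 SMs is inferred from the two numbers quoted above (20 + 112) rather than stated in the PR.

```python
def compute_sms(total_sms: int, comm_sms: int) -> int:
    """Return the SMs left for compute after reserving comm_sms for communication."""
    if not 0 <= comm_sms <= total_sms:
        raise ValueError("comm_sms must be between 0 and total_sms")
    return total_sms - comm_sms


# Assumption: the SM total is the sum of the two figures quoted in the PR.
TOTAL_SMS = 20 + 112

before = compute_sms(TOTAL_SMS, 20)  # SMs for deepgemm with the old kernels
after = compute_sms(TOTAL_SMS, 14)   # SMs for deepgemm with the updated kernels

# Relative increase in SMs available to compute.
gain = (after - before) / before

print(f"compute SMs: {before} -> {after} ({gain:.1%} more)")
```

Freeing 6 SMs gives compute roughly 5% more SMs. The 1-2% end-to-end figure is the measured result reported in the PR and is not derived from this calculation.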


Pcard-85711
@lshpku lshpku requested review from ForFishes and sneaxiy as code owners July 29, 2025 06:49
@paddle-bot

paddle-bot bot commented Jul 29, 2025

Your PR has been submitted. Thanks for your contribution!
Please wait for the CI results first. See the Paddle CI Manual for details.

@lshpku lshpku force-pushed the update-deepep-079c5a4 branch 3 times, most recently from 117bd81 to 5f846fc Compare August 4, 2025 11:09
@lshpku lshpku force-pushed the update-deepep-079c5a4 branch from 5f846fc to a3d0d9e Compare August 6, 2025 09:17
Contributor

@gongweibao gongweibao left a comment

LGTM

Contributor

@XiaoguangHu01 XiaoguangHu01 left a comment

LGTM

@lshpku lshpku merged commit 530cd6d into PaddlePaddle:develop Aug 12, 2025
80 of 83 checks passed
maxiaolong001 pushed a commit to maxiaolong001/Paddle that referenced this pull request Aug 12, 2025
lshpku added a commit to lshpku/Paddle that referenced this pull request Oct 28, 2025
lshpku added a commit to lshpku/Paddle that referenced this pull request Oct 28, 2025
4 participants