Skip to content

Conversation

@kuizhiqing
Copy link
Member

PR types

New features

PR changes

Others

Describe

Heterogenous mix training represents the model training with heterogenous hardwares. Dygraph mode is only supported now. GPU/NPU/XPU are targeting devices for this prototype work.

The basic idea is very similar as the use of hierarchical communication topology. The low layer reduce the data within each node, while the upper layer reduce across all global nodes.

image

@paddle-bot-old
Copy link

Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

Copy link

@sandyhouse sandyhouse left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

看着都是动态图的,不能支持静态图吧?

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Broadcast?Broadcast是一个单词。另外,这个接口没有实现,为什么还要添加这个接口呢?

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

  1. 同上。
  2. gloo接口为什么需要传入ring_id?

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

BroadCast -> Broadcast?

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

BroadCast -> Broadcast?

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

同上

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

同上

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Abort?

Copy link

@sandyhouse sandyhouse left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@sandyhouse sandyhouse requested a review from fuyinno4 December 1, 2021 03:30
Copy link
Contributor

@zhiqiu zhiqiu left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM for const_cast

@sandyhouse sandyhouse merged commit 1bdb857 into PaddlePaddle:develop Dec 6, 2021
Zjq9409 pushed a commit to Zjq9409/Paddle that referenced this pull request Dec 10, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

7 participants