NCCL compile-time and runtime versions do not match #8195

@tonyyang-svail

Currently, Paddle is compiled against NCCL 1.3, which it builds from source pulled from https://github.com/NVIDIA/nccl. At runtime, however, it is linked against NCCL 2. This mismatch can cause failures such as "unhandled cuda error".
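
One way to catch this kind of mismatch early is to compare the NCCL major version the build used with the one the loader finds at runtime. The following is a minimal sketch, not Paddle code: the function names and the `built_major` argument are hypothetical, and it assumes the runtime library exports `ncclGetVersion`, which NCCL 2.x releases provide but older ones may not. A library without that symbol is reported as "unknown" rather than guessed at.

```python
import ctypes
import ctypes.util


def decode_nccl_version(code):
    """Split an NCCL version code into (major, minor, patch).

    NCCL 2.x headers encode versions up to 2.8 as major*1000 + minor*100 + patch
    and versions from 2.9 onward as major*10000 + minor*100 + patch.
    """
    if code >= 10000:
        major, rest = divmod(code, 10000)
    else:
        major, rest = divmod(code, 1000)
    minor, patch = divmod(rest, 100)
    return major, minor, patch


def runtime_nccl_version(lib_name="nccl"):
    """Return (major, minor, patch) of the NCCL the loader finds, or None."""
    path = ctypes.util.find_library(lib_name)
    if path is None:
        return None  # no NCCL shared library found
    try:
        lib = ctypes.CDLL(path)
    except OSError:
        return None
    # Assumption: the symbol is missing on older releases, so treat that as unknown.
    get_version = getattr(lib, "ncclGetVersion", None)
    if get_version is None:
        return None
    get_version.argtypes = [ctypes.POINTER(ctypes.c_int)]
    get_version.restype = ctypes.c_int
    version = ctypes.c_int(0)
    if get_version(ctypes.byref(version)) != 0:  # 0 is ncclSuccess
        return None
    return decode_nccl_version(version.value)


def check_nccl_match(built_major, runtime_version):
    """Compare the (hypothetical) recorded build-time major with the runtime one."""
    if runtime_version is None:
        return "unknown"
    return "match" if runtime_version[0] == built_major else "mismatch"


if __name__ == "__main__":
    # built_major=1 mirrors the NCCL 1.3 build described in this issue.
    print(check_nccl_match(1, runtime_nccl_version()))
```

On a machine set up as described above (built against NCCL 1.3, NCCL 2 at runtime), such a check would be expected to report a mismatch before any collective call is made, provided the runtime library exposes `ncclGetVersion`.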

The CI doesn't fail because some CI machines have only one GPU, so the NCCL test is skipped.
