expert distributions #3709

CUHKSZzxy · 2025-07-04T04:01:07Z

Refer to dlBLAS. Not compatible with CUDA graph, so it must be used with --eager-mode.

LMDEPLOY_DUMP_EXPERT_DISTRIBUTION=1 \
LMDEPLOY_EXPERT_DUMP_DIR="your_expert_distribution_dir" \
LMDEPLOY_DP_MASTER_ADDR=0.0.0.0 \
LMDEPLOY_DP_MASTER_PORT=29555 \
lmdeploy serve api_server \
    Qwen/Qwen3-235B-A22B-FP8 \
    --backend pytorch \
    --tp 1 \
    --dp 4 \
    --ep 4 \
    --proxy-url http://0.0.0.0:8001 \
    --nnodes 1 \
    --node-rank 0 \
    --eager-mode \
    --log-level INFO

CUHKSZzxy added 2 commits July 4, 2025 11:54

expert distributions

a84d356

add expert distribution recorder for deepseek

184ef28
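
For readers unfamiliar with the term, an expert distribution is usually the per-layer count of how many tokens the MoE router sends to each expert. The sketch below illustrates that idea only; it is not the recorder added in this PR or the one in dlBLAS. The class name `ExpertDistributionRecorder`, its methods, and the nested-list input format are all hypothetical and chosen for illustration.

```python
from collections import Counter, defaultdict
from typing import Dict, Iterable, List


class ExpertDistributionRecorder:
    """Hypothetical recorder: counts expert selections per MoE layer."""

    def __init__(self, num_experts: int) -> None:
        self.num_experts = num_experts
        # layer index -> Counter mapping expert id to number of selections
        self._counts: Dict[int, Counter] = defaultdict(Counter)

    def record(self, layer_idx: int, topk_ids: Iterable[Iterable[int]]) -> None:
        """Accumulate router output for one forward pass of one layer.

        topk_ids holds one entry per token; each entry lists the expert ids
        chosen for that token (assumed input format).
        """
        for token_experts in topk_ids:
            self._counts[layer_idx].update(token_experts)

    def distribution(self, layer_idx: int) -> List[int]:
        """Return selection counts for experts 0..num_experts-1 in a layer."""
        counts = self._counts[layer_idx]
        return [counts[e] for e in range(self.num_experts)]


if __name__ == "__main__":
    rec = ExpertDistributionRecorder(num_experts=4)
    # Three tokens, top-2 routing, layer 0.
    rec.record(0, [[0, 2], [2, 3], [2, 1]])
    print(rec.distribution(0))
```

A skewed result such as expert 2 receiving most of the tokens is the kind of imbalance that a dumped distribution makes visible.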

CUHKSZzxy marked this pull request as ready for review July 4, 2025 04:16
