[RFC]: In-place weights loading

@youkaichao

Motivation.

We've implemented the initial version of inplace weights loading in #18745 and #19884.

Currently load_model will decide itself whether to load model from scratch or reload weighs inplace.
Should we give users more control? I've been thinking about making the inplace weights loading choice explicit.

In addition, a standalone weights reloading API makes it possible to support model swap in the original load_model method.

Proposed Change.

collective_rpc("load_model")
- this will always recreate model and load weights from sractch
collective_rpc("update_config", overrides) [Core] Add update_config RPC method #20095
- Update any config in model runner.
- For example, collective_rpc("update_config", overrides={"load_config": {"load_format": "auto"}})
collective_rpc("reload_weights") [Core] Add reload_weights RPC method #20096
- this will explicitly reload weights inplace, without re-initialize the model
Might need to move process_weights_after_loading from load_model to load_weights
Quantized inplace weights loading support: Support RL online quantization with torchao #23014

Alternative approach
Instead of providing a standalone update_config API, we can embed the overrides in load_model and reload_weights APIs, in which we mutate the configs before performing model/weights loading.

Feedback Period.

No response

CC List.

cc @youkaichao @houseroad @yeqcharlotte @foreverlms

Any Other Things.

No response

Before submitting a new issue...

Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the documentation page, which can answer lots of frequently asked questions.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[RFC]: In-place weights loading #19886

Motivation.

Proposed Change.

Feedback Period.

CC List.

Any Other Things.

Before submitting a new issue...

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

[RFC]: In-place weights loading #19886

Description

Motivation.

Proposed Change.

Feedback Period.

CC List.

Any Other Things.

Before submitting a new issue...

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions