@mjamroz (Contributor) commented Oct 23, 2023

When resuming training from a checkpoint of a compiled model (`--torchcompile=inductor`), I sometimes get Missing key(s) in state_dict: "stem.conv1.c.weight", [...] Unexpected key(s) in state_dict: "_orig_mod.stem.conv1.c.weight", [...].
This PR fixes the issue by stripping the `_orig_mod.` prefix from the checkpoint keys before loading the weights.
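
For readers skimming the thread, here is a minimal sketch of the idea, not the PR's actual diff. The helper name `strip_compile_prefix` and the toy checkpoint are assumptions made for illustration; plain integers stand in for tensors so the snippet runs without PyTorch installed.

```python
from collections import OrderedDict

# Prefix seen in the "Unexpected key(s)" part of the error above.
COMPILE_PREFIX = "_orig_mod."


def strip_compile_prefix(state_dict):
    """Return a copy of state_dict with a leading '_orig_mod.' removed from each key.

    Hypothetical helper for illustration; keys without the prefix pass through unchanged.
    """
    cleaned = OrderedDict()
    for key, value in state_dict.items():
        if key.startswith(COMPILE_PREFIX):
            key = key[len(COMPILE_PREFIX):]
        cleaned[key] = value
    return cleaned


# Toy stand-in for a checkpoint saved from a compiled model.
# The second key is made up to show that clean keys are left alone.
checkpoint = OrderedDict([
    ("_orig_mod.stem.conv1.c.weight", 1),
    ("head.fc.weight", 2),
])
cleaned = strip_compile_prefix(checkpoint)
print(list(cleaned))
```

The cleaned mapping can then be passed to `model.load_state_dict(...)` on an uncompiled model. Only a leading prefix is removed, so a key that merely contains `_orig_mod.` somewhere in the middle is left untouched.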

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.

@rwightman (Collaborator)

@mjamroz does this cover the case of a DDP-wrapped torch.compile model? I don't know if I've actually checked that ... is the prefix _orig_mod.module. or just _orig_mod.?
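
One way to sidestep the ordering question is to strip both wrapper prefixes regardless of nesting. The sketch below assumes DDP contributes a `module.` prefix and `torch.compile` contributes `_orig_mod.`, so compiling a DDP-wrapped model would give `_orig_mod.module.` and DDP-wrapping a compiled model would give `module._orig_mod.`. Which ordering the training script actually produces is not established in this thread, and the helper name `strip_wrapper_prefixes` is hypothetical.

```python
# Prefixes assumed to come from torch.compile and DistributedDataParallel wrappers.
WRAPPER_PREFIXES = ("_orig_mod.", "module.")


def strip_wrapper_prefixes(state_dict):
    """Remove any leading run of wrapper prefixes from every key, in either order."""
    cleaned = {}
    for key, value in state_dict.items():
        stripped = True
        while stripped:  # keep going until no known prefix leads the key
            stripped = False
            for prefix in WRAPPER_PREFIXES:
                if key.startswith(prefix):
                    key = key[len(prefix):]
                    stripped = True
        cleaned[key] = value
    return cleaned


# Each nesting order, plus an already-clean key, maps to the same result.
candidates = (
    "_orig_mod.module.stem.conv1.c.weight",
    "module._orig_mod.stem.conv1.c.weight",
    "_orig_mod.stem.conv1.c.weight",
    "stem.conv1.c.weight",
)
results = [list(strip_wrapper_prefixes({key: None}))[0] for key in candidates]
for key, result in zip(candidates, results):
    print(key, "->", result)
```

A caveat of this approach: a model whose own top-level submodule is literally named `module` would have that name stripped too, so a stricter version might only strip prefixes when every key in the checkpoint shares them.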

@rwightman rwightman closed this Jun 7, 2024