adefazio
Contributor

This pull request contains a version of Schedule-Free AdamW that uses the batch-norm fixes made after the competition deadline and follows the published version of Schedule-Free in a few small details, for consistency.

I have also included a weighted dual-averaging version of Schedule-Free in the external tuning track, which shows improved convergence over the unweighted version on some workloads, particularly ViT.
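For readers unfamiliar with the method, here is a minimal sketch of the general idea, not the code in this PR. It uses plain SGD on a toy 1-D quadratic instead of AdamW, and the `weight` callback is a hypothetical stand-in for whatever weighting the submission actually applies. It only illustrates how the averaged iterate `x` changes when later steps receive larger weights.

```python
def schedule_free_sgd(grad, w0, lr=0.5, beta=0.9, steps=500,
                      weight=lambda t: 1.0):
    """Toy Schedule-Free SGD (illustrative sketch, not the submission code).

    z: base iterate that takes the gradient steps
    x: weighted running average of the z iterates (the point to evaluate)
    y: interpolation of z and x, where the gradient is computed
    weight(t): hypothetical averaging weight for iterate t
    """
    z = w0
    x = w0
    weight_sum = weight(1)  # the initial point counts as iterate 1
    for t in range(1, steps + 1):
        y = (1.0 - beta) * z + beta * x   # gradient evaluation point
        z = z - lr * grad(y)              # base step, no LR schedule
        w_next = weight(t + 1)
        weight_sum += w_next
        c = w_next / weight_sum           # uniform weights give c = 1 / (t + 1)
        x = (1.0 - c) * x + c * z         # update the weighted average
    return x


# Toy objective f(w) = 0.5 * (w - 3)^2, so grad f(w) = w - 3.
def toy_grad(w):
    return w - 3.0


x_uniform = schedule_free_sgd(toy_grad, w0=0.0)
# Assumed weighting for illustration only: weight grows linearly with t.
x_weighted = schedule_free_sgd(toy_grad, w0=0.0, weight=lambda t: float(t))
```

With uniform weights, `x` is the plain average of the `z` iterates. With the assumed linear weights, recent iterates dominate the average, so `x` tracks `z` more closely.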

This version is intended for the ongoing public leaderboard that has been discussed in the workgroup meetings, and is designed for the original competition V100 hardware.

Note that the external-tuning track ResNet50 runs only barely hit the baseline, and whether they do depends on the random seed, so there may be some non-determinism there.

@priyakasimbeg

@adefazio adefazio requested a review from a team as a code owner November 14, 2024 00:02
MLCommons CLA bot All contributors have signed the MLCommons CLA ✍️ ✅

fsschneider added a commit that referenced this pull request Feb 11, 2025
@priyakasimbeg priyakasimbeg changed the base branch from main to rolling_leaderboard_submissions March 17, 2025 21:16
Contributor

@priyakasimbeg priyakasimbeg left a comment

Thanks Aaron!

@priyakasimbeg priyakasimbeg merged commit 76bf768 into mlcommons:rolling_leaderboard_submissions Mar 17, 2025
1 of 2 checks passed
@github-actions github-actions bot locked and limited conversation to collaborators Mar 17, 2025