Continuing cleanup. #76
Conversation
znado commented May 8, 2022
- refactoring dataset iterators to return Dict[str, Any] instead of a tuple/triplet of inputs/labels/masks
- cleaning up some of the WMT code
- standardizing param shape/type utilities
- small cleanups/fixes to get all reference submissions to work with random inputs for 1 train and 2 eval steps (takes ~272s total to test)
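The dict-style batch format described in the first bullet can be sketched as follows. This is a hypothetical illustration, not the PR's actual code; the key names (`inputs`, `targets`, `weights`) are assumptions and may differ from what the workloads use.

```python
from typing import Any, Dict, Iterator, Tuple


def as_dict_batches(
    raw_iter: Iterator[Tuple[Any, Any, Any]]) -> Iterator[Dict[str, Any]]:
  """Wrap an iterator of (inputs, labels, masks) tuples into dict batches."""
  for inputs, labels, masks in raw_iter:
    yield {'inputs': inputs, 'targets': labels, 'weights': masks}


batches = as_dict_batches(iter([([1, 2], [0, 1], [1, 1])]))
print(sorted(next(batches)))  # ['inputs', 'targets', 'weights']
```

Returning a `Dict[str, Any]` instead of a positional tuple lets workloads add or omit fields (e.g. masks) without breaking every call site.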
MLCommons CLA bot: All contributors have signed the MLCommons CLA ✍️ ✅
Looks really awesome!
I added a few comments and nits. I think that after merging #66 and addressing these comments, this is ready to merge.
# flax.linen.Embed names the embedding parameter "embedding"
# https://github.com/google/flax/blob/main/flax/linen/linear.py#L604.
elif name == 'embedding':
  param_types_dict[name] = spec.ParameterType.EMBEDDING
Is it problematic that JAX gets more information than PyTorch?
Yes. Is the way I added it the right way to check the name of an nn.Embedding layer in PyTorch? We should add a test to make sure these are properly set for both JAX and PyTorch (at least for some selection of params). I added a TODO to my list of things to follow up on.
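For reference, a pure name check like the JAX snippet above would not carry over directly: flax.linen.Embed names its table `embedding`, but torch.nn.Embedding names its table `weight`. A minimal sketch of the name-based classification (with an illustrative `ParameterType` enum, not the repo's `spec` module):

```python
import enum


class ParameterType(enum.Enum):
  WEIGHT = 0
  EMBEDDING = 1


def classify_param(name: str) -> ParameterType:
  # flax.linen.Embed names its table "embedding", so matching on the name
  # works on the JAX side; torch.nn.Embedding names its table "weight", so
  # the same check would silently miss it on the PyTorch side, where
  # matching on the module type (isinstance(m, nn.Embedding)) is safer.
  if name == 'embedding':
    return ParameterType.EMBEDDING
  return ParameterType.WEIGHT


print(classify_param('embedding').name)  # EMBEDDING
print(classify_param('weight').name)     # WEIGHT
```

This is exactly the gap a cross-framework test would catch.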
if is_train:
  dataloader = cycle(dataloader)
dataloader = cycle(dataloader)
Just curious: why did you switch to also cycling the evaluation datasets?
I added it so that we can iterate through them across multiple evals. I'm not sure if this is the best way to do it.
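A minimal sketch of a `cycle()` helper that supports this, assuming the repo's version behaves roughly like this (the actual implementation may differ, e.g. by reshuffling between passes):

```python
def cycle(make_iterable):
  """Endlessly re-create and drain an iterable.

  Unlike itertools.cycle, this does not cache elements after the first
  pass, which matters for large datasets; the trade-off is that the
  argument must be a factory that can be called repeatedly.
  """
  while True:
    for batch in make_iterable():
      yield batch


it = cycle(lambda: [1, 2, 3])
print([next(it) for _ in range(5)])  # [1, 2, 3, 1, 2]
```

Cycling the eval iterator means a fixed number of `next()` calls per eval step keeps working across multiple evaluations, instead of raising StopIteration once the underlying dataset is exhausted.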