Skip to content

Conversation

znado
Copy link
Contributor

@znado znado commented Mar 3, 2021

A rough sketch of an implementation of our current API as outlined in the rules draft, using jax/numpy.

A rough sketch of an implementation of our current API as outlined in the rules draft, using jax/numpy.
@github-actions
Copy link

github-actions bot commented Mar 3, 2021

MLCommons CLA bot All contributors have signed the MLCommons CLA ✍️ ✅

@znado znado changed the title Create draft_api.py Minimal MNIST workload Apr 6, 2021
Copy link
Contributor

@mikerabbat mikerabbat left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is a great start! I think it covers almost all of the main points. I've added minor comments inline and I have three high-level comments.

The main thing that seems to be missing is a way for submissions to specify the hyperparameter search space. Would it make sense for submissions to provide one more function that accepts a source of randomness and outputs a sample from the tuning search space?

The way PRNG seeds are handled here is pretty specific to Jax. Different frameworks will probably need to do this differently. Is there any utility to making the harness code as framework-agnostic as possible? Is there any concern that framework-specific differences in how PNRGs are handled can become a non-trivial source of difference between implementations?

We've been discussing the benchmark in terms of a system that has 8 GPUs. Will submissions have control over how those GPUs are used? For some workloads (e.g., a ResNet50/ImageNet-like one) it may be useful to do data-parallel training using all 8 GPUs. For others (smaller ones like CIFAR-10), that may be overkill and a submission may want to try something different. The current API doesn't seem to expose this flexibility.

@znado znado closed this Apr 20, 2021
@znado znado reopened this Apr 20, 2021
@github-actions github-actions bot locked and limited conversation to collaborators Apr 20, 2021
@georgedahl georgedahl merged commit effe55a into mlcommons:main Apr 21, 2021
@znado znado deleted the patch-1 branch June 2, 2021 19:38
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Labels

None yet

3 participants