This repository was archived by the owner on Jan 21, 2025. It is now read-only.
Pull requests: tensorflow/mesh
Pull requests list
Adding a new Gradient Estimator for Routing using REINFORCE with a leave-one-out baseline.
#374 opened Mar 16, 2022 by copybara-service bot
Minor comment fix to refer to the correct argument name. cla: yes
#367 opened Oct 27, 2021 by copybara-service bot
Fix some example code in readme for einsum operation cla: yes
#365 opened Oct 10, 2021 by baragona
MODE models with heterogeneous expert width cla: no
#342 opened Jul 28, 2021 by copybara-service bot
Option to use mtf.Print to log which tokens are sent to which experts when run on CPU. cla: yes
#329 opened Jun 17, 2021 by copybara-service bot
Add cast preprocessor and add tasks for inference prompts for deduplication project. cla: yes
#320 opened May 12, 2021 by copybara-service bot
Allow for disabling the automatic save on shutdown. cla: yes
#293 opened Feb 25, 2021 by copybara-service bot
Add loss functions for multiple-target objectives for distillation. cla: no
#291 opened Feb 9, 2021 by copybara-service bot
Use multiple target objectives for distillation. Also see cl/356382304 cla: no
#290 opened Feb 9, 2021 by copybara-service bot
Decode Unicode strings in inference mode. cla: no
#281 opened Jan 28, 2021 by copybara-service bot
Remove unused vocab argument from dataset function calls. cla: yes
#217 opened Oct 22, 2020 by copybara-service bot
This shouldn't have public changes once the diffbase is submitted. cla: yes
#191 opened Sep 18, 2020 by copybara-service bot
Add scores for generated text in inference mode cla: yes
#164 opened Aug 20, 2020 by allen-q