Pull requests: databrickslabs/dolly

Pull requests list

Update README.md with instructions for DBR 13
#200 by edurdevic was merged Jun 24, 2023
Set bf16 flags correctly for a10/a100
#192 by srowen was merged Jun 7, 2023
Drop duplicate torch from requirements_dev.txt
#189 by holdenk was merged Jun 5, 2023
Allow Dolly's trainer main to be called,
#186 by holdenk was closed May 31, 2023
A10 & v100 config
#182 by tnixon was merged Jun 2, 2023
working V100 GPU config
#181 by tnixon was closed May 26, 2023
[Fix] Move attention mask to the model device type
#180 by BaiqingL was merged May 26, 2023
Add Dolly as the Input Model
#164 by xuanyuanking was merged May 16, 2023
Support for Training the Model Using Local Files
#163 by xuanyuanking was closed May 15, 2023
Fix documentation for max_new_tokens.
#162 by SamiKalliomaki was merged May 15, 2023
Eight bit 12b model
#159 by densongenesys was closed May 16, 2023
convert to 8 bit mode
#157 by densongenesys was closed May 11, 2023
Note that dataset should be used from Hugging Face now
#144 by srowen was merged May 3, 2023
Update to Trainer.train to Allow Override Dataset
#142 by rmosleydb was merged May 3, 2023
import missing datetime
#135 by opyate was closed Apr 26, 2023
Drop back to deepspeed 0.8.3 because of issues with 0.9.x (label: bug)
#130 by srowen was merged Apr 25, 2023
Reference HF dataset by default, now that it's live (label: enhancement)
#123 by srowen was merged Apr 21, 2023
Update reqs to match DBR 13; add torch (label: enhancement)
#122 by srowen was merged Apr 21, 2023
Allow Alternate files to be loaded
#120 by rmosleydb was closed Apr 22, 2023
Added datasets in 5 languages
#115 by Lednik7 was closed Apr 21, 2023
File edits
#114 by carolyn-gronlund was closed Apr 20, 2023
Fix reference to pythia-2.8b (label: documentation)
#113 by srowen was merged Apr 20, 2023
Improve batch size guidance for other instance training (label: documentation)
#106 by srowen was merged Apr 19, 2023