Pull requests: unslothai/notebooks

Pull requests list

2048 RL: hard-timeout strategy rollout
#148 opened Dec 20, 2025 by cgpadwick
Change the vllm saving cells
#147 opened Dec 19, 2025 by Etherll
Add Gemma phone deployment notebook
#146 opened Dec 18, 2025 by glee2429
Add Modal notebooks
#116 opened Oct 7, 2025 by aniketmaurya
Add support to AMD Dev Cloud
#111 opened Sep 24, 2025 by vivienfanghuagood
updated qwen2_5
#106 opened Sep 16, 2025 by pluesclues
fix: regex match in format rewards
#96 opened Sep 1, 2025 by Erland366
add glm4 9b model for reasoning conversational
#94 opened Aug 28, 2025 by MengAiDev
Qwen2.5 VL GRPO notebook
#61 opened Jun 23, 2025 by GAD-cell
add tool calling notebooks
#10 opened Feb 26, 2025 by oliveirabruno01 Draft