Skip to content
Navigation Menu
Toggle navigation
Sign in
Appearance settings
Platform
AI CODE CREATION
GitHub Copilot
Write better code with AI
GitHub Spark
Build and deploy intelligent apps
GitHub Models
Manage and compare prompts
MCP Registry
New
Integrate external tools
DEVELOPER WORKFLOWS
Actions
Automate any workflow
Codespaces
Instant dev environments
Issues
Plan and track work
Code Review
Manage code changes
APPLICATION SECURITY
GitHub Advanced Security
Find and fix vulnerabilities
Code security
Secure your code as you build
Secret protection
Stop leaks before they start
EXPLORE
Why GitHub
Documentation
Blog
Changelog
Marketplace
View all features
Solutions
BY COMPANY SIZE
Enterprises
Small and medium teams
Startups
Nonprofits
BY USE CASE
App Modernization
DevSecOps
DevOps
CI/CD
View all use cases
BY INDUSTRY
Healthcare
Financial services
Manufacturing
Government
View all industries
View all solutions
Resources
EXPLORE BY TOPIC
AI
Software Development
DevOps
Security
View all topics
EXPLORE BY TYPE
Customer stories
Events & webinars
Ebooks & reports
Business insights
GitHub Skills
SUPPORT & SERVICES
Documentation
Customer support
Community forum
Trust center
Partners
Open Source
COMMUNITY
GitHub Sponsors
Fund open source developers
PROGRAMS
Security Lab
Maintainer Community
Accelerator
Archive Program
REPOSITORIES
Topics
Trending
Collections
Enterprise
ENTERPRISE SOLUTIONS
Enterprise platform
AI-powered developer platform
AVAILABLE ADD-ONS
GitHub Advanced Security
Enterprise-grade security features
Copilot for Business
Enterprise-grade AI features
Premium Support
Enterprise-grade 24/7 support
Pricing
state:open label:DeepSpeed
Search code, repositories, users, issues, pull requests...
Search syntax tips
Provide feedback
Saved searches
Use saved searches to filter your results more quickly
Sign in
Sign up
Appearance settings
Resetting focus
You signed in with another tab or window.
Reload
to refresh your session.
You signed out in another tab or window.
Reload
to refresh your session.
You switched accounts on another tab or window.
Reload
to refresh your session.
Dismiss alert
{{ message }}
huggingface
/
transformers
Public
Notifications
You must be signed in to change notification settings
Fork
31.4k
Star
154k
Code
Issues
1.1k
Pull requests
1.1k
Actions
Projects
1
Security
Uh oh!
There was an error while loading.
Please reload this page
.
Insights
Additional navigation options
Code
Issues
Pull requests
Actions
Projects
Security
Insights
Welcome v5
#40822 ·
LysandreJik
opened
on Sep 11, 2025
23
Issues
Search Issues
state
:
open
label
:
DeepSpeed
state:open label:DeepSpeed
Search
Labels
Milestones
New issue
Search results
Open
Closed
Tutorial for using DeepSpeed's activation checkpointing instead of PyTorch's
DeepSpeed
Feature request
Request for a new feature
Request for a new feature
Status: Open.
#32409
In huggingface/transformers;
·
huyiwen
opened
on Aug 4, 2024
DeepSpeed sequence parallelism (aka Ulysses) integration with HF transformer
DeepSpeed
Status: Open (in progress).
huggingface/transformers
number 32305
#32305
In huggingface/transformers;
·
samadejacobs
opened
on Jul 29, 2024
Support saving models trained with DeepSpeed in Trainer callbacks
DeepSpeed
Feature request
Request for a new feature
Request for a new feature
trainer
Status: Open.
#31338
In huggingface/transformers;
·
dwyatte
opened
on Jun 9, 2024
modeling_t5 incompatible with multiprocessing
DeepSpeed
Good Second Issue
Issues that are more difficult to do than "Good First" issues - give it a try if you want!
Issues that are more difficult to do than "Good First" issues - give it a try if you want!
Status: Open.
#30280
In huggingface/transformers;
·
rangehow
opened
on Apr 17, 2024
add deepspeed grad ckpt
DeepSpeed
Status: Open (in progress).
huggingface/transformers
number 30233
#30233
In huggingface/transformers;
·
SeunghyunSEO
opened
on Apr 13, 2024
Support H100 training with FP8 in Trainer and Deepspeed
DeepSpeed
Feature request
Request for a new feature
Request for a new feature
trainer
Status: Open.
#25333
In huggingface/transformers;
·
michaelroyzen
opened
on Aug 5, 2023
[Deepspeed] [performance] inefficient load with <code>from_pretrained</code> w/ zero3
DeepSpeed
WIP
Label your PR/Issue with WIP for some long outstanding Issues/PRs that are work in progress
Label your PR/Issue with WIP for some long outstanding Issues/PRs that are work in progress
Status: Open.
#12273
In huggingface/transformers;
·
stas00
opened
on Jun 20, 2021
[Deepspeed zero3] lazy weights init
DeepSpeed
WIP
Label your PR/Issue with WIP for some long outstanding Issues/PRs that are work in progress
Label your PR/Issue with WIP for some long outstanding Issues/PRs that are work in progress
Status: Open.
#12272
In huggingface/transformers;
·
stas00
opened
on Jun 20, 2021
[2D Parallelism] Tracking feasibility
DeepSpeed
Model Parallel
Model Parallelilsm Implementations
Model Parallelilsm Implementations
Pipeline Parallel
WIP
Label your PR/Issue with WIP for some long outstanding Issues/PRs that are work in progress
Label your PR/Issue with WIP for some long outstanding Issues/PRs that are work in progress
Status: Open.
#9931
In huggingface/transformers;
·
stas00
opened
on Feb 1, 2021
[DeepSpeed] Features to integrate / Optimizations to add / Experiments to do
DeepSpeed
Feature request
Request for a new feature
Request for a new feature
Status: Open.
#9606
In huggingface/transformers;
·
stas00
opened
on Jan 14, 2021
You can’t perform that action at this time.