pytorch-tpu/transformers (public; forked from huggingface/transformers)
Commits
Branch: flash_attention_minibatch_v6e
Commit History
Commits on Feb 14, 2025
Update step time calculation
bhavya01
committed
b185651
Commits on Jan 15, 2025
Merge pull request #70 from ManfeiBai/patch-2
tengyifei
authored
18e27d5
not only save on rank-0
ManfeiBai
authored
5638d19
Commits on Oct 16, 2024
enable minibatch
tengyifei
committed
97d7d51
Commits on Sep 16, 2024
Remove main_process_first calls
bhavya01
committed
3dd4ec1
Commits on Aug 26, 2024
Fix user guide
alanwaketan
committed
b736ca9
Add user guide
alanwaketan
committed
cae5c3c
Commits on Jul 31, 2024
Fix 2D
alanwaketan
committed
03f8ba8
enable 2d sharding
alanwaketan
committed
80b5a19
add profiler hints
alanwaketan
committed
e4fbe2f
Commits on Apr 23, 2024
Fix profiler
alanwaketan
committed
7f88656
Enable attn_weight scalar
alanwaketan
committed
850ac3c
implement save_strategy no
alanwaketan
committed
05cef15
Commits on Apr 20, 2024
Add toggle for fa
alanwaketan
committed
38b886f
Use BF16
alanwaketan
committed
63e8be1
Commits on Apr 19, 2024
Enable causal mask
alanwaketan
committed
2f95274
enable flash attention
alanwaketan
committed
b6fa01b
Adds profiler
alanwaketan
committed
8d60bf1
Fix use_cache
alanwaketan
committed
af53003
Commits on Apr 17, 2024
Fix quality Olmo + SDPA (#30302)
fxmarty
authored
ec92f98
Re-enable SDPA's FA2 path (#30070)
fxmarty and ArthurZucker
authored
05bdef1
Add OLMo model family (#29890)
2015aroras
authored
e4ea19b
Upgrading to tokenizers 0.19.0 (#30289)
Narsil
authored
8e5f76f
Add strategy to store results in evaluation loop (#30267)
qubvel
authored
c15aad0
Add token type ids to CodeGenTokenizer (#29265)
st81
authored
8d6b509
FIX: Fix push important models CI (#30291)
younesbelkada
authored
812a5de
Fix `Fatal Python error: Bus error` in `ZeroShotAudioClassificationPipelineTests` (#30283)
ydshieh
authored
eb75516
Fix test `ExamplesTests::test_run_translation` (#30281)
ydshieh
authored
05dab4e
Enable fx tracing for Mistral (#30209)
zucchini-nlp
authored
304c6a1
Configuring Translation Pipelines documents update #27753 (#29986)
UtkarshaGupte
authored
98717cb
FIX / AWQ: Fix failing exllama test (#30288)
younesbelkada
authored
080b700
Fix SpeechT5 forward docstrings (#30287)
ylacombe
authored
4114524
Fix SDPA sliding window compatibility (#30127)
fxmarty and ehuaa
authored
40eb6d6
Commits on Apr 16, 2024
Fix test fetcher (doctest) + `Idefics2`'s doc example (#30274)
ydshieh
authored
5fabebd
fix: Fixed a `raise` statement (#30275)
Sai-Suraj-27
authored
37b5946