- Notifications
You must be signed in to change notification settings - Fork 31.4k
Open
Labels
Description
Feature request
Support H100 training with FP8 in Trainer and Deepspeed
Motivation
FP8 should be much faster than FP16 on supported Hopper hardware. Particularly with Deepspeed integration @stas00
Your contribution
Happy to help in any way that I can.
float-trip, noobmaster29, AntreasAntonio and umarbutlerstas00