bfloat

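A bfloat16 has the same range as float32, but less precision.
Precisely, it keeps float32's 8 exponent bits but cuts the mantissa from 23 bits to 7. Compared with float16 (5 exponent bits, 10 mantissa bits), the exponent is given more bits and the mantissa fewer.
You use bfloat16 to shorten training time: each value takes half the memory of a float32. Empirically, model performance is largely preserved.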
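
A minimal sketch of the bit layout, using only the Python standard library. It assumes the simplest conversion, which is to keep the top 16 bits of the float32 pattern (truncation); real hardware and libraries usually round to nearest instead. The helper names are made up for this note.

```python
import struct


def float32_to_bfloat16_bits(x: float) -> int:
    """Return the 16-bit bfloat16 pattern for x by truncating its float32 pattern.

    Assumption: plain truncation (round toward zero), not round-to-nearest.
    """
    (bits32,) = struct.unpack("<I", struct.pack("<f", x))
    # Top 16 bits = 1 sign bit + 8 exponent bits + 7 mantissa bits.
    return bits32 >> 16


def bfloat16_bits_to_float(bits16: int) -> float:
    """Expand a bfloat16 pattern back to a float by zero-filling the low 16 bits."""
    (value,) = struct.unpack("<f", struct.pack("<I", (bits16 & 0xFFFF) << 16))
    return value


def roundtrip(x: float) -> float:
    """Value of x after being stored as bfloat16 (under the truncation assumption)."""
    return bfloat16_bits_to_float(float32_to_bfloat16_bits(x))


if __name__ == "__main__":
    # Precision is lost: only 7 mantissa bits survive.
    print(roundtrip(3.14159265))  # 3.140625
    # Range is kept: 3.0e38 is far above float16's maximum of 65504,
    # yet it stays finite in bfloat16.
    print(roundtrip(3.0e38))
```

The first print shows the precision loss, the second shows that a value near the top of the float32 range survives the conversion.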

1. sources

reddit

Created: 2025-11-02 Sun 18:49