Discussion about this post

User's avatar
Hao Hoang's avatar

📚 Related Papers:

- Mixed Precision Training. Available at: https://arxiv.org/abs/1710.03740

- A Study of BFLOAT16 for Deep Learning Training. Available at: https://arxiv.org/abs/1905.12322

- ZeRO: Memory Optimizations Toward Training Trillion Parameter Models. Available at: https://arxiv.org/abs/1910.02054

- FP8 Formats for Deep Learning. Available at: https://arxiv.org/abs/2209.05433

No posts

Ready for more?