Skip to content

Latest commit

 

History

History
5 lines (3 loc) · 423 Bytes

220912 FP8 Formats for Deep Learning.md

File metadata and controls

5 lines (3 loc) · 423 Bytes

https://arxiv.org/abs/2209.05433

FP8 Formats for Deep Learning (Paulius Micikevicius, Dusan Stosic, Neil Burgess, Marius Cornea, Pradeep Dubey, Richard Grisenthwaite, Sangwon Ha, Alexander Heinecke, Patrick Judd, John Kamalu, Naveen Mellempudi, Stuart Oberman, Mohammad Shoeybi, Michael Siu, Hao Wu)

tesla dojo가 configurable fp8을 썼던가요? 이 정도 low precision으로 어디까지 가능할지 궁금하네요.