https://arxiv.org/abs/2209.05433
FP8 Formats for Deep Learning (Paulius Micikevicius, Dusan Stosic, Neil Burgess, Marius Cornea, Pradeep Dubey, Richard Grisenthwaite, Sangwon Ha, Alexander Heinecke, Patrick Judd, John Kamalu, Naveen Mellempudi, Stuart Oberman, Mohammad Shoeybi, Michael Siu, Hao Wu)
tesla dojo가 configurable fp8을 썼던가요? 이 정도 low precision으로 어디까지 가능할지 궁금하네요.