Question on loading Llama 405B FP8 using the HF Transformers API #76

Open
Neo9061 opened this issue Oct 16, 2024 · 1 comment

Neo9061 commented Oct 16, 2024

Based on this notebook: https://github.com/huggingface/huggingface-llama-recipes/blob/main/local_inference/fp8-405B.ipynb

Since we are loading FP8 weights, does it matter if we specify torch_dtype as torch.bfloat16?

CC @ianporada, who made recent edits to this notebook.

ianporada (Contributor) commented:

I believe it doesn't matter. Even if you didn't specify torch_dtype, it would default to the value in config.json, which is also "torch_dtype": "bfloat16".

Keep in mind that not all weights are FP8; some weights of the quantized model are still in BF16.
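
For reference, here is a minimal sketch of the loading pattern in question (the model id is assumed from the linked notebook, and the dtype check at the end is only illustrative, not part of the notebook):

```python
import torch
from collections import Counter
from transformers import AutoModelForCausalLM

# Model id assumed from the linked notebook; adjust if yours differs.
model_id = "meta-llama/Llama-3.1-405B-Instruct-FP8"

# torch_dtype only affects the non-quantized modules; omitting it falls back
# to the "torch_dtype" entry in config.json ("bfloat16" for this checkpoint),
# so either way the non-quantized weights end up in BF16.
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,
    device_map="auto",
)

# Count the dtypes actually loaded: the quantized linear weights are FP8,
# while layers such as embeddings and norms remain BF16.
print(Counter(t.dtype for t in model.state_dict().values()))
```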
