The only practical deployment of FP8 is Llama 405B quantised to FP8 so that it fits into the 80GB x 8 = 640GB of VRAM of one H100...
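A rough back-of-the-envelope check of that claim, as a sketch only. It assumes 405e9 parameters, 1 byte per parameter for FP8 and 2 bytes for BF16, decimal gigabytes, and 8 GPUs of 80GB each. It counts weights only and ignores KV cache, activations, and framework overhead, so real headroom is smaller. The helper name `weight_memory_gb` is made up for illustration.

```python
def weight_memory_gb(num_params: float, bytes_per_param: float) -> float:
    """Approximate memory for model weights alone, in decimal GB."""
    return num_params * bytes_per_param / 1e9


PARAMS = 405e9          # assumed parameter count for Llama 405B
NODE_VRAM_GB = 8 * 80   # assumed: 8 GPUs x 80GB each = 640GB

fp8_gb = weight_memory_gb(PARAMS, 1)    # FP8: 1 byte per parameter
bf16_gb = weight_memory_gb(PARAMS, 2)   # BF16: 2 bytes per parameter

print(f"FP8 weights:  {fp8_gb:.0f} GB, fits: {fp8_gb < NODE_VRAM_GB}")
print(f"BF16 weights: {bf16_gb:.0f} GB, fits: {bf16_gb < NODE_VRAM_GB}")
```

Under these assumptions the FP8 weights take about 405GB and fit in 640GB, while the BF16 weights take about 810GB and do not.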