It's good to see AMD offering either a refresh or a completely new AI accelerator each year.
The MI325X AI accelerator looks more like an interim solution: it is a beefed-up refresh of the current MI300X, composed of eight compute, four I/O, and eight memory chiplets stitched together.
So AMD is now confident that its MI325X system can support a 1-trillion-parameter model? But it is still focusing on FP16, which requires twice as much memory per parameter as FP8.
The MI300X does have hardware support for FP8, but AMD has generally focused on half-precision performance in its benchmarks. At least for inference with vLLM, the MI300X was stuck at FP16, since vLLM lacks proper support for FP8 data types.
This isn't an apples-to-apples comparison, though. It's great to have 288GB of capacity, but I worry the extra memory advantage gets overshadowed against Nvidia's H200 running a model at FP8, since the same model at FP16 on the MI325X would need twice the memory. That said, it appears AMD might have overcome this limitation with the new accelerator.
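The FP16-vs-FP8 memory math above is easy to sanity-check. Here's a minimal back-of-the-envelope sketch, assuming a weights-only footprint (no KV cache, activations, or optimizer state) and the announced 288GB (MI325X) and 141GB (H200) HBM capacities:

```python
# Back-of-the-envelope weight memory for a 1-trillion-parameter model.
# Assumes weights only; real deployments also need KV cache and activations.
PARAMS = 1_000_000_000_000

def weight_memory_gb(params: int, bytes_per_param: int) -> float:
    """Return the weight footprint in GB (1 GB = 1e9 bytes)."""
    return params * bytes_per_param / 1e9

fp16_gb = weight_memory_gb(PARAMS, 2)  # FP16: 2 bytes/param -> 2000 GB
fp8_gb = weight_memory_gb(PARAMS, 1)   # FP8:  1 byte/param  -> 1000 GB

# Minimum accelerators needed just to hold the weights:
mi325x_count = fp16_gb / 288  # MI325X at FP16: ~7 accelerators
h200_count = fp8_gb / 141     # H200 at FP8:    ~8 accelerators
print(mi325x_count, h200_count)
```

Even with 288GB per card, running at FP16 needs roughly as many accelerators as an H200 running the same model at FP8, which is exactly the worry about the capacity advantage being cancelled out.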
I hope so!