AgentBirdnest
Respectable
I love your number crunching.I don't think so. I mean, new architecture with 67% more raw bandwidth, supposedly even more if you look at the total Infinity Cache bandwidth, and theoretically 160% more compute. And in practice it performs as if a lot of that bandwidth and compute isn't realized in the real world. Having to go over the extra Infinity Fabric to get to L3 cache could be a big part of this.
[snip]
It's also worth looking at relative chip size and performance. AMD has a 300mm^2 GCD plus 220mm^2 of MCDs. Some of that size is due to the Infinity Fabric linking the chiplets together. Nvidia meanwhile has a 379mm^2 die that has a lot of extra features (DLSS and DXR stuff). I'd wager the RTX 4080 actually costs less to manufacture than Navi 31, factoring in everything.
AMD is going to need to prove with RDNA 4 that chiplet GPUs can continue to scale to even higher levels of performance without sacrificing features. They certainly haven't done that with RDNA 3. A monolithic Navi 31 without chiplets probably would have been in the 400mm^2 range and offered even higher performance. That's just my rough estimate, and we can't know for certain, but I'd love to know how much AMD actually saved at the end of the day by doing chiplets.

It's really interesting, and makes you wonder what could have been with a monolithic die, and why use chiplets if it's not cheaper (I'd guess they are expecting it to get cheaper with next gen, or something?)
It'd be interesting to see someone do a deep-diving story about this topic.