If launch dates aren't important then you can compare the 7700 to skylake, right?
No, because it was technically possible for AMD to have launched the 7700 when they launched the 7700X - they simply
chose not to.
Btw, I highly doubt that even at 88w it's as fast as a 13600k - but ill let this slide.
That's what the data in my plot shows. If you have other data, feel free to share it. I'm sure there are some workloads where the i5-13600K is faster, but not the multithreaded ones that ComputerBase.de chose to include in their test set.
Again, you are looking at efficiency completely wrong. The 7700 doesn't have better ST efficiency, it's just slower.
Efficiency is work per unit of energy (approximated here as average power). You can define other metrics, but efficiency allows a slower CPU to win if its power usage decreases by even greater than its speed.
You need to normalize for either performance or power. The below graph demonstrates the point much better.
...
You must not have looked at them very carefully, because the only difference is that they sorted in the reverse order. The one I quoted from the Ryzen 7700 review put the most efficient at the bottom, while the one you quoted put the most efficient at the top. Also, maybe they changed the Cinebench version, because the 7700 went from 105.2 Pts/W to 112.1 Pts/W in yours!
The closest Intel CPU in terms of ST performance to the 7800x 3d is the 13400f (1796 vs 1817 score), and the Intel part has it easily beat.
Now, you're talking about performance, not efficiency.
Then as you move to the more high end CPUs, the 13900k absolutely blasts the 7950x and the 7950x3d WHILE being faster, which is actually insane.
It's not insane - it's exactly like I said. The 7950X has higher clockspeed limits than the 7700, which allows a single threaded job to push well past the point of diminishing returns, in terms of perf/W. The 7950X also has the disadvantage of the extra compute die, which is just dead weight in any single-threaded efficiency comparison.
I'm sure you can close much of the gap between the 7950X's efficiency and that of the 7700, if you just limited the former to a single-core limit of 5.3 GHz, instead of the 5.7 GHz turbo limit it has by default. This is what one should do, if they wanted the multithreaded performance of the 7950X, but didn't want to take such a hit on single-threaded efficiency.
I said this before: Zen 4 is efficient in its sweet spot. The 7950X's boost limit is well outside that sweet spot.
It's 50% more efficient - if you match the actual performance or power the difference would probably go over 100%. In other words, the 7950x needs twice the power to match a 13900k in ST tasks.
Your conceptional model is completely broken, here. Any overclocker would tell you that power increases needed to push higher frequencies grow at a nonlinear rate. You can't use an efficiency ratio to predict how much power one or the other CPU would need at a given performance level. It might be that
no amount of power could extract equivalent performance from the 7950X, at a given task.
If someone looks at these results realistically they'd understand that this isn't just a generational leap, that gap would take years (lots, lots of years) to cover.
Your analysis is also deeply flawed in this part. AMD simply chose to make Zen 4 smaller. Zen 4 is primarily targeted at servers and laptops. In such cases, you don't optimize for high clockspeeds, because that requires a lot of die area and always burns a lot of power. Instead, it's targeted to be efficient in a lower window, but it's also smaller. This enables them to pack more full-size cores (or half-size Zen 4C cores, which are basically identical except for half the L3 cache), achieving strong multithreaded performance and efficiency.
In contrast, Intel opted to make Golden Cove big and complex, but added the small and simple Gracemont cores to balance it out. That hybrid strategy works okay for desktops and laptops, but not servers.
Attribute | Zen 4 | Zen 3 | Golden Cove | Comments |
---|
Reorder Buffer | 320 | 256 | 512 | Each entry on Zen 4 can hold 4 NOPs. Actual capacity confirmed using a mix of instructions |
Integer Register File | 224 | 192 | 280 | |
Flags Register File | 238 | 122 | Tied to Integer Registers | AMD started renaming the flags register separately in Zen 3 |
FP/Vector Register File | 192 | 160 | 332 | Zen 4 extends vector registers to 512-bit |
AVX-512 Mask Register File | 52 measured + 16 non-speculative | N/A | (152 measured via MMX) | Since SKL, Intel uses one RF for MMX/x87 and AVX-512 mask registers
However Golden Cove does not officially support AVX-512 |
Load Queue | 88 (136 measured) | 72 (116 measured) | 192 | All Zen generations can have more loads in flight than AMD’s documentation and slides suggest. Intel and AMD have different load queue implementations |
Store Queue | 64 | 64 | 114 | A bit small on Zen 4, would have been nice to see an increase here |
Branch Order Buffer | 62 Taken
118 Not Taken | 48 Taken
117 Not Taken | 128 | |
Source: https://chipsandcheese.com/2022/11/05/amds-zen-4-part-1-frontend-and-execution-engine/
How do I know it will take years?
You don't. At this point, you have no idea what's going on. You're piling invalid assumptions atop more invalid assumptions.
What's absolutely bonkers is the
real performance and efficiency advantages of Naples and Bergamo over Sapphire Rapids (or even Emerald Rapids). This is what AMD designed Zen 4 to do and it's why AMD's datacenter marketshare keeps growing. They care about that
much more than the desktop performance segment.