Gon Freecss :
juanrga :
Gon Freecss :
@juanrga;
Wanted to ask. How does the Zen architecture perform with the AMD compiler compared to the Skylake architecture with the Intel compiler? Also, how does it perform when you specifically code for both architectures? On average.
When running GCC with medium-level optimization flags, the 32-core Zen is 39% faster than the 22-core Broadwell in base score. When using Intel and AMD compilers, the performance gap is reduced to 15%.
The Intel compiler is better and extracts more performance from the hardware.
Broadwell has only 256bit SIMD units. Skylake Xeons have 512bit units. Using specific code for Skylake Xeon can increase several server/HPC workloads by huge amounts such as 50% or 80% faster than Broadwell Xeons.
https://www.anandtech.com/show/11615/intels-data-center-event-live-blog-830am-pt
Doesn't this show the Zen architecture being slow? >45% more cores, yet only ~15% more performance.
Actually that depends entirely upon the workload.
If you notice, SPECfp peak and base are ~50% faster on EPYC
SPECInt peak and base are approximately 20% faster on EPYC
In Data center, the typical usage case trumps single core IPC most of the time.
Not to mention we have no idea if these SPEC benchmarks still included libquantum and the other rubbish they recently removed because it unfairly skewed benchmarks toward Intel in benchmark scenarios.