Heise.de publised their SPEC2006 results for Platinum 8180(28C/56T), Epyc 7601(32C/64T), Gold 6148(20C/40T) and E5-2699 v4(22C/44T), all of them in 2P configuration.
https://www.heise.de/ct/ausgabe/201...6148-E5-2699v4-und-AMD-Epyc-7601-3787494.html
All measurements with 64-bit code, without auto-parallelization, without SmartHeap with Intel System Studio 2017Up4 (opt: AVX512 / AVX2)Or x86-open64 4.5.2.1 (AMD, opt for bulldozer / piledriver).
It's paywalled(1.5 Euro). I've had a look at it and the most important findings are:
2X Epyc 7601 and 2X Xeon Platinum 8180 are extremely close. Xeon was 12% ahead in int_rate_base and 4.5% ahead in fp_rate_base.
Without NUMA optimiztion, Epyc is ahead in STREAM-Triad for very low thread counts(<8) sometimes by almost 2X over Skylake-SP Xeons. The 8180 is bandwidth-starved compared to the 6148 by 35% for 16-32 threads. When fully saturated with the maximum permissible number of threads, bandwidth on Skylake-SP dips while Epyc remains constant.
The huge performance disparity in libquantum is obliterated when compiled for 64-bit. Compile times were reduced to two seconds from 10 minutes on 32-bit mode. Yes, minutes. Epyc and Xeon 8180 give the same performance even with AVX512 enabled on the latter in 64-bit mode.
tamz_msc, Aug 12, 2017
SPEC2017 results by heise.de are out:
Something is wrong with the integer performance on EPYC on the 'speed' benchmark(which measures time to completion for a single job) - 2x Xeon Platinum 8180 is over 60% faster than 2x Epyc 7601 using intel17 -O3. Same issue with gcc, so might be a compiler issue. AMD says their own AOCC does fine, but not tested due to time constraints.
This discrepancy is not there when running the 'rate' benchmark(measuring throughput running multiple copies of each job) - Xeon Platinum is only ~5% ahead of Epyc with intel17 -O3.
There is no issue with fp performance however. In the 'speed' benchmark Xeon Platinum is just ~4% ahead of Epyc. However, in the throughput-oriented 'rate' benchmark Epyc pulls ahead of the Xeon Platinum by ~16%, again with intel17 -O3. Due to the huge memory bandwidth it offers, in some of the benchmarks, like fluid-dynamics using the lattice-Boltzmann method, Epyc is over 55% faster than Xeon! Epyc does very well in general with throughput-oriented numerical PDE solvers from what I've seen, which is expected because these kinds of HPC workloads absolutely love memory bandwidth and generally run without issues when made NUMA-aware.
AVX512 does nothing over AVX2 with the Intel compiler; AVX2 is still slightly faster for the 8180, however with the 20C Gold 6148, AVX512 is marginally faster than AVX2 in the 'speed' benchmarks, which suggests that there is some kind of bottleneck with the higher core count chip; which is ironic given Intel's claims with the mesh topology and cache restructuring, but not entirely unexpected. On gcc 7.1 however, skylake-avx512 opt. flag is behind broadwell opt. flag with the 8180, sometimes by a long shot with the rate benchmark.
Bottom line(my interpretation): it's down to application preferences when it comes to which processor you're going to buy. There are still issues with int performance on Epyc, so Skylake-SP is the winner integer workloads as of now. However, Epyc is obviously much faster in fp performance, and since AVX512 does nothing at the moment for Intel, it is obviously the better choice.
tamz_msc, Saturday at 9:16 AM
Intel’s upcoming 8th generation desktop platforms have taken the internet by storm and there’s nothing that can stop them. In reaction to the leaks, online retailers have started listing the CPUs (Coffee Lake) up for pre-order along with engineering samples that are available on several China based retail sites.
Intel Coffee Lake Pre-Orders Pop Up at Online Retail Stores – Core i7-8700K, Core i5-8600K and Core i3-8350K Included In The Listings
There are a plethora of online retail sites that have listed the Intel 8th gen desktop processors codenamed Coffee Lake. Of course, I won’t be mentioning prices as they vary from retailer to retailer and are not indicative of the final MSRPs which would be much lower. Regardless, you might want to check out the following package names to find if your nearest retailers has the Coffee Lake desktop chips listed for pre-order.
Note – Final clocks are still not determined, the clock frequencies mentioned below are directly from Intel slides however pre-orders reveal different clock speeds so take them with a pinch of salt.
Intel Core i7-8700K – The Coffee Lake Flagship With 6 Cores and 4.3 GHz Boost Across All Cores
The Intel Core i7-8700K will be the flagship processor of the lineup. It will feature 6 cores and 12 threads. The chip will be compatible with the LGA 1151 socket and rumors are that board makers will extend support of Coffee Lake onto 200-series and even 100-series platforms. The chip will be Intel’s first hexa core product and will be based on the 14nm process node.
In terms of clock speeds, we are looking at a 3.7 GHz base frequency which boosts up to 4.3 GHz (6 core), 4.4 GHz (4 core), 4.5 GHz (2 core) and 4.7 GHz (1 core). These clocks are really impressive so we can expect much faster gaming performance on Coffee Lake parts than the current generation of CPUs. The chip is fully unlocked allowing for overclocking and comes with a TDP of 95W. There’s 12 MB of L3 cache on board along with a GT2 tier iGPU. The chip supports DDR4-2666 MHz memory (native) and up to 4400 MHz+ (OC). We can expect the Core i7-8700K to cost around $349 US when it launches in Q3 2017.
Intel Core i5-8600K – The Gamer Aimed Hexa Core With Overclocking Capability and 4.4 GHz Boost
The Intel Core i5-8600K is the second unlocked chip in the lineup and features a hexa core design. We have learned that Intel Core i5 series will stick with 6 cores but they will not feature a multi-threaded design. The chip probably features 9 MB of L3 cache while being supported on the LGA 1151 socket.
In terms of clock speeds, we are looking at 3.5 GHz base, 4.2 GHz (6 core), 4.2 GHz (4 core), 4.2 GHz (2 core) and 4.4 GHz (1 core) boost clocks. The chip will feature a TDP of 95W and we can expect a price range of around $249 US which will make it a pretty good option for gamers that want some extra cores in their hands.
Intel Core i3-8350K – A Quad Core i3 Processor With 4 GHz Clock Speed
The Intel Core i3-8350K might be the only Core i3 chip in the 8th Gen family that enables overclocking support. Intel’s 8th Gen Core i3 family is also the first to feature 4 cores which will be a great deal for budget builders. The Intel Core i3 8350K comes with 4 cores and 4 threads that are clocked at 4.0 GHz. The chip features 8 MB of cache and will feature a TDP of 95W.
There’s also the Intel Core i3-8100 which is a 4 core, non-multi threaded SKU. This chip has a lower clock speed of 65W and features a clock speed of 3.6 GHz. There’s total of 6 MB of L3 cache on the processor itself which confirms the exclusion of multi-threading. The Intel Core i3 chips will not feature boost clocks but they will have a more stable base clock speed and we can expect prices of the chips to be in the sub-$200 US range.
Intel Pentium G4660, Pentium G4700 and Pentium G4720 With 2 Cores, 4 Threads and Upto 3.9 GHz Clock Speeds Also Detailed – Under $100 US
The Intel Coffee Lake family will also feature at least three Pentium series SKUs that include the Pentium G4660, Pentium G4700 and the Pentium G4720. These chips will feature 2 cores and 4 threads with clock speeds rated up to 3.90 GHz. There will also be 4 MB of L3 cache on board the chips and prices will be adjusted around the $100 US bracket. Rest of the processors that will be featured in the family are listed below.
We have some good news for low-power AMD builders this morning: AMD has (finally) begun to sell the 35W versions of their "Bristol Ridge" desktop APUs. Overall the company has released 3 35W retail Bristol Ridge SKUs, the A12-9800E, A10-9700E, and A6-9500E, with these chips fleshing out the low-power segment of AMD's AM4 platform through the end of the year.
AMD originally released its Bristol Ridge A9000-series APUs to OEMs in mid-2016, targeting desktops and laptops. The SoCs integrate one or two Excavator v2 modules (two or four x86 cores in AMD’s nomenclature), a Radeon R5/R7 iGPU featuring AMD’s GCN 1.2 (3rd generation) architecture and up to 512 stream processors, a dual-channel DDR4 memory controller and so on. Earlier this year AMD finally decided to release a rather broad lineup of its 7th-generation A9000-series APUs on the retail market, enabling end-users to build their own inexpensive AM4 systems, essentially popularizing the AM4 ecosystem compatible with the company’s latest Ryzen processors in general.
The originally planned retail lineup for Bristol Ridge included 11 SKUs with a mix of 35 W and 65 W TDPs, but until now, only select 65 W SKUs were available. In the meantime, the interest towards inexpensive Mini-ITX systems is high these days. AMD spent a little over a month after the formal introduction of the family to bring the 35 W A12-9800E and A10-9700E APUs to store shelves. In the last days of August, the new chips hit store shelves across the world: they can be purchased in the U.S., Europe and in Japan, a clear indicator that they are indeed available broadly.
The AMD A10-9700E costs $85 – $91 in the U.S., whereas the more advanced AMD A12-9800E is available for $105 – $113, depending on the retailer. Meanwhile, the A6-9500E, the most basic 35 W Bristol Ridge APU, can be purchased for around $55 – $58. In Europe and Japan, prices are traditionally higher, see the table below for details. Overall, the 35 W APUs from AMD do not look very affordable, but it is typical for chipmakers to charge extra for energy-efficient parts.
From AMD’s standpoint, the main purpose of Bristol Ridge CPUs and APUs is to fill the gap in the AM4 lineup before the company releases Raven Ridge APUs and cheap CPUs based on the Ryzen microarchitecture later on.
Click here for a link to the articleThe Raven Ridge APU has 4 ZEN CPU Cores and the IGP gets 704 Shader Processors, which is substantial I must say. The shader procs should be similar to the Vega generation. The engineering sample APU is tagged under SKU 2M3001C3T4MF2_33/30_N. And from that we can extract a 3.0 GHz base clock frequency with a 3.3 GHz boost (again this is an engineering sample). The APU would SMT to 8 treads. The 704 shader processors run at 800 MHz. It has 2MB (4x512kB) of L2 and 4MB of L3 cache. The GPU thus has 11 Compute Units and with 64 Stream Processors per CUthat forms 704 Stream Processors.
There is a rumor that there will be an even more high-end APU based on Raven Ridge, with 1024 Shader processors, which is the equivalant to a RAdeon RX 560.