szatkus :
That's why A57 is more less two times bigger than A15?
Apple A7 is only 100mm^2. That's still not remotely massive and it leaves a lot of room for cramming in cores. But I wasn't even talking about ARM, I was talking about x86 cores and how Intel and AMD are both doing the same thing.
szatkus :
Except, again, as was discovered in the late 70s, you can't use more then a handful of cores to work on general computing tasks. You simply won't see the performance gain for the vast majority of tasks out there. So adding cores is also a dead end.
Hence why I suspect we're going to hit "peak computing" in 3-4 years, where we essentially run out of ways to improve performance through HW. At that point, we're all going to be waiting form Quantum computing to become viable.
Things have changed a lot since the 70s. People are going to have to find a solution. We're seeing things that can share memory much more efficiently and we're coming up with workloads that do scale well to lots of cores. Rendering and compiling all scale well and I don't even think we had ray tracing back then. We also have things like Apache which can serve thousands of pages a second and every page request runs in its own thread. All things that didn't exist in the 70s when the claim that more cores doesn't help.
There will be problems where more cores doesn't help though. Please do not misunderstand. But our single thread is so good right now that it's acceptable. Also, do not forget that your GPU is just hundreds or thousands of SIMD cores and those have no problem scaling to lots of cores. It can be done. It won't be easy. But a lot of things will benefit from more cores.
szatkus :
Lol no more or less the same is like -5%-+5%. I7 920 vs I7 3770K which haswell is even faster in dolphin emulator over IPC improvements, I7 920 is 15% less efficient in IPC compared to Ivy and probably closer to 20% slower per clock compared to Haswell. That is from 2008-2014 only 6 years would you like me to pull out some horrible Pentium 4 IPC benchmarks(from 04) i hope not. Amd on the other hand has been around the same.
http://alienbabeltech.com/main/platform-upgrade-core-i7-920-vs-i7-3770-at-4-2ghz-featuring-ecs-golden-series-motherboard-and-kingston/7/
http://www.anandtech.com/show/2658/16
I7 965 was around 10%-15%(some cases 30%) faster ipc IPC compared to the Intel core 2 extreme Qx 93770 at the same clock rates
Then we can see the improvements on the quad core extreme vs Pentium dual core from 05 sure its around 10-15% as well. Overall from 04-14 we saw a solid 40-50% boost in IPC compared to a pentium 4 extreme and a I7 4770K.
We can also see sandy to haswell is around 10-15% faster in IPC
http://www.hardocp.com/article/2013/06/01/intel_haswell_i74770k_ipc_overclocking_review/1#.U-O-2fldXJY
I'm not to sure how much longer these core improvements will continue.
Compare the instruction sets of Haswell to predecessors and you will understand that Haswell's performance in Dolphin does not come from architecture improvements, but from new instructions. Here's a little shocker for you. When I custom compile Blender in Gentoo for both my i7 920 AND by FX 8350, they see massive improvements. i7 920 is 50% faster and FX 8350 is over 80% faster. Dolphin's performance is not some sort of Haswell miracle, it is the Dolphin Team doing a very good job at compiler optimizations, specially because they have something that won't even run on older hardware well so they have no reason to even care about being compatible with Pentium 3 level instruction sets like 99% of Windows applications.
Also, I was arguing that after Nehalem things slowed down massively. P4 was a blunder and an exception but the differences between the AMD and Intel cores of the time were generally huge improvements.
szatkus :
That was fascinating.
But, really, do you believe that? I don't see any reason single-thread performance is at a standstill, because it has steadily gone upward since the beginning of CPUs.
Haswell did improve single thread performance, and as the size of the process technology drops lower and lower (14nm, 10 nm, etc.) we will only be able to fit more processing components and transistors, really.
I don't think it will stop, and even as the industry leans toward more and more cores, the single thread performance will only improve as the techology improves.
Yeah, I do.
http://www.tomshardware.com/reviews/core-i7-4770k-haswell-review,3521.html
Look at what changed with Haswell:
out of order window increased 14%
an extra ALU
an extra store
larger L2 TLB
And you don't see a gain over 10% unless you're using new instructions anywhere. Intel has already plucked the low hanging fruit from branch predictors and stuff.
Take a look at the [H] review since they locked frequency and you don't have to be concerned with CPU turbos mucking up scores and not having any clue what the chips frequency actually is during a benchmark.
4% faster in Hyper Pi, 5% faster in wPrime, 12% cinebench (which is multi-threaded), 21% in POV-Ray (multi-threaded), 9% in LAME, hardly anything in WinRar. The only time Haswell shows big gains is when it's using more than one thread, which shows me more that Intel increased performance of how cores work together more than they did with increasing actual single thread performance. Those are huge gains and Haswell was supposed to be a big "tock" architectural change and we get a whole lot of tiny increases with a few oddball gains.
Compare to a review of FX-57 or E6600
http://www.anandtech.com/show/1722/4
http://www.trustedreviews.com/Intel-Core-2-Duo-Conroe-E6400-E6600-E6700-X6800_PC-Component_review_photoshop-virtualdub-results_Page-5#tr-review-summary
X6800 is 53% faster than P4EE 955 in Photoshop. Those are the kinds of changes we will never see again that I am talking about.