AMD CPU speculation... and expert conjecture

Page 427 - Seeking answers? Join the Tom's Hardware community: where nearly two million members share solutions and discuss the latest tech.
Status
Not open for further replies.

juanrga

Distinguished
BANNED
Mar 19, 2013
5,278
0
17,790


Several years ago, studies were made on how to design a supercomputer that would be 1000 times more powerful than actual ones. The research revealed that current engineering paradigms don't work at the exascale level. The findings were summarized in a DARPA report. One of the challenges identified in the report was that at the exascale level the principle of locality holds. A traditional design dCPU+dGPU violates the principle of locality. That is the reason why the engineers of Nvidia and of AMD have selected APUs for their respective exascale supercompute designs. A dCPU+dGPU doesn't work at that scale of performance.

I believe it is time that some people at this forum stop repeating wrong mantras such as "APUs are for cheap gaming".
 

etayorius

Honorable
Jan 17, 2013
331
1
10,780
Let me make it clear, i am not saying Kaveri is crap... it is not, the GPU inside Kaveri and other features like HUMA, HSA, MANTLE and OpenCL will be massively boosted by Kaveri, i am talking about the laughable numbers Kaveri is getting compared to Richland and Trinity in regards of IPC.

As i already stated regarding Kaveri´s laughable Skyrim performance improvement against Trinity in the Spanish Forum and the Pudget review, it makes me believe Kaveri has IPC between Trinity and Richland... is this bad news? not necessarily... Kaveri is a damn great upgrade to anyone who is on a budget or who is not a hardcore gamer, like one person asked if Kaveri should be worthy upgrade compared to Llano, HECK IT IS you bet!

It is just not worth the upgrade for me (PhenomII 980 at 3.9Ghz) or anyone who has an FX or i5 CPU paired with a dGPU or greater, doing so will only be a downgrade.

Until MANTLE and HSA are well established, Kaveri will be aimed at the low-mid range budget gaming.

It`s just how it is.

But heck, we are almost hitting release Date... and if i am wrong, ill be happy to take my words back and hurry to the store and buy one, but i will not accept anything Less than 15% improvements against RICHLAND as a possible Invest in Kaveri, if Kaveri does not even manages a 15% improvement on average against Richland, there is no way i will even consider Kaveri as upgrade... hell NO.

On the contrary, the GPU inside Kaveri is really good in my opinion and MANTLE will push the performance even further, that´s heck of a Win for AMD one way or the other.
 


It's mostly about heat vs thermal solutions required to cool them. A single CPU with a 125W TDP combined with a separate GPU with 250W TDP is always, 10 times out of 10, going to be higher performance then a single 125W CPU + GPU combo chip. And that's before going into the die space issue vs cost's. Juan has been blowing smoke for a long time now for no matter what they put in an APU they can always put more/ faster of the same in a dCPU + dGPU. The laws of thermal dynamics are what limits Intel / AMD here.

EX:
Richland / Kavari vs
8350 / i7 + R9 290

In every conceivable performance profile the second combo wins as whatever vector work the iGPU can do, the dGPU can do more of and faster. HSA doesn't change any of that, it just allows the iGPU to be used for something while a dGPU does something else so your not wasting 50% of your CPU's die space.
 
I don't know why you guys are basing everything off benchmarks of cinebench 15 which looks worse than ever as a benchmark. The reviews aren't even out yet. If you look at 11.5, the performance is up 11% at stock performance and IPC is up at least 12% in cinebench.
vgno.jpg



sxrg.jpg

In a different benchmark, its up 16% at stock and IPC increase is almost 18%

 


Not really. For an HTPC you don't really need that much CPU horsepower. In particular, Just for software decoding duties, you would need lots of CPU grunt work, which I'm sure Kaveri will do just fine where Llano can't (new instructions sets + HSA promise). What you really need is decoding beauty (nVidia has the upper hand with CUDA, regrettably) and a great MoBo (sound card in particular).

Games is just a plus on the side IMO and I'm sure in that department Kaveri will be a better ride than Llano.

Cheers!
 

jdwii

Splendid


Ok what is IPC in your eyes and does if differ from performance per clock?
 

griptwister

Distinguished
Oct 7, 2012
1,437
0
19,460
So... look at what I stumbled upon... ASRock came out with a MoBo that supports 64Gb of RAM for AM3+...

http://www.asrock.com/mb/AMD/Fatal1ty%20990FX%20Killer/

Looks like a "Killer" deal. Bahahaha! Looks like I found a MoBo for my primary Rig.
 

etayorius

Honorable
Jan 17, 2013
331
1
10,780


We are not quite there yet, was not the whole reason why bulldozer was such a fail and Piledriver was introduced?
 

Ranth

Honorable
May 3, 2012
144
0
10,680


But wouldn't we run into problems with diesize and thermal limits/density negating the performance increase of the principle of locality?
 

Cazalan

Distinguished
Sep 4, 2011
2,672
0
20,810


They are for cheap gaming until someone actually ships a more powerful APU to break the paradigm. What looks good on paper doesn't mean they can actually FAB it. Until then its vaporware.
 

juanrga

Distinguished
BANNED
Mar 19, 2013
5,278
0
17,790

This is the second time that you ignore what I say just to post your anti-Nvidia rants here.

I am not "fanboy" for low power anything. I am a fan of efficient and clean ISAs/muarches. I only know one ISA that is efficient, clean and modern. If you know more let me know. If tomorrow AMD and Intel join and develop a new "x86-B" that is not an extension of an bloated and ancient ISA I will be fan of that as well.

My claimed 16FLOPs/clock figure was not said by any PR marketing guy. I got it. I can also estimate the DMIPS/MHz from the arch. I also got by myself the TDP. I got a dissipation slightly above the 6W. Marketing slides claim other number. In fact, in the other thread, people is asking from where the 6W number has come. You know it because you are participating in it, but with a different nickname sd***
 

juanrga

Distinguished
BANNED
Mar 19, 2013
5,278
0
17,790


Thanks by this new personal attack. I could not expect anything different from you.

Now you only need to take your deep understanding of thermodynamics and meet all Nvidia, AMD, Intel, Cray... engineers (Cray is one of the seven partners of Nvidia in their exascale supercomputer project).

Please meet them and explain them your "laws of thermal dynamics". It is funny that you believe that the same people who designs the current supercomputers are not aware that "10 times out of 10" a dCPU+dGPU offers more performance than an APU. Poor engineers, they must be smoking the same than me. :lol:

end-sarcasm

In fact, the same laws of physics (that you don't understand) say that your a dCPU+dGPU would offer about 10x less performance than the APU designs that those same engineers are developing for exascale level computing.



At contrary, it is the same principle (based in the laws of physics) which puts thermal and power limits to the traditional dCPU+dGPU. Scientists/engineers refer to it as the "power wall" or the "energy wall".

Or do you believe that you know more than all the engineers working at AMD, Nvidia, Intel, Cray... together?



Wrong. Berlin APU is not for cheap gaming. Its successor (Toronto APU) is not for cheap gaming...

Moreover, we already know (there is consensus among researchers) that only an APU can power exascale level compute. And no it is not waporware, but actual research made under government and other grants.

In fact, the original 10TFLOP APU design made by AMD details a FSA system architecture that is the precursor of the HSA finally introduced in Kaveri.
 

juanrga

Distinguished
BANNED
Mar 19, 2013
5,278
0
17,790


Therefore the Spanish review showing that the CPU in Kaveri offers about a 20% IPC gain (more than 30% in a pair of cases) is what you call "laughable numbers"? Wow! What were you expecting? Maybe a 2000% increase in IPC in a single generation?

The numbers that they are obtaining are very much that I was expecting. Re-read my article.

The Spanish reviewer finishes is review with

Otros juegos como Battlefield 4, Diablo 3, Naruto Ninja Storm, Xcom EU, etc han mejorado sobremanera. Pero comentaré más, mas adelante.

Por lo demás, tenemos justo los resultados anunciados y esperados.

Translated to:

Other games like Battlefield 4 Diablo 3, Naruto Ninja Storm, Xcom EU, etc have improved greatly. But I will comment more later.

Otherwise, we just have the announced and expected results.
 

noob2222

Distinguished
Nov 19, 2007
2,722
0
20,860
@juan

If you want to average the benchmarks, dont just pick the best 7 benchmarks and ignore the other 9 just because they dont coincide with your predictions of kaveri being equal to an i5 2500k.
 

jacobian

Honorable
Jan 6, 2014
206
0
10,710
Another set of benchmarks are out, including overclocked A10-7850K.

http://wccftech.com/amd-kaveri-a10-7850k-overclocked-45-ghz-benchmarked-a105800k/

Most benchmarks show a healthy gain on A10-5800K. However, when speaking of CineBench R11.5 multi-threaded:

"Only at overclocked frequencies does the A10-7850K matches the FX-6300 which is a 6-Core Piledriver based CPU."

It seems like for guys who don't need the iGPU, the FX-6300 is still a good value, being priced $120 on newegg right now.

 

Ranth

Honorable
May 3, 2012
144
0
10,680


Would you care to explain in detail what physics you are refering to because that makes no sense AT ALL...... What principle? The laws of physics, well that broad, and leave me with no clue of what you're trying to say..

So what I think you're saying is that cramming more onto a single dies reduces power-consumption, thermal density, increase performance? Sorry if I find that hard to believe.
 
Actually Kaveri looks pretty good.

I did warn the fanbois not to get all baronmatrix over it ... did anyone listen ... no.

If the damned thing was that good don't you think the reviewers would have slipped and told one of us by now who has some credibility?

:)
 
i'll freak out if toms review shows that the igpu is still bottlenecked and amd traded off keeping the igpu well-fed for cpu bandwidth.
and if there isn't a good number of well-known and used softwares supporting hsa (using vce's treatment as reference).
edit: not in this thread. in the review thread. there will be a rant.
 
I honestly have not had a chance to ask one of them but I imagine it will be seen positively ...

crashman may have a rant ... we encourage him to "let it all out" once in a while ... otherwise he gets depressed and starts retesting the PSU's under a dead short to see what holes the smoke comes out of.

I think they stopped his extreme overclocking efforts because he deliberately likes blowing stuff up.

crash is my fav because he is as diplomatic as a Chinese Tank driver ... when the cameras are not on.

:)
 
Well, de5_roy... The iGPU inside the APUs has been bottlenecked by the RAM since Llano (hitting 2133 as a ceiling, IIRC). We were all hoping for a GDDR5, EDRAM or DDR4 approach with Kaveri, but timing and because reasons we're stuck with DDR3 still. Since APUs are a cheap solution while HSA catches up peoples attention, they can't force people to switch, yet again, from Trinity/Richland to a more expensive platform without everything sorted out. Keeping Kaveri in FM2 is a cost decision, not a technical one, I think.

Anyway, the Spanish review gives more hope than not. Let's wait for Toms and the rest to confirm and see how it turns out.

Cheers!
 


Ah, but you assume that space/heat/power is not an issue. In the consumer space, it is, and given the design sacrifices that you have to make to slap a CPU/GPU on the same die, maximum performance is still made via a dedicated CPU, dedicated GPU, and a very fast link between them.

Also FYI, off die memory also violates locality. That's why we have more and more levels of CPU cache; its to hide the fact memory access is SLOW compared to the speed the CPU operates at. If you had infinite money, infinite die space, infinite levels of thermal control, then you could slap on 16GB of L1 cache if you wanted to, to maximize performance.

You have maximum performance, and maximum performance that makes financial sense.
 
Status
Not open for further replies.