AMD CPU speculation... and expert conjecture

Page 328 - Seeking answers? Join the Tom's Hardware community: where nearly two million members share solutions and discuss the latest tech.
Status
Not open for further replies.

Hmm, that is an quite interesting assumption..
 


How much die savings would they get though? If AMD wanted a bigger piece of the HPC market they should do that.
 


Some savings by not including the texture units and some of the prefetch stuff. GCN is an interesting design, each CU is essentially a weak 64-core vector CPU that can also do scalar instructions. You could theoretically run an entire OS off a GCN processor, if you could find some way to socket it on a MB with some form of BIOS boot function. In a HSA environment the scalar components aren't needed as your gonna use the CPU for that anyway.

The big decision comes in how many CU's to employ. Each CU is a vector processor with 64 vector units (believe each is 32-bit's wide but not sure), that is one helluva SIMD processor. A CU has pathetic scalar performance and outstanding, phenomenal vector performance. It's also the component that takes up the most room on the iGPU side. You can reduce the number of CU's to fit in another scalar module. That reduces your vector performance while enhancing your scalar performance. Desktop world needs more scalar then vector while HPC world often purchase's dedicated vector processors (single dGPU has 32CU's for comparison).

It's a big balancing act, I can see a 3M 8CU design being made.
 
Good to see AMD correcting the misunderstandings in this thread

“Sometimes there’s a misconception that ARM means very low power and very low performance,” says Kamal Khouri, director of product management for AMD’s embedded division, ” but this is truly a high performance SOC taking advantage of ARM’s first 64-bit architecture."

I estimate HieroFalcon would perform as the top Kaveri Steamroller.

AMD continues without confirming/denying the 3 ALUs per core diagram.

 


Berlin has been presented as both APU and CPU, therefore I would expect the same for Steamroller in the desktop: Kaveri (APU) and Athlon (CPU)
 

He is likely reffering to FX...
 
Here's one tiny FM2+ board for you.

gigabyte-a88x-mitx.jpg


http://www.fudzilla.com/home/item/32878-gigabyte-has-a-killer-fm2%20-itx-board
 


I think there is no FX Steamroller.
 


SR code has been posted for some time now. (BDVER3.MD) There are 2 ALUs.

http://gcc.gnu.org/ml/gcc-patches/2012-10/msg01079/bdver3.md

;; The bdver3 contains three pipelined FP units and two integer units.
;; Fetching and decoding logic is different from previous fam15 processors.
;; Fetching is done every two cycles rather than every cycle and
;; two decode units are available. The decode units therefore decode
;; four instructions in two cycles.


HieroFalcon is such a different product than Kaveri. It is CPU only and no graphics. They're not very comparable.
 


It may not be called FX specifically, though I have assurances that there will be a suitable replacement. It may be on FM2+, there may be an entirely new socket for it...I haven't been able to pry out any further information yet. (The rep has been extremely resourceful about dodging questions in such a manner as to address them, but ever so vaguely as to give nothing concrete...)
 
Given that AMD sales of HEDT went up this quarter, and sales of APU-based mobile devices went down, I would be pretty surprised if they didn't produce something for the HEDT crowd with Steamroller.

Make that rep squirm! He or she gonna break eventually 🙂
 


My point is that GloFO can not keep having vaporware nodes. They have to be cancelled for a reason and I'm suggesting that the reason is that they are just moving faster and cancelling what they were working on.

GloFo is backed by ATIC and a whole lot of oil money. I really wouldn't be surprised if the folks in charge of GloFo had no qualms about just tossing money away.

Also, the Extreme in Extreme Mobility can either mean it's

1. Made for extremely mobile products
2. is designed for mobile products but can scale extremely well to higher end products.

I have not seen proof towards either one, however I am at least pointing out that there are alternatives to the suggestions provided about 14nm XM. For all we know GloFo got lucky and has a 14nm/20nm hybrid process that sips power and scales better than everyone else. Or worst case it is another vaporware product.

Also, new roadmaps from AMD:

http://www.slideshare.net/wilfredlin/amdembeddedroadmapunveilvfinal-130906181555

Of note (copy pasted from my OCN account of a different name)


1. AMD is not telling us what 60% to 50% of their future business plans are (slide 3).
2. ARM is not replacing Steamroller APUs (slide 9)
3. ARM is not replacing Jaguar (slide 9)
4. AMD is not even getting close to replacing Jaguar with ARM as the only ARM part on slide 9 is an (up to) 8 core A57 APU

Thus, I think it's safe to infer that:

1. AMD's products are either mobile/embedded or classic HEDT/server. If AMD is moving 40% to 50% of their product lineup to mobile/embedded high growth markets, it means that the remainder can ONLY go to HEDT/server as that's the only other things AMD makes

2. AMD has a Steamroller v2 out there and it has better HSA features according to these slides.

3. AMD has little faith that ARM will cause a massive disruption in x86 but instead will co-exist.

AMD's own market projections have ARM taking about .5% from x86 and .5% from MIPS by 2016. That is hardly disruptive at all.

I find it curious that there are no low power ARM chips with GCN cores. Perhaps AMD is looking at Tegra and realizing doing something similar is a giant waste of time and resources. Regardless it means Temash is here to stay.

Surface 2 Pro reviews popped up today with the new Haswells.

http://www.anandtech.com/show/7440/microsoft-surface-pro-2-review/4

~50% increase in battery life and it still is blown away by ARM. And this is what Intel is optimizing their big cores for.

Now do you see why I think it's absolutely foolish for AMD to walk away from the HEDT/Gaming CPU market? They can beat Intel with Temash and then beat Intel in HEDT because Intel is off trying to compete with efficient, simple RISC ARM designs with a bloated x86 CPU.
 

Well, looks like we found the A10-X800K for Kaveri. Because of the DDR3 bottleneck, I would assume its iGPU has about 7770GHz performance.
 
Look at the iGPU frequency and take the DDR3 RAM into account... I think the performance will be around a 7750, maybe less... still very impressive.

832sp would make a hell of an iGPU.
 


Good find of that slide, hadn't seen the updated pics for the GPU+RAM before. It seems to correspond with what others are saying, that the ARM parts are tailored more as network processors than a replacement for Kabini/Temash.
 
AMD makes GPU comute a reality with hQ
Heterogeneous Queues make GPUs a full peer in compute
http://semiaccurate.com/2013/10/21/amd-makes-gpu-comute-reality-hq/
http://techreport.com/news/25545/amd-heterogeneous-queuing-aims-to-make-cpu-gpu-more-equal-partners
looks good for kaveri
http://www.tomshardware.com/news/amd-apu-performance-increase-drivers-windows-8.1,24773.html


yeah. so far none of amd's arm socs have gcn attached. amd associates the 8 core arm soc with a57 cores as high performance... yet the soc has vanilla arm cores and looks.... generic.

the slideshow was about amd's embedded cpus/apus/socs but it points to amd's general direction and it's not HEDT. amd seems to assume that arm and x86 will have near-equal overall marketshare in 2016 and it wants a piece of both. good call, highly dependent on timely execution and consumers - as current arm devices are highly dependent on consumers. to have that much marketshare, arm cannot slow down. we'll get a clearer idea when amd seperates it's semi-custom dept. numbers from gfx numbers.

as for amd's HEDT sales going up: amd said fx cpus sold more, never said which ones. so far, 4 core fx cpus (e.g. fx4300) have far outsold 6 and 8 core ones and those are much cheaper too. quad core fx is part of amd's hedt lineup by brand. and amd's hedt is very different from intel's hedt, don't confuse the two. richland seems to have underwhelmed in laptops - that was expected. amd can't expect to screw with customers for long.
 
so far, 4 core fx cpus (e.g. fx4300) have far outsold 6 and 8 core ones and those are much cheaper too

Do you have a reference for that? I can't imagine why anyone would buy FX4300 when FX6300 is almost the same price. Not sure what you mean by 'those are much cheaper too'- where I live only 40 euros separate the FX4300 and the FX8320.
 

prices are different in different places.

the latest reference i found was this:
http://www.techpowerup.com/175751/apus-make-up-nearly-75-of-amds-processor-sales.html
the link has a pie chart showing core numbers. i think 2013 numbers will come out later this year or in 2014.

edit: this year could be different if new games and softwares, consoles and price drops pursuade users to buy 6-8 core fx cpus. all the info i've read so far don't indicate if the situation in the tpu link has changed.
 


Each Steamroller _module_ contains three FPU units and two integer units. But I am discussing about something different. I am asking AMD if each _integer unit_ has 2 or 3 ALUs. Hot Chips 2012 showed two integer units with 2 ALUS each (4 ALUs per module). Sep Kaveri talk showed two integer units with 3 ALUS each (6 ALUs per module).

When I wrote "Kaveri Steamroller" I did really mean only the CPU part of Kaveri. I was comparing CPU to CPU.



My point was not about branding name.



You mention slide 3 and 9 but I think you mean 4 and 10. AMD plans are about moving outside the PC up to a point where most of the revenue comes from non-PC market.

Steamroller APUs will be replaced by Carrizo APUs.

The replacement of x86 by ARM will be made in phases. AMD is _first_ replacing jaguar in server and tablets _then_ other products will follow.



With new head, AMD projections are overly-conservative. Their Q3 results are much better than predicted, for instance. Also relevant this AMD slide:

why-arm-will-win.jpg


The slide is pretty clear as are also AMD statements about Warsaw and their future server plans.



As mentioned before, AMD will be replacing Temash tablets by ARM+GCN tablets this or next year

http://techreport.com/news/25461/report-amd-to-introduce-arm-based-tablet-chip-this-year

Finally I want to emphasize again that AMD is already devoting time and resources to a custom high-performance ARM chip as Nvidia does. The problem with AMD was the myopia of the _former_ head. AMD is too late to the party, that is why Seattle and HieroFalcon use standard A57 cores. AMD will compete with standard cores before has ready its own cores.
 


No surprise here. It has been know since day one Xbox1 has three OSs, one for games, another for w8 apps and a third one that manages the other two

http://www.engadget.com/2013/05/21/xbox-one-runs-three-operating-systems/

You can have a game on the one part of the screen and then a W8 app in other side. Something looking as this

Xbox-One-Snap-Mode-620.jpg


Not all W8 apps will work on Xbox1.



This is the chart

146a.jpg


That 2012 data shows that dual and quad-cores make for >90% of the market, whereas octo-cores make for 1%. That data coincides very well with last Steam stats, except that last Steam statistics give 0.3% for octo-cores.

Those numbers are the reason why Kaveri comes as dual and quad-core and there is no FX octo-core Steamroller.
 

those numbers are not the reason kaveri is dual module. apus are not aimed at high-perf. high-end. they're chiefly for laptops and entry level desktops. that's why they incorporate cost-cutting measures like lack of L3 cache, less number of modules, dual channel memory controller, etc. price point, die-area, fab. process and kaveri's architecture are the reasons kaveri is dual module.
 

+1
 
Status
Not open for further replies.