[citation][nom]belardo[/nom]Title is wrong... Oh well.I'm an AMDer, in general... even selling some AMD systems last year. But when Bulldozer came out... I've gone intel... I've waited, and waited.. and was disappointed. First of all, Sandy Bridge is about 5~50% faster than anything AMD has. Ivy Bridge is about 10% faster than Sandy Bridge.AMD requires almost twice the power over intel... more heat. By all means, Lliano is great in its class - but is becoming over-priced. AMD is NOT #1. They haven't since Core2 hit the market... and intel screwed AMD with illegal business practices, which did hurt AMD's R&D abilities.AMD is not competing well against intel, much less themselves. When their "8 core" CPU has trouble surpassing their own 4-6core CPUs from a year ago... its a PROBLEM. AMD did the most retarded thing, they made their own version of the Pentium 4/netburst... and seeing so many people LEAVE AMD shows the screw up. We will, someday find out WHO came up with the crappy design.The FX chips are marketed as 4/6/8 cores... but their design is Hyper-Threaded (netburst) so they are really nothing more than 2/3/4 core CPUS. They should be sold as such. The FX8150 should be a $150, at the most... as its still (STILL) slower than the i5-2400k which sells at about $140~150. The fx8000 series should be priced well below $200... not $260.Someone said that the older X6 AMD960T was $200 cheaper than intel... ? HUH>I can walk into a store in Dallas and pick up the i5-2500k and motherboard for $240. the 960T with a typical board is $200~220. Also the 960T is the first X6 from AMD pretty much... the i5-2400 will bitch slap it anyday... and still run much much cooler.[/citation]
Sandy is only ~50% faster clock-for-clock than FX, but it is closer to 40% faster clock-for-clock than Phenom II. Ivy will be more like 50-60% faster clock-for-clock than Phenom II, but Intel isn't THAT far ahead of Phenom II just yet. Llano is only good if you compare it against other systems that lack a graphics card. A8-3870K is something like $130 or $140 and a Pentium G620 plus Radeon 6670 are about $130. Guess what? The Pentium system has about 50% higher frame rates in most games. Even AMD CPU plus discrete video card solutions are better because you can get an Athlon and a Radeon instead of a Llano and save money. Llano is like an $70 CPU plus a $40 sold for a higher price than the two halves are worth together. Performance wise, it's basically a lowly clocked Athlon II plus a Radeon 5550 (A8s only, all A6s and A4s are even slower). The 6670 is far faster and a standalone Athlon II has higher clock frequencies for the money, thus higher performance or similar performance for less money.
Intel has no such quad core CPUs this low end except for the cheapest Core 2 Quads from several years ago and the Nehalem i3s (dual core with Hyper-Threading, but similar quad threaded performance nonetheless), so technically, AMD wins here for performance, but that's just because Intel is neglecting the sub $100 CPU market. The cheapest Intel CPUs truly worth buying are the i3s. The cheapest low end CPU (and really, the only ones anyway) worth buying is the Celeron G530 at $42 or so for a dual core, 2.4GHz part that beats out the Semprons in the same price range. Besides it, AMD pretty much wins until FX and Phenom II versus the i3s.
Bulldozer actually isn't a bad architecture. Most of it's problems are it's memory/cache interface's huge latencies and it's poor design methods (BD is a purely computer-designed chip whereas all other CPUs are partially or purely designed by hand). Fixing those two problems alone should bring FX up to Nehalem like performance per clock at between Nehalem and Sandy power efficiency. Considering FX's huge clock frequency potential, it could fight it out with Sandy at least in performance with a much improved IPC and slightly higher clock frequencies. Wouldn't yet match Sandy efficiency, but if it were priced right, it would actually be a massive step in the right direction.
If AMD makes some architectural fixes too then it may be able to compete with Ivy. Basically, Piledriver and it's successors have great potential so long as AMD doesn't have more serious screw-ups. I'm worried about that because that seems to be something that AMD's CPU teams managed to consistently do over the past few generations. Of the last five or six years, AMD has had three architectures, but two out of the three had significant problems. I think that FX is having new architectural growth pains rather than being a complete flop like Netburst was. Look at Phenom, it gave way to Phenom II, a design good enough that we still see it in common usage. Phenom II is like a more power efficient Core 2. Similar IPC, but more efficient.
For highly threaded work, the FX-8150 runs circles around all i5s. In gaming it gets ran around by i5s and sometimes even i3s, but not in highly threaded productivity. Hyper-Threading is NOT affiliated with Netburst (most Netburst processors did NOT have it), some of the Netburst processors were simply the first to have it. Also, no, Bulldozer is not like Hyper-Threading. Hyper-Threading takes a single, fast core and uses it's left over resources in another thread that runs on the core when it's main thread is not running on those resources. Bulldozer has two cores per module that share some resources, but not the ALUs and such that are shared in Hyper-Threading. The two technologies are actually very different. Look at the results too. We have the Sandy Bridge i3s that have two very fast cores. Hyper-Threading usually gets you another ~30% more performance by using otherwise unused resources. Bulldozer has two slow cores.
Here is some oversimplified math that helps to explain it. Lets say we have an i3 at 3.3GHz (I think the 2120 runs at 3.3GHz) and the FX-4100 at 3.6GHz and the Fx has it's turbo disabled (i3s don't have turbo anyway). The i3 has two 3.3GHz cores and has about 50% more IPC than the FX and Hyper-Threading improves highly threaded performance by about 30%. 3.3*2 (two cures)*1.3(hyper-threading)*1.5(IPC difference between Sandy and Bulldozer)=~12.87
FX is 3.6*4=~14.4 Oversimplified, but shows the difference in performance quite clearly. We see the FX-4100 outperforming the i3 in theoretical highly threaded performance by almost 12%. In reality, the difference is fairly close, but often a little less.
Here, we see that Hyper-Threading increases performance significantly, but only by about 30% (cited from many sources across the internet, look it up if you want proof), but the four cores of the FX scale much more linearly because they are full cores. Basically, Hyper-Threading duplicates some resources whilst Bulldozer duplicates cores. They are closer to polar opposites than being similar.
One thread on the i3 runs at more or less full speed on the core, two does the same. The same is true for the FX, so it has a huge disadvantage here because two cores of it and two of the i3, the i3's cores are FAR faster. However, Hyper-Threading and more cores after the first two have somewhat different effects on performance. The third and forth thread on the i3 only run at around 30% the performance of the first two because the first two get most of the CPU's per core resources. The FX, however, nearly doubles in performance because it has four full cores to use instead of two that have two threads that share each core. The FX scales much better, but the i3 has much higher single and dual threaded performance while having similar quad threaded performance. With four threads performance is similar, but with a mere two threads, the performance difference is huge.
Obviously not similar technologies. Hyper-threading duplicates certain per-core resources to have two threads share a single core, Bulldozer's modules share non-core resources between two more or less independent cores. Intel's solution favors lightly threaded performance whilst AMD's solution favors highly threaded performance. Honestly, I prefer Intel's, but AMD's obviously works too.
The problem with FX's high core CPUs is that their performance scales very well for highly threaded work, but hardly at all for lightly threaded work. The FX-4100 is almost identical to the FX-8150 for gaming, but the 8150 is almost twice as fast for highly threaded work. So, if AMD prices the 8150 at $150 and the 4100 at $110, well for a mere 27% price increase, you get nearly double the highly threaded performance. Sure, consumers would love that, but that leaves AMD selling CPUs hugely undervalued for their highly threaded performance just because their lightly threaded performance is negligibly higher than the much cheaper models that have far fewer cores. For the money, the FX CPUs have more highly threaded performance than Intel already. We have the 8120 under $200 despite it only being right behind a $300 i7 in highly threaded performance and the 8150 somewhat more expensive.
The 8150 is stupid for anyone who overclocks because despite it's higher price and clock frequency than the 8120, it isn't really even higher binned than the 8120. The two overclock to the same frequencies at the same voltages, using the same amount of power. The 8150 is only for anyone not willing to touch BIOS settings to get a minor performance boost. Intel actually bins their chips, but AMD's binning is more about finding defects rather than performance at a given voltage. For example, the i5-2500K uses less power than the i5-2400 if you put then both at the minimum stable voltage for the same clock frequency because it has a slightly higher binning. The i7-2600K also overclocks slightly better than the i5-2500K and the 2700K a little better than the 2600K. Phenom IIs, Athlon IIs, Semprons, and FXs (if not going even further back) tend to overclock equally well to their higher clocked brothers.
The FX CPUs only have trouble beating Phenom IIs in situations where clock-for-clock performance matters. For example, the FX-4xxx CPUs are beaten by Phenom II x4s, but the Phenom II x6s are beaten by the FX-8xxx CPUs in highly threaded performance. Windows 8 is supposed to give FX a decent performance boost too, so lets watch Windows 8 performance numbers closely to get a more whole performance picture to analyze.