[citation][nom]pyr8t[/nom]I would like to see more development along the lines of computing done with 2 processors, 1 serial-optimized, 1 parallel optimized. You will still need the serial-optimized CPU to provide guidance to the parallel processor in any case.[/citation]
If I understand what you're trying to say, AMD's actually pursuing something along those lines with their "Bulldozer" core. They've been a bit silent on the exact specifics of it, but it is known that it will pair Phenom II-based cores with stream processors similar to what're seen in the AMD Evergreen GPUs. Such a design would give a selection of both conventional CPU processing resources, along with a lot of power that could be used for more strongly parallel tasks. (i.e, tasks where the CPU's extensive complexity would be wasted)
[citation][nom]ALongerUsernameThatProbablyIsntTake[/nom]This 240 core claim is an outright lie, too. I was amazed when I heard the following at an academic conference: "How do you expect Larrabee to compete with nvidia's cards when it only has 32 cores compared with hundreds?"[/citation]
Well, the use of "cores" can be a bit hazy for a term, as the architectures can sometimes blur these distinctions. Normally, when it comes to parallelism there's a distinct hierarchy, from lowest to highest, in terms of solutions:
-Word length (bit-level parallelism)
-Vector (SIMD)
-Superscalar (multi-pipeline; instruction-level parallelism)
-"multi-core"
-SMP (multi-CPU)
The problem is that the definition of "multi-core" is often a bit arbitrary, as shown by the evolution of CPU designs. This solution can be achieved by simply using multiple CPU dies, or alternatively, through a design that removes many components (like the memory interface) that might otherwise be redundant. In the end, the line between "superscalar" designs and "multi-core" designs can be a bit blurred, to where some reference of "cores" can be at the very least forgiven. This would hold especially true in Larrabee's case, since its "cores" aren't exactly fully-fledged; a lot of the control circuitry only exists on the total chip level, that is further passed down into the individual elements that Intel refers to as "cores."
At any rate, that last question you quote still raises a valid point, that does have a valid answer; as the processing elements CAN be compared and contrasted, it's a fair statement. Of course, the counter-point is that the SIMD units in each "CUDA core" is only a total of 128 bits (4x32) wide, contrast to 512 bits for Larrabee. But in terms of design describing both GPUs' elements as "cores" isn't really more incorrect for one than the other.