This is a fair statement, though I thought POVray was FPU intesive and not SSE... nonetheless, the most and I emphasize the MOST we can say about all of this whoopla is that is appears likely that Barcey did not improve IPC for the code type used by POVray.... and that is all we can say, it is in no way indicative of how Barcey will fair in general.
However, I would argue that POVray is indeed K8 friendly as it is one of the few cases where Opty/Athlon K8's can keep up with Core uArch... see any TechReport comparision for an example:
http://www.techreport.com/reviews/2007q1/quad-core/index.x?pg=8
Rendering apps such as Cinebench and POVray make heavy use of FPU, and the K8 FP capabilities are still strong even compared to Core uArch.
Jack
I think it depends which compiled version of POVray 3.7 you are running. The benchmark you referenced here is using the 64-bit version, which as far as I can tell is either not using optimized instructions or is optimized for AMD64. I think the SSE2-optimized version of POVray 3.7 is faster, and in those benchmarks Intel is about 40-45% faster than K8 clock for clock.
Yeah, I don't diagree -- I wish people/sites would stop trying to make any comparisions from this data to Intel HW, it is simply not possible to draw any kind of meaningful conclusions.
However, considering that POVray.org has not released source code for any thing higher than version 3.6 (including no 64-bit source code), we can assume (how valid is debatable), that all systems are using the same compiled versions. While your point that recompiling with different compilers using platform specific options can significantly change the results, in the context of this one data point this is not applicable.
Why?
The real message to look into this POVray 'demo' that AMD sponsored is only meaningful when talking about Barcey compared to Opty's, as such a very vague statement can be made -- Barcey does not appear to have improved much on the FPU IPC efficiency.
Now, using your argument to try to explain away this result sorta insults AMD as they would need to intentionally alter the code (if the source code existed) to favor one way or the other and they did a very poor job of making Barcey look it's very best.
It is a moot point, however, because the real purpose of the demo was not to get performance absolutes but relative performance with a drop in upgrade --- which is a major thrust of AMD's strategy.... numbers and comparisions notwithstanding.
Jack
The source code for 3.7 beta is not published, but they do distribute it with several different executables that are compiled with optimizations for different processor types. See http://www.povray.org/beta/
I'm not trying to discredit AMD. It is entirely possible that the AMD Barcelona demo used a version of the executable that doesn't use any optimized instructions, which is a possible explanation why the raw numbers do not compare well to Intel POVRay benchmarks.
It is also a possible explanation why the IPC over K8 is not that much improved. For example, maybe if they had used the SSE2 optimized version there would be a better IPC difference since this is something that is supposed to be much improved in K10. But this would assume that the people giving this demo were incompetent and didn't chose a good demo to showcase their product.