AMD states K8L aka Barcelona faster than all Intel cores

Page 5 - Seeking answers? Join the Tom's Hardware community: where nearly two million members share solutions and discuss the latest tech.
I wonder if Socket F's Register DDR2 memory controller is also backwards compatible with unregistered DDR2 memory. Otherwise they would have to design a separate chip which isn't very convenient. Opteron 1xxx which use unregistered DDR2 are use AM2 not Socket F so they don't really have an example of that yet.

As far as I see it (no pun intended!), there would be no probs for both AMD's sockets (AM2 & F) regarding unregistered DIMMs, since AMD & mainboard manufacturers could use terminators for the extra [registered] DIMMs & Socket's pins/pads traces (is this correct?); if true, the issue would rely on the mainboard's HT traces, hence, on the very compatibility when using registered DIMMs. This would lead, compulsorily I think, to two different mainboard designs for the same socket, one with & one without terminators. Would it have to imply a chip redesign, in any case? (assuming the IMC could support both DIMM types & no chip's pinout diff.).

Edit: I'm taking into account BM's words, when he addresses socket AM2 & a "crippled" 1 pin less ES chip, for compatibility's sake.


Cheers!


I meant that 1207 would become 1206. What's the difference between Opteron and FX?

1 pin.

939 vs. 940. That allows for the assumption that ECC only needs one extra pin. AMD is supposedly building a dual type controller into Rev H which will run EITHER DDR2 or DDR3.
 
I meant that 1207 would become 1206. What's the difference between Opteron and FX?

1 pin.

939 vs. 940. That allows for the assumption that ECC only needs one extra pin. AMD is supposedly building a dual type controller into Rev H which will run EITHER DDR2 or DDR3.

Correct. And, two sockets/boards, as well. That, was my point.


Cheers!
 
You have a X6800? Lucky. Just for interest did you get a P965, i975X or other chipset? People say the P965 is better for overclocking, but the i975X gives better performance at a given clock speed.

In any case, I may have worded my response too strongly so hopefully I wasn't taken the wrong way. From your describtion I gather you were testing for decreases in performance scaling as you increased clock speed via multiplier at a given FSB speed. The linear scaling even on a paltry 400Mhz FSB is pleasantly surprising. I guess we have the large 4MB L2 cache and the superior prefetchers to thank for that since most of the FSB traffic is just pre-filling the L2 caches, something that is rather orderly and planned by the prefetches, rather than the FSB traffic being consumed by supplying information to satisfy pipeline stalls.

I was wondering if you could run some tests with the following configurations: 2.67GHz (133x20), 2.67GHz (167x16), 2.6GHz (200x13), 2.67GHz (267x10), and 2.67GHz (333*8) with as standard a memory speed as possible. I'm interested in this sort of configuration since it'll take into account the latency benefits of a faster FSB in addition to the bandwidth benefits. Personally, I don't think there will be much difference between a 1333MHz and 1067MHz FSB, but I'm thinking there may be a more noticable difference between 1067MHz and 800MHz and performance will drop off from there. Obviously, if you don't have time or aren't interested that is fine.

Your efforts are much appreciated and I'd certainly like to more of the linear scaling data that you've already obtained. Maybe you could make another chart like the one with SuperPi scaling?

Yep -- I originally slated myself up for an E6600, but then cancelled the order and put in for the X6800 for this experiment. People have berated me for the costs :) but I am a fanatical more or less when I have a nagging question that I want to answer.

You are correct in the gist of the experiment and the reason I needed an X6800. Most OC reviews are overclocking via the system clock, this scales the FSB with the CPU speed and gives no indication if the FSB will choke the processor.

There are a few ways to get to this answer, hold the system clock fixed and drive the processor higher using multipliers. I have completed the 100 MHz system clock (400 MHz FSB) run using Cinebench 9.5, superpi at 1,2,4 and 8 M runs, Quake 4 1.2 with and without SMT, FEAR, Winrar, Everest memory, PCMark full, CPU, and memory suites. I am finishing up repeating some 200 MHz runs to verify the numbers.

Just quickly.

All apps are scaling linearly except winrar. Winrar is tracking 1:1 with measured memory bandwidth. This was expected based on how I am using it as a bench --- I am not crunching files, I am using it's random memory bench test. Using Perfmon it would appear the 4 meg cache is putting demands on the FSB of 5-25%, much less that I was expecting.

I am using a 975x MB (Asus P5W DH) initially I installed bios 1305, but ran into issues from a cold boot using a multiplier of 20, I then went to 1105 and was able to get data but ran into similar issues. I finally figured out what was going on so have now installed 1305 again and am repeating a few runs to make a consistent data set on 1305. Bios for most boards is still buggy.

Now, here is what is really fun about this chip... at 400 MHz FSB here are the idle full load temperatures ambient was 22 deg C, core temperatures measured with Core Temp.

Multiplier/Speed/Idle/Fullload
6/600/31/31
8/800/31/31
10/1000/31/32
12/1200/31/32
14/1400/31/32
16/1600/32/33
18/1800/32/34
20/2000/32/34

Phenomenal.

I plan to do a super post with a complete write up of all the data. I have collected screen dumps, super PI CRC numbers, and uploaded all my PC mark data to ORB so the data can be verified.

Sadly, I have been unable to really play with my new system because I have stayed at a clean install of windows and have only installed drivers, benchmark apps, and temperature monitors. Unfortunately, I was using Perfmon to look at cache miss rates, page reads/writes to work bandwidth. But Intel's Vtune software will measure FSB bandwidth directly, however when I installed it it changed window's system files to the point where now when the system gets to the welcome screen it reboots. So I need to rebuild the drive this weekend.

Finally, yes I will run your suggested experiments as soon as I can :) ...

Jack


Can you really load the system with generic apps and then do the games benchmarks. I don't like clean machine benchmarks. No one defrags and reboots before playing games or starting Cinebench.

No flames please.
 
You make an excellent point. The C2D's individual core performance is what is powering the CPUs success. Were Intel to market a single core varient, it would outclass, by no small margin, any of AMDs A64 single core CPUs.


The last tests Anand ran showed an 85% increase going from 1 core to 2.

http://www.anandtech.com/video/showdoc.aspx?i=2828&p=3

That means if two cores get 45 fps one core will get less than 22fps.
 
Can you really load the system with generic apps and then do the games benchmarks. I don't like clean machine benchmarks. No one defrags and reboots before playing games or starting Cinebench.

No flames please.

Not flaming but can I ask why not? And who did you check this with?
 
I think what no one is realising is that the shrink to 65mn not only means less power, but higher clock speeds. Imagine if Conroe was on 90nm. It would be as hot as a Death-scott, maybe.
Basically, it's competition. Whether or not the marketing is complete BS (it has to have at least some shred of truth), we get to save money! More money saved is more money on computers, or less important things, like, mortgages, food, etc.

P.S. There are edit buttons, people. Stop posting 3 times in a row (I'm looking at you, DaSickNinja, and at the first page with my lazy eye that I don't have)

EDIT: Lookie! I edited my post! Wow!
 
Reply to BM:

No one defrags and reboots before playing games

ROFL.

You are not a hardcore gamer. You don't deserve a powerful system.

You're right. I don't care if I play at 2048x1536. My 19" LCD at 1280 is lovely. The 7800 GT powers Q4 and D3 ( my favorite games) at UHQ with all options on. I do care if my HDD runs for a minute after quitting a game though which is why I have 4GB RAM ( well at least when I reinstall X64).
 
Can you really load the system with generic apps and then do the games benchmarks. I don't like clean machine benchmarks. No one defrags and reboots before playing games or starting Cinebench.

No flames please.

Not flaming but can I ask why not? And who did you check this with?

Do YOU reboot and defrag before you play a game?
 
On my gaming rig? HELL YES. I defrag weekly, and reboot when I change games.
On my general purpose rig? No, I turn my chair 90 degrees instead. To face my gaming rig.
 
ok when u release something a year after somthing else, if its not better, every and engineer should piss themselves rite now

intel quad core is coming out end of nxt month, while k8l q3 2007 think about it, by that time intel would hav come out with something better as well

why they bs with 4x4 instead of jumping onto k8l early beats me

and i dont defrag weekly, but i reboot when changing games or been doing a lot of stuf before i go into one
 
ok when u release something a year after somthing else, if its not better, every and engineer should piss themselves rite now

intel quad core is coming out end of nxt month, while k8l q3 2007 think about it, by that time intel would hav come out with something better as well

why they bs with 4x4 instead of jumping onto k8l early beats me

and i dont defrag weekly, but i reboot when changing games or been doing a lot of stuf before i go into one

Me again, this thread also appears to have lost focus. Well it really didn't have anything to do with what the title implied but that’s neither here nor there. I think it's time to give it a rest, because frankly you can't have a serious discussion in the midst of a flame war. For the forums sake just walk away, please.
 
ryokinshin said:
why they bs with 4x4 instead of jumping onto k8l early beats me

AMD was caught by suprise, design a CPU take minimum 2 years to production. The fastest way is put second socket to mobo, and claim it 4x4. AMD is buying time to get people not to abandon ship. Next year, we will see strong competitions and a price war of CPU. I think Intel pull their sh*t together now.
 
You make an excellent point. The C2D's individual core performance is what is powering the CPUs success. Were Intel to market a single core varient, it would outclass, by no small margin, any of AMDs A64 single core CPUs.


The last tests Anand ran showed an 85% increase going from 1 core to 2.

http://www.anandtech.com/video/showdoc.aspx?i=2828&p=3

That means if two cores get 45 fps one core will get less than 22fps.

Did you actually read the post you are replying to?

Your reply is pointless and has 0 (zero) bearing on the post subject
 
I just don't think the Barcelona core will be able to match Core 2's performance. Even if Barcelona had a higher IPC than Core2, I don't think it will match Core2's clockspeed. A 3.6Ghz Core2 will be hard to beat. Barcelona being a 4-core processor, it will be a large die. There's just no way it can scale with Core. Intel has a more mature 65nm process. Plus Intel's Core2 revision is coming out around the same time.

I hope AMD succeeds but the future doesn't look promising.
 
ok when u release something a year after somthing else, if its not better, every and engineer should piss themselves rite now

intel quad core is coming out end of nxt month, while k8l q3 2007 think about it, by that time intel would hav come out with something better as well

why they bs with 4x4 instead of jumping onto k8l early beats me

and i dont defrag weekly, but i reboot when changing games or been doing a lot of stuf before i go into one


You mean like Prescott and Presler?

PS. You're psychotic. Windows isn't that bad; fragmentation only occurs when you add and remove files a lot. If your system stays static, you won't need to defrag - that includes installs.

If you need a link for that you are the noob.
 
I just don't think the Barcelona core will be able to match Core 2's performance. Even if Barcelona had a higher IPC than Core2, I don't think it will match Core2's clockspeed. A 3.6Ghz Core2 will be hard to beat. Barcelona being a 4-core processor, it will be a large die. There's just no way it can scale with Core. Intel has a more mature 65nm process. Plus Intel's Core2 revision is coming out around the same time.

I hope AMD succeeds but the future doesn't look promising.


Look at it this way. Core 2 is 70% faster than P4. K8 is 30% faster (on average) than P4. If Barcelona gets 60-80% faster than K8 it is 20-40% faster than Core 2.

Why?
I can add.

That of course also depends on whether AMD meant overall, per core or per socket when they quoted ~80% faster.



P.S.
AMD has already succeeded. They have cracked the Top 50 supercomputers and are currently the standard for 4P price\perf.
 
Apples and Oranges.

Barcelona is quad core, Core 2 DUO is dual core.

Compare Barcelona to Kentsfield, or whatever quad core Intel has around H2 07.
 
You're psychotic. Windows isn't that bad; fragmentation only occurs when you add and remove files a lot. If your system stays static, you won't need to defrag - that includes installs.

The dictionary definition of static is quite clear
Refers to something that is fixed and unchanging. Contrast with dynamic.
The nature of computing is dynamic not static. Not by a longshot.
According to what you say, if your system stays static then its the virtual equivilent of not using it at all. The use of files or the deletion of files automatically causes fragmentaion. Are you trying to say that you don't delete anything off your computer?
A better assumption would be that you don't use any of your superpowered design computers.

Please feel free to clearify this subject.
 
You make an excellent point. The C2D's individual core performance is what is powering the CPUs success. Were Intel to market a single core varient, it would outclass, by no small margin, any of AMDs A64 single core CPUs.


The last tests Anand ran showed an 85% increase going from 1 core to 2.

http://www.anandtech.com/video/showdoc.aspx?i=2828&p=3

That means if two cores get 45 fps one core will get less than 22fps.

Did you actually read the post you are replying to?

Your reply is pointless and has 0 (zero) bearing on the post subject

Yeah right dufus. He said Core2 single core would be faster than Athlon single core. Unless X2 is 85% faster than single, Conroe-L as it's called will have no chance except against Sempron.
 
It's nice to see that AMD seems ready to take on Intel again allready when the Quadcores (AKA Barcelona and Kentsfield) will meet.

I personally especially like the idea of the processor not using more power than needed, which might be the weak point in Intels armor, especially when the whole low power platform is developed(One of the reasons ATI will be AMD).

The only sad fact is we'll have to wait a while before Barcelona comes :? But that might just mean that Intel will have something to respond with(=Competition, which is good for us all)