You'll have to show me when that changed from theory to silicon, as well you'll have to show me a x86 machine that can do overlapping loop execution, without hardlocking.
The relevance, is that x86 machines
can do overlapping loop execution in HW, thanks to Tomasulo's algorithm.
Well you are going to have to point me to the compiler white sheet that says that. I am looking as well and I keep comming up with Itanium only support.
Finally i see where we misunderstand each other.
Sorry, but you can quit your search, as you won't find compiler white sheet referring to dynamic scheduling in OOO CPUs.

In fact, EPIC's Rotatating Registers for loop unrolling are a "poor man's version" of Register Renaming in Tomasulo's Algorithm.
With Register Renaming, the CPU has a larger set of registers which are used with Tomasulo's algorithm for dynamic scheduling, in a way that is completely invisible to the compiler and the programmer.
For example, the K7 has a 88 physical FP registers, while the instruction set of course supports only 8; however the FP scheduler has 36 entries, which means that up to 36 instructions can be "in-flight" each with its own registers. (the loop unrolling happens inside the reorder buffers of the CPU, with OOO)
Basically, the difference with EPIC is that in this case you have the compiler who is doing the loop unrolling and explicitly tells the CPU to "rotate" the register file, to resolve naming conflict among registers, while an OOO CPU does this completely in HW internally, without the compiler knowing anything about what's going on.
Again I can provide whitepapers in the defence of my position, you will have to do the same thing otherwise your just another person trying to discredit me and frankly your not doing a very good job at this point.
Gee. :roll:
I was not trying to discredit you, i was just trying to discuss with you.
If you were seeing this in terms of idiotic "ownage", well i feel like i just wasted a lot of time.
Anyway, i previously posted another link (not that one of Ace's HW, which you didnt understand anyway) which explained even with an example how loop unrolling works on OOO CPU.
What "white papers" do you need?
And whether i did a good job or not, it's not yours to decide, but i'll leave it at the other forum members.
Sorry, but your posts are just repeatedly showing that you don't understand dynamic HW scheduling at all.