The fact that Intel has its own fabs that are so far ahead of the competition gives Intel the power(pun intended) to conquer the competition. It's not a very fair comparison when Intel will have their shiny, new, fresh out of the lot 22nm tri-gate up against last year's 28nm poly gates. If they don't even have HKMG, that's a nail in the coffin right there.
Out-of-order does seem like it would be a bad design decision for extreme low power designs. It reduces the dependancy on the memory and cache subsystem, but it increases power immensely for the core. It seems like it would be a better idea to enhance the memory handling of the ARM chips instead of going out-of-order. Though the better the algorithms get, the more efficient the design will be overall.
Hyperthreading really makes sense for low power in-order designs, at least to me. It allows for 'out-of-order', parallel in actuality, execution without all of the complexity=greater efficiency boost on in-order designs than out-of-order designs.
Graphics look like big, in-order, parallel processors. This is why they're so efficient compared to x86 processors. Get rid of all of that out-of-order complexity while keeping the instruction execution efficiency, and you have huge amounts of processing power. They don't reach as high on sustained flops thanks to being in-order, but they make up for it in raw processing power. In fact, Knight's Corner looks like a GPU with x86 instruction support.
Intel's execution is superb... I would really love to see them create a fresh architecture and move on with it... if only it could gain support.