Aspiring techie :
what if Intel created almost like a co-processor in place of the instruction scheduler. It would be like another core or a processor in a processor, but it wouldn't have the capabilities of another cpu core, instead it's purpose was to schedule instructions. It would be able to blast through sceduling with ease. I don't know if it's possible, but maybe Intel can do something like that and then add more parts to the cpu?
The instruction scheduler needs to tie into:
1- the L1 data and instruction caches
2- the instruction decoder
3- the reservation unit
4- the instruction trace cache
5- the reorder queue
6- the retirement units
7- likely more
Since the scheduler ties into every critical stage of the execution pipeline, it is physically impossible to separate it from the CPU. Putting it off-chip would add so much latency that it would kill the CPU performance outright. In a CPU operating at 4GHz, the scheduler has to reassess scheduling at the picosecond scale, far too fast for any sort of off-chip communications.
Even if you had a scheduler with infinite instruction look-ahead and infinite chip resources, the effective look-ahead would still be limited by data dependencies, conditional and indirect branches. This is a fundamental limitation of sequential software and there is very little left that hardware can do about it that has not already been done.