ALLEGEDLY done, in one experiment, that was never reproduced by anyone, let alone the people who actually RAN it the first time.
or we can let gamerk316 tell us about how a computer program cannot use more than 4 cores because programmers don't understand how it can be done, yet.
A+B=C;
C+D=E;
E+F=G;
Guess what? I can't make that parallel, ever.
Secondly, programs typically do make use of all processor assets, but most of those threads don't do any meaningful work, so they don't show up in Task Manager. You still end up with 2-3 threads that do the majority of the work, and all the smaller tasks get offloaded to other cores. So you get the typical two cores >50%, and the rest hovering well below that. You are NEVER going to see equal work across cores, except in very rare cases where the algorithm in question scales naturally.
OpenMP will help optimize the low hanging fruit (FOR loops being a prime example) but you won't see any significant change in core usage going forward. Though I'm sure the next generation of consoles will be blamed for that too...