gamerk316 :
No its not "we can make everything parallel, if we try"... the problem is that it was not really tried.
Yes it has; its been tried since the 70's, with the same exact results every single time: Most tasks DO NOT SCALE.
Soft devs are right, some things don't scale and that is it. But that is why something like HTM (hasfail has it), automatic vectorization and speculative multithreading, might very well appear... for the things that does scale... and those are invariably the most important jobs a computer can have, the rest is nonsense. Else we wont have "more core chips" for the desktop/client, it just doesn't make sense, so the non scaling software will dictate the non-scaling hardware, and all of the sudden all this discussions lose interest. Lets hope the paradigma tries harder with better tools (perhaps like HSA), lets hope a dead end will not happen.
Taking the case of speculative multithreading and HTM, both have MAJOR performance downsides in the worst case outcome. You have to do a performance analysis against your system to determine if you will gain an overall benefit using these techniques. For example, HTM, if its found that another thread has modified the data you have processed, the thread has to dump its results, put a traditional lock in place, and begin the computation a second time. Thats over 2x the performance loss of using standard software locks. If your fail rate is high enough, you can easily tank performance. In large databases, where you have dozens of CPU cores, and a lock can kill performance, HTM makes sense. For a consumer PC? Not so much.
As for automatic vectorization: Its a good performance enhancement, but not one thats going to appear on benchmarks. Algorithmic performance tends to drive application performance, and if the algorithm itself is serial, there isn't much you can do to improve performance.
No Its the same thing of automatic vectorization, its only different in your mind... like the "force" lol
But yes you are right, some form of performance analyses have to be done, but remember that even BDver1 has hardware profiling mechanisms with a few instructions of ISA extansion, and
the purpose of speculative multithreading is being "transparent" for the developers, they only have to insert the possible break points for profiling according to an analyses, which could be done kind of automatically by the compiler toolchains, and continue more or less designing the code for low thread count.
The speculative multithreading apparatus will decide to split the sequential code "ON-THE-FLY" according to the profiles... it could tremendously accelerate things, most logical it will be done by the data graphs analyses, and in there is where the HTM apparatus can enter to help, since it will be speculative threads as transactions by teh data flow grahs... or thread level data speculation...
OF course HTM code design from origin to be "transactional" (like DB engines) will be better, but it will also be extensivelly MT, it will be for HPC/server... a totally different paradigma and approach, not invalidating HTM can help in spMT.
In this case there is no penalty... neither a hardware bloat explosion, since like HSA the compiler is JIT, and there is a very advanced "runtime" to decide what to split what not and when... the code can even be single-thread.
For this a "dataflow" kind of uarch like advanced by a paper of The University of Washington can be the best approach, more so because GPU are already half way to a full blown dataflow paradigma, and spMT with HTM can also go for GPUs, for the compute jobs not kernel based, even for game physics.
So if there is an approach in model of CPU and GPU, a "dataflow" approach, having GPU ALUs and CPU ALUs very close tied together is not so complicated.
http://wavescalar.cs.washington.edu/wavescalar.pdf
That is why i suspected, the "excavator" exposé can be a 1th April joke, but the inventor knows what he is talking about
http://diybbs.zol.com.cn/11/11_106489.html (it could be truth... but exposed only in china ? .. why not ?.. its an exploding market, and the west is pretty much intel turf on minds and hearts )