Bulldozer is very much a tradeoff between cost, ease of manufacture, ability to add more cores, and per-thread performance. More decoders, wider/deeper buffers, beefier FPUs, faster IMC, lower latency all lead to a chip that is larger, more complex, and more expensive to manufacture. You would end up with something much more similar to Sandy Bridge (using Sandy Bridge as it is a 32 nm chip like BD/PD) than to the current Bulldozer/Piledriver. Sandy Bridge performs very well per core but notice what Intel charges for them and that there are few SBs with more than 4 cores. Above that the total core area gets large and you also need a huge amount of cache to keep everything fed. AMD's 8 integer core die takes up 315 mm^2 while Intel's 8 integer core SB-E die takes up a whopping 435 mm^2. The complexity of the full speed ring bus with that many stops and that much cache leads to much lower speeds in the higher core count parts as well. The current Piledriver design is able to compete very well with Intel's parts in anything reasonably multithreaded despite being smaller, less-complex, and a less expensive. My hunch is that AMD will tweak a little bit here and there where it is relatively cheap and easy to do. They won't try to revamp the chip to directly compete thread for thread with Intel but rather entire platform vs. entire platform.