gamerk316 :
I will remark once again that the reason why AMD doesn't make a large APU today (e.g. 600 mm² size) is due to the DDR bottleneck. It is the same reason why the low graphics cards use DDR memory but the top cards use GDDR5 memory.
Very low end GPUs use
GDDR3, not DDR. Two totally different technologies.
https://en.wikipedia.org/wiki/AMD_Radeon_Rx_300_series#Desktop_products
https://en.wikipedia.org/wiki/AMD_Radeon_Rx_300_series#Mobile_products
Lowest ones use DDR3, low-middle uses GDDR5 and top uses HBM.
gamerk316 :
Secondly, the memory bottleneck is still going to exist. All AMD is doing is putting HBM on die, but the main system memory is still going to be DDR4. Meaning that you are still bound by DDR transfer rates
No. Current mobile/desktop/server APUs
only have DDR3 memory and the iGPU is bandwidth bottlenecked. For comparison, the GPU on Fury X card has access to 512GB/s of bandwidth provided by HBM stacks, whereas the iGPU on top Kaveri APU (dual-channel DDR3 @2133MHz) only can access to 34GB/s, which are shared with the iCPU. That is the reason why Kaveri APU is limited to 8 CUs: the slow DDR3 memory [1].
Check the diagram given above. The iGPU on the high-performance APU announced by AMD has access to HBM memory stacks with 512GB/s of bandwidth. Evidently the DDR-bottleneck of current iGPUs is eliminated and engineers can add lots of CUs up to get >4TFLOPS.
Of course, the main system memory of the APU will be DDR4. But this is exactly the same situation that with a dCPU+dGPU configuration. The Fury card has a private memory poll of HBM but the main system memory continues to be DDR3 or DDR4 on mobo.
As I already explained multiple times, the APU concept is better, because the iGPU can access directly to the main memory system via hUMA links, whereas the dGPU has to access to the main memory system
indirectly via a slow PCIe interconnect, and wasting CPU cycles on synchronizing accesses (the dGPU is a second class device and works in slave mode).
[1] In fact, AMD had planned a superior Kaveri model, with 6CPU cores and a larger number of CUs; that superior performance APU was served by GDDR5 memory. It was canceled because one of the providers of the SO-DIMM modules was out of business. The Kaveri die still has a GDDR5 memory controller.