gamerk316
Glorious
blackkstar :
Oh, the assumptions. AMD says that the traditional GPGPU configuration requires copying memory from system RAM to VRAM. Then AMD releases slides showing GPU can access system memory via pointers.
Technically, dGPUs can access system memory, just like any external device. Theres no architectural reason why they can't. The main reason to send over a copy of the data to the GPU is to avoid the really slow process of sending a ton of data over the PCI-E bus. Same reason why CPUs have three levels of cache, instead of just reading everything out of main memory every time it needs it.
VRAM, like CPU Cache, is an attempt to hide really slow memory transfers.
You know, latency. There is a huge concern for doing tasks the AMD HSA guide says is not for latency intensive workloads being overly affected by latency and I don't really understand it.
If the added latency plus the decreased time to perform a task is faster than one with lower latency and longer time to perform the task, it's a win.
Would you rather have 100ms of latency and a second to finish a task or 1ms of latency and 4 seconds to finish a task?
Depends on the workload in question. There are times where you accept longer execution time for lower latency.
Simple example: FPS versus Frame Times. The i3 can still spit out 60 FPS in games, so you can obviously game with it, right? Oh right, the frame spikes. That's your latency. But you still meet your 60 FPS per second average, even if the individual frame graphs look like crap. So in this case, I'd easily take 45 FPS and lower latency over 60 FPS and latency that looks like a roller coaster.