PC Doldrums: Quarterly Shipments Hit Lowest Levels Since 2007


InvalidError

Titan
Moderator

Let's play a different game: what cache tier analogue is actually missing from GPUs compared to CPUs?
- L3 in CPUs traditionally sits between the memory controllers and everything else, which is exactly where L2 sits in GPUs; adding an L3 there would be redundant, since a cache already occupies that spot.
- L2 in CPUs sits at the boundary between the uncore and the core(s) it serves, which is where L1 in GPUs is.
- L1 in CPUs has always been dedicated to individual cores, which GPUs currently don't have: the task of getting data where it needs to be before thread batches execute is delegated to the thread scheduler and the massive data crossbars tying everything within the SM/CU together (see the sketch after this list).
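
To make that last point concrete, here is a minimal CUDA sketch (the kernel name, tile size, and scale factor are illustrative, not from the discussion): data is explicitly staged into per-SM shared memory before the thread batch computes on it, which is the software-managed stand-in for a per-core L1.

```
// Minimal sketch: per-SM shared memory as software-managed local storage,
// staged explicitly before the block computes, rather than a hardware-managed
// per-core L1 as on CPUs. TILE is an arbitrary illustrative size.
#include <cstdio>
#include <cuda_runtime.h>

#define TILE 256

__global__ void scale(const float *in, float *out, float factor, int n)
{
    __shared__ float tile[TILE];      // per-SM scratchpad ("L1"-level storage)
    int i = blockIdx.x * blockDim.x + threadIdx.x;

    if (i < n)
        tile[threadIdx.x] = in[i];    // stage from global memory (through L2)
    __syncthreads();                  // whole batch waits until data is in place

    if (i < n)
        out[i] = tile[threadIdx.x] * factor;
}

int main()
{
    const int n = 1024;
    float *in, *out;
    cudaMallocManaged(&in, n * sizeof(float));
    cudaMallocManaged(&out, n * sizeof(float));
    for (int i = 0; i < n; i++) in[i] = (float)i;

    scale<<<n / TILE, TILE>>>(in, out, 2.0f, n);
    cudaDeviceSynchronize();

    printf("out[10] = %f\n", out[10]); // expect 20.0
    cudaFree(in);
    cudaFree(out);
    return 0;
}
```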

If GPUs had to operate at 4 GHz, those crossbars and associated resources would need to get much smaller and more local to beat the clock.
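
For anyone who wants to see these tiers on real hardware, a short CUDA runtime query (a sketch; device index 0 is assumed) reports the L2 size behind the memory controllers, the per-SM shared memory that plays the L1 role, and the clock those crossbars have to beat:

```
// Query the standard cudaDeviceProp fields that map onto the tiers above.
#include <cstdio>
#include <cuda_runtime.h>

int main()
{
    cudaDeviceProp prop;
    cudaGetDeviceProperties(&prop, 0);

    printf("Device:            %s\n", prop.name);
    printf("L2 cache:          %d bytes\n", prop.l2CacheSize);
    printf("Shared mem per SM: %zu bytes\n", prop.sharedMemPerMultiprocessor);
    printf("Core clock:        %d kHz\n", prop.clockRate);
    return 0;
}
```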
 

bit_user

Polypheme
Ambassador
I already stated my case. I have nothing more to add.

If you require further insight, check out the above CUDA Programming Guide or AMD's OpenCL optimization guide from 2012:

http://developer.amd.com/amd-accelerated-parallel-processing-app-sdk/opencl-optimization-guide/
(the fonts now seem a bit off, particularly some of the headings)
 