When I've taught computer classes, I've always enjoyed using this admittedly crude analogy:
Visualize each CPU core as a radio frequency transmitter.
Each core can now change one 64-bit register at a rate of 4 Billion cycles per second:
Thus, 1 core @ 4 GHz x 64 bits per cycle = 256 Gigabits per second / 8 = 32 GB/sec
4 cores @ 32 GB/second = 128 GB/second issuing from one 4-core CPU
How many PCIe 3.0 lanes are needed to accommodate that data rate?
Answer: 130 lanes @ 8 GHz / 8.125 bits per byte
Now, add hyper-threading.
Also, upgrade to an 8-core CPU.
Modern CPUs are capable of easily overwhelming
downstream chipset capacity.
Visualize each CPU core as a radio frequency transmitter.
Each core can now change one 64-bit register at a rate of 4 Billion cycles per second:
Thus, 1 core @ 4 GHz x 64 bits per cycle = 256 Gigabits per second / 8 = 32 GB/sec
4 cores @ 32 GB/second = 128 GB/second issuing from one 4-core CPU
How many PCIe 3.0 lanes are needed to accommodate that data rate?
Answer: 130 lanes @ 8 GHz / 8.125 bits per byte
Now, add hyper-threading.
Also, upgrade to an 8-core CPU.
Modern CPUs are capable of easily overwhelming
downstream chipset capacity.