Nickiel12 :
What is an L2 and L3 cache? and what is the difference?
The L1, L2, and other levels of cache refer to different cache units. As a general rule, each level of cache is larger, but slower to access then the one before. The L1 cache is by far the fastest, but smallest of the three. As a comparison, reading/writing to the L2 cache is 10-20 times slower then it is accessing the L1.
The primary reasons for adding so many levels of cache on the CPU is really to just get around the fact memory access is slow. Even the slowest level cache for consumer CPUs, the L3 cache, is still about twice as fast as accessing main memory.
I don't know what the current typical access speeds are, but for the Pentium 4 architecture:
L1 cache: 1 clock cycle
L2 cache: ~20 clock cycles (20x slower then the L1)
Main Memory: ~80 clock cycles (4x slower then the L2, 80x slower then the L1)
HDD (Page Fault): ~100k clock cycles (you get the idea)
Which highlights why we use CPU cache; it's a lot faster then accessing main system memory. GPU VRAM operates pretty much the same way; it's local memory on the die that is a lot faster to access then main system memory.