SAM and equivalents aren't there to enable the CPU to use VRAM as extra memory, they are there to reduce or remove managerial overhead: with the CPU having flat address space access to the entire VRAM, it does not need to go through the extra overhead of mapping the correct VRAM region into host memory address space before every read/write to a different VRAM region, which saves 200+ns of latency on each operation where address mappings would otherwise need to be changed. It also means that threads don't have to compete for access to limited windows into VRAM address space either, which means fewer locks/mutexes between host CPU threads and GPU IO for greater concurrency and reduced overheads there too.You're right, I hadn't considered that. In this case, it only seems useful in cases you'd expect the system RAM to be saturated, as they can work in tandem.
It makes CPU-GPU information exchanges more efficient.