Most of the resources on one GPU are duplicated on the other, which means putting two 4GB GPUs together still only gives you 4GB effective.
Accessing the other GPU's RAM would not be very effective anyway: HBM and 384+bits wide GDDR5 are capable of 300+GB/s while the PCIe 3.0 x16 interface is only capable of 16GB/s. The other GPU would only have access to 1/16th of the other's memory bandwidth, 1/32 of the bandwidth if you are doing CF with x8x8 PCIe. You would be better off using a single GPU and letting it access system RAM at 16GB/s since DDR4-2133 can do 34GB/s.
The memory interface may be slightly different but it changes nothing to off-GPU bottlenecks. Some other solution like NVLink is required if you want a high bandwidth...