OneFXupdude
Distinguished
The Ampere CPU you are referring to has 80 cores, no SMT, though they are planning on a 128 core variant. That, however, is very much a server oriented processor, having more in common with AMD's EPYC or Ryzen Threadripper Pro platforms than any consumer chips. Impressive, yes, but Nvidia would have different plans (not that I am ruling out them coming up with their own server cpu). The CUDA idea you speak of is very interesting, though I think at much lower thread counts at this point in time. Die shrinks and further advancements in the architecture will dictate if it will if they could pull off those kind of thread counts.To me, Nvidia's intentions for Arm are fairly obvious. They want to be even bigger in supercomputers and the ARM architecture is very well suited for those because of its low power consumption. It becomes even more clear when you consider that Nvidia has said they are porting CUDA to run on the ARM instruction set. Then consider that ARM-based CPUs have been made with 80 cores and 4-way SMT. It seems to me what Nvidia wants to do is make a massively parallel MCM-based machine. They will go with at least 4-way SMT but think they will aim for 8 or 16-way. This would make one ARM CPU look like a streaming multiprocessor which are 64-way SMT currently. Then they will put 64 of these on chip module which will be good for 512 or 1024 threads. Then they will put 8 of those on one MCM and have 4096 or 8192 threads available on one module. This will be look like just another GPU to CUDA code. The biggest benefit is it won't be a co-processor so data transfers will be minimized. All together, this architecture will be astonishingly fast and will change the design of super computers for years to come.
That's where I think Nvidia is going with this and they want Arm as a design resource more than anything else.
That said, I would be very interested in seeing a Desktop ARM based solution from Nvidia. Windows having finally caught up with ARM support and offering a worthwhile version of 10 for ARM is nothing to sniff at (Windows has been the major driving force for desktop computing for decades no matter how much Apple or Linux fans try to dismiss it). If they were to do a 32c/64t ARM processor, 3.0+GHz with dual channel DDR5 and 20-40 PCIe4.0 lanes around 100W or less, that would be a very intriguing processor indeed.
As for the other ARM CPU Licensees, I really don't think Nvidia would be stupid enough to mess with them. With RISC-V as a very real option for them to move to, it would be in Nvidia's best interest to do what they can to keep ARM Licensees happy.