News Nvidia and Mistral AI's super-accurate small language model works on laptops and PCs

Status
Not open for further replies.
I mean, Llama 3, 3.1, and most 8B models will run on a CPU with zero issues. I'm literally running these on an Orange Pi 5 Plus for fun. If they get model switching working, the pipeline could load Whisper, transcribe, then unload it; load an 8B model, process the text, unload it; then load an XTTS model to speak the reply to the user, and repeat. All within 8 GB. My Orange Pi 5 Plus has 16 GB of RAM, so I don't actually need to offload the Whisper or XTTS models, but the CPU bottleneck, even at 6 TOPS, is painfully slow at this time. [Let's also be honest here: anyone can run a 4-bit quantized model, even most toasters.]
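The load/process/unload loop described above can be sketched roughly like this. This is a hypothetical illustration, not real code for any of these projects: the model names, the per-model sizes in GB, and the `loaded` helper are all assumptions made up for the example, and the "models" are just strings. The point is only that with sequential loading, peak RAM equals the largest single model rather than the sum of all three.

```python
# Hypothetical sketch of a sequential load/process/unload pipeline:
# ASR -> LLM -> TTS, with only one model resident at a time so the
# whole voice loop fits in ~8 GB. Sizes are illustrative guesses,
# not measurements.
from contextlib import contextmanager

MEMORY_BUDGET_GB = 8.0
resident_gb = 0.0  # RAM currently used by loaded models
peak_gb = 0.0      # highest RAM usage seen so far

@contextmanager
def loaded(name, size_gb):
    """Pretend to load a model, yield it, then unload it on exit."""
    global resident_gb, peak_gb
    resident_gb += size_gb
    peak_gb = max(peak_gb, resident_gb)
    try:
        yield name
    finally:
        resident_gb -= size_gb  # unload before the next stage loads

def run_turn(audio):
    # Stage 1: speech-to-text (Whisper-class model, ~1.5 GB assumed)
    with loaded("whisper", 1.5) as asr:
        text = f"{asr}:{audio}"
    # Stage 2: 8B LLM, 4-bit quantized (~5 GB assumed)
    with loaded("llm-8b-q4", 5.0) as llm:
        reply = f"{llm}:{text}"
    # Stage 3: text-to-speech (XTTS-class model, ~2 GB assumed)
    with loaded("xtts", 2.0) as tts:
        return f"{tts}:{reply}"

out = run_turn("hello")
# Peak is the largest single model (5 GB), not 1.5 + 5 + 2 = 8.5 GB.
print(out, peak_gb, peak_gb <= MEMORY_BUDGET_GB)
```

With 16 GB of RAM you could skip the unload steps entirely and keep all three resident, which trades memory for the load/unload latency on every turn.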