My use case: I have installed LLMs on my GPU using the method described in this Reddit post. I want to run WizardLM-30B, which requires 27 GB of RAM. (I also have a quantized version with lower RAM requirements, but I don't have an exact figure for it. I just know that I can fit...
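
For reference, one quick way to check how much free GPU memory is actually available (to compare against that 27 GB figure) is PyTorch's `torch.cuda.mem_get_info`. This is a minimal sketch, assuming PyTorch with CUDA support is installed; it is not part of the setup method from the Reddit post:

```python
import torch

# Query free and total device memory (in bytes) for GPU 0.
free_bytes, total_bytes = torch.cuda.mem_get_info(0)

print(f"Free:  {free_bytes / 1024**3:.1f} GiB")
print(f"Total: {total_bytes / 1024**3:.1f} GiB")
```

If the reported free memory is below the model's requirement, the full-precision weights won't fit and a quantized variant (or CPU/GPU offloading) would be needed.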