The best open LLMs you can run on 24 GB of VRAM

Open LLMs that fit in 24 GB of VRAM at their default quant — the RTX 3090 / 4090 / 7900 XTX tier where serious local models like 32B-class checkpoints become runnable. Ranked by the largest model that fits, with honest $0-local and rent-a-GPU costs. We guarantee the fit; you judge the quality.

How this is ranked: Objective fit filter only. 'Best' = 'runs on a 24 GB card.' VRAM is engine-computed; ordering by size, never a quality verdict.

Showing the top 40 of 267. See all →

More: all "best" lists · cost calculator · all models

Open the free Spanvero advisor → · Honest, $0-markup. © 2026 Cynosure LLC.