The best open LLMs you can run on 8 GB of VRAM

Every open LLM in our catalog whose weights plus KV-cache actually fit in 8 GB of VRAM at its default quant — the size of an RTX 3060/4060 or an 8 GB laptop GPU. Ranked by how much model you get for that budget (largest parameter count that still fits), with the honest $0-on-your-own-hardware cost for each. You pick the one whose quality you like; we just guarantee it fits.

How this is ranked: Pure objective filter: 'best' = 'fits your 8 GB card.' VRAM is computed by our shared cost engine from params, quant and context — not a quality opinion. We order by model size (most capability per GB) and never claim a #1 is 'smartest.' Quality judgment is the user's.

Showing the top 40 of 173. See all →

More: all "best" lists · cost calculator · all models

Open the free Spanvero advisor → · Honest, $0-markup. © 2026 Cynosure LLC.