The best open LLMs you can run on 16 GB of VRAM

Open LLMs that fit in 16 GB of VRAM at their default quantization, so they run on an RTX 4060 Ti 16 GB, a 4070 Ti Super, or a 16 GB Mac. Ranked by the largest model that fits, with the honest local cost of $0. We confirm the fit; quality is yours to judge.

How this is ranked: the filter is objective, so "best" here means "fits in 16 GB." VRAM is computed by the engine, and the models that fit are ordered by size, largest first. No subjective quality claim is made.
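To make the filter-then-sort idea concrete, here is a minimal sketch in Python. It is not the engine's actual calculation, which this page does not spell out. It assumes VRAM is roughly the weight size (parameters times bits per weight) plus a fixed overhead, and it reads the 16 GB budget as GiB. The `Model` class, the 1.5 GiB overhead, the 4.5 bits per weight, and the `example-*` entries are all hypothetical placeholders.

```python
from dataclasses import dataclass

GIB = 1024 ** 3  # bytes per GiB


@dataclass(frozen=True)
class Model:
    name: str               # hypothetical catalog entry, not a real model
    params_billion: float   # parameter count, in billions
    bits_per_weight: float  # average bits per weight at the default quant


def estimate_vram_gib(model: Model, overhead_gib: float = 1.5) -> float:
    """Rough estimate: quantized weights plus a fixed overhead.

    The overhead (KV cache, activations, runtime buffers) is an assumed
    constant here; a real engine would compute it per model and context.
    """
    weight_bytes = model.params_billion * 1e9 * model.bits_per_weight / 8
    return weight_bytes / GIB + overhead_gib


def rank_fitting(models: list[Model], budget_gib: float = 16.0) -> list[Model]:
    """Keep only models that fit the budget, largest first."""
    fitting = [m for m in models if estimate_vram_gib(m) <= budget_gib]
    return sorted(fitting, key=lambda m: m.params_billion, reverse=True)


# Hypothetical catalog, for illustration only.
catalog = [
    Model("example-7b", 7, 4.5),
    Model("example-14b", 14, 4.5),
    Model("example-34b", 34, 4.5),
]

ranked = rank_fitting(catalog)
for m in ranked:
    print(f"{m.name}: ~{estimate_vram_gib(m):.1f} GiB")
```

Under these assumptions the 34B entry is filtered out and the 14B entry ranks first. The ordering uses size only, which matches the "no subjective quality claim" rule above.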

Showing the top 40 of 237.

Honest, $0-markup. © 2026 Cynosure LLC.