The cheapest open LLMs to run on your own hardware

Open LLMs ranked by how little VRAM they need to run locally — the smaller the footprint, the cheaper the GPU you need and the closer to truly $0 it gets. Sorted by computed VRAM-to-run (lowest first), with the honest local and rent-a-GPU cost for each.

How this is ranked: Objective: ranks by engine-computed VRAM (proxy for self-hosting cost — lowest VRAM = cheapest hardware to buy/rent). All runs are $0-markup. Not a quality ranking; the user judges which cheap-to-run model is good enough.

Showing the top 40 of 354. See all →

More: all "best" lists · cost calculator · all models

Open the free Spanvero advisor → · Honest, $0-markup. © 2026 Cynosure LLC.