All AI models

Browse 354 open models by size, by publisher, or by the GPU you have. Each listing shows the cheapest way to run that model: free on your own machine, on a rented GPU at the vendor's price, or through your own API key.

By your GPU: 8 GB of VRAM · 16 GB of VRAM · 24 GB of VRAM · 48 GB of VRAM

By publisher: Qwen (Alibaba) · Meta Llama · DeepSeek · Google (Gemma) · Microsoft (Phi) · Mistral AI · NVIDIA · OpenAI (gpt-oss) · IBM Granite · Z.ai (GLM) · Moonshot AI (Kimi) · MiniMax · Cohere · Ai2 (OLMo) · Nous Research · Liquid AI · TII (Falcon) · EleutherAI · InternLM · Xiaomi (MiMo)

Flagship (80B+)

The biggest open models — multi-GPU or API territory.

Large (34–80B)

Top quality that still fits a single big rented GPU.

Medium (14–34B)

The single-GPU sweet spot — strong and self-hostable.

Small (4–14B)

Runs on a good laptop or consumer GPU.

Tiny (under 4B)

Edge / on-device — runs almost anywhere.
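
The size bands above can be read as a simple lookup on parameter count. The sketch below is illustrative only: the function name `size_tier` and the `TIERS` table are hypothetical, and it assumes each band's lower bound is inclusive (so a 34B model counts as Large and an 80B model as Flagship). That assumption matches "under 4B" and "80B+", but the page does not say how it treats the shared 14B and 34B edges.

```python
# Lower bounds in billions of parameters, taken from the tier headings.
# Assumption: lower bounds are inclusive; the page does not confirm this
# for the shared 14B and 34B boundaries.
TIERS = [
    (80.0, "Flagship"),
    (34.0, "Large"),
    (14.0, "Medium"),
    (4.0, "Small"),
    (0.0, "Tiny"),
]


def size_tier(params_billions: float) -> str:
    """Return the size band for a model with the given parameter count."""
    if params_billions <= 0:
        raise ValueError("parameter count must be positive")
    # TIERS is sorted from largest to smallest bound, so the first match wins.
    for lower_bound, name in TIERS:
        if params_billions >= lower_bound:
            return name
    raise AssertionError("unreachable: the 0.0 bound matches any positive value")


if __name__ == "__main__":
    for size in (3.8, 7, 32, 70, 120):
        print(f"{size}B -> {size_tier(size)}")
```

Under these assumptions a 7B model lands in Small and a 70B model in Large; if the catalog treats the shared edges differently, only the comparison inside the loop needs to change.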

Compare models head-to-head →

Open the free advisor → · Prices as of 2026-06-17. We're an honest advisor — $0 markup, your own accounts, we never resell compute. © 2026 Cynosure LLC.