NVIDIA models

Models published by NVIDIA, including Nemotron and optimized releases. Each entry shows the honest, $0-markup cost to run it: free on your machine, on a rented GPU, or with your own API key.

Qwen3.6 35B A3B NVFP4 — 18.7B, nvidia · $0.25/1M API est.
Gemma 4 31B IT NVFP4 — 20.9B, nvidia · $0.27/1M API est.
NVIDIA Nemotron 3 Super 120B A12B NVFP4 — 67.2B, nvidia · $0.64/1M API est.
NVIDIA Nemotron 3 Nano 30B A3B BF16 — 31.6B, nvidia · $0.35/1M API est.
Gemma 4 26B A4B NVFP4 — 14.4B, nvidia · $0.22/1M API est.
DeepSeek R1 0528 NVFP4 v2 — 393.6B, nvidia · $3.25/1M API est.
NVIDIA Nemotron 3 Nano 4B BF16 — 4B, nvidia · $0.13/1M API est.
NVIDIA Nemotron 3 Super 120B A12B BF16 — 123.6B, nvidia · $1.09/1M API est.
NVIDIA Nemotron 3 Nano 30B A3B NVFP4 — 18.2B, nvidia · $0.25/1M API est.
NVIDIA Nemotron Nano 9B v2 — 8.9B, nvidia · $0.17/1M API est.
Qwen3.6 27B NVFP4 — 18.2B, nvidia · $0.25/1M API est.
Llama 3.3 Nemotron Super 49B v1.5 — 49.9B, nvidia · $0.50/1M API est.
Nemotron Labs Diffusion 8B Base — 8.5B, nvidia · $0.17/1M API est.
MiniMax M2.7 NVFP4 — 116.3B, nvidia · $1.03/1M API est.
NVIDIA Nemotron 3 Ultra 550B A55B NVFP4 — 335B, nvidia · $2.78/1M API est.
diffusiongemma 26B A4B it NVFP4 — 14.4B, nvidia · $0.22/1M API est.
NVIDIA Nemotron Nano 9B v2 Japanese — 8.9B, nvidia · $0.17/1M API est.
Qwen3 235B A22B NVFP4 — 132.8B, nvidia · $1.16/1M API est.
DeepSeek V4 Flash NVFP4 — 166.7B, nvidia · $1.43/1M API est.
Nemotron Labs Diffusion 8B — 8.5B, nvidia · $0.17/1M API est.
Llama 3.3 Nemotron Super 49B v1 — 49.9B, nvidia · $0.50/1M API est.
NVIDIA Nemotron Nano 12B v2 — 12.3B, nvidia · $0.20/1M API est.
Qwen3 14B NVFP4 — 8.2B, nvidia · $0.17/1M API est.
GLM 5.1 NVFP4 — 381.5B, nvidia · $3.15/1M API est.
Qwen3 8B NVFP4 — 4.7B, nvidia · $0.14/1M API est.
NVIDIA Nemotron 3 Ultra 550B A55B BF16 — 560.5B, nvidia · $4.58/1M API est.
DeepSeek V4 Pro NVFP4 — 910B, nvidia · $7.38/1M API est.
Nemotron Labs Diffusion 3B Base — 3.8B, nvidia · $0.13/1M API est.
MiniMax M2.5 NVFP4 — 116.3B, nvidia · $1.03/1M API est.
NVIDIA Nemotron 3 Nano 30B A3B Base BF16 — 31.6B, nvidia · $0.35/1M API est.
Qwen3 30B A3B NVFP4 — 15.6B, nvidia · $0.22/1M API est.
GLM 5.2 NVFP4 — 381B, nvidia · $3.15/1M API est.
GLM 5 NVFP4 — 435.2B, nvidia · $3.58/1M API est.
Nemotron H 8B Base 8K — 8.1B, nvidia · $0.16/1M API est.
Qwen3 32B NVFP4 — 17.2B, nvidia · $0.24/1M API est.
Nemotron Labs Diffusion 3B — 3.8B, nvidia · $0.13/1M API est.
NVIDIA Nemotron Labs 3 Puzzle 75B A9B NVFP4 — 44.5B, nvidia · $0.46/1M API est.
Nemotron Cascade 2 30B A3B — 31.6B, nvidia · $0.35/1M API est.
MiniMax M3 NVFP4 — 246.6B, nvidia · $2.07/1M API est.
Mistral Medium 3.5 128B NVFP4 — 83.8B, nvidia · $0.77/1M API est.
Llama 3.1 Nemotron Safety Guard 8B v3 — 8B, nvidia · $0.16/1M API est.
Nemotron Labs Diffusion 14B — 13.5B, nvidia · $0.21/1M API est.
Qwen3.5 122B A10B NVFP4 — 64.6B, nvidia · $0.62/1M API est.
Nemotron Labs TwoTower 30B A3B Base BF16 — 63.2B, nvidia · $0.61/1M API est.

Other publishers

Qwen (Alibaba) · Meta Llama · DeepSeek · Google (Gemma) · Microsoft (Phi) · Mistral AI · OpenAI (gpt-oss) · IBM Granite · Z.ai (GLM) · Moonshot AI (Kimi) · MiniMax · Cohere · Ai2 (OLMo) · Nous Research · Liquid AI · TII (Falcon) · EleutherAI · InternLM · Xiaomi (MiMo)

