NVIDIA models

Models published by NVIDIA, including Nemotron and optimized releases. The honest, $0-markup cost to run each: free on your own machine, on a rented GPU, or through your own API key.
Qwen3.6 35B A3B NVFP4: 18.7B, nvidia · $0.25/1M API
Gemma 4 31B IT NVFP4: 20.9B, nvidia · $0.27/1M API
NVIDIA Nemotron 3 Super 120B A12B NVFP4: 67.2B, nvidia · $0.64/1M API
NVIDIA Nemotron 3 Nano 30B A3B BF16: 31.6B, nvidia · $0.35/1M API
Gemma 4 26B A4B NVFP4: 14.4B, nvidia · $0.22/1M API
DeepSeek R1 0528 NVFP4 v2: 393.6B, nvidia · $3.25/1M API
NVIDIA Nemotron 3 Nano 4B BF16: 4B, nvidia · $0.13/1M API
NVIDIA Nemotron 3 Super 120B A12B BF16: 123.6B, nvidia · $1.09/1M API
NVIDIA Nemotron 3 Nano 30B A3B NVFP4: 18.2B, nvidia · $0.25/1M API
NVIDIA Nemotron Nano 9B v2: 8.9B, nvidia · $0.17/1M API
Llama 3.3 Nemotron Super 49B v1.5: 49.9B, nvidia · $0.50/1M API
Nemotron Labs Diffusion 8B Base: 8.5B, nvidia · $0.17/1M API
MiniMax M2.7 NVFP4: 116.3B, nvidia · $1.03/1M API
NVIDIA Nemotron 3 Ultra 550B A55B NVFP4: 335B, nvidia · $2.78/1M API
diffusiongemma 26B A4B it NVFP4: 14.4B, nvidia · $0.22/1M API
NVIDIA Nemotron Nano 9B v2 Japanese: 8.9B, nvidia · $0.17/1M API
DeepSeek V4 Flash NVFP4: 166.7B, nvidia · $1.43/1M API
Llama 3.3 Nemotron Super 49B v1: 49.9B, nvidia · $0.50/1M API
NVIDIA Nemotron Nano 12B v2: 12.3B, nvidia · $0.20/1M API
Qwen3 14B NVFP4: 8.2B, nvidia · $0.17/1M API
NVIDIA Nemotron 3 Ultra 550B A55B BF16: 560.5B, nvidia · $4.58/1M API
DeepSeek V4 Pro NVFP4: 910B, nvidia · $7.38/1M API
Nemotron Labs Diffusion 3B Base: 3.8B, nvidia · $0.13/1M API
MiniMax M2.5 NVFP4: 116.3B, nvidia · $1.03/1M API
Qwen3 32B NVFP4: 17.2B, nvidia · $0.24/1M API
Nemotron Labs Diffusion 3B: 3.8B, nvidia · $0.13/1M API
Nemotron Cascade 2 30B A3B: 31.6B, nvidia · $0.35/1M API
Llama 3.1 Nemotron Safety Guard 8B v3: 8B, nvidia · $0.16/1M API
Qwen3.5 122B A10B NVFP4: 64.6B, nvidia · $0.62/1M API

Other publishers: Qwen (Alibaba) · Meta Llama · DeepSeek · Google (Gemma) · Microsoft (Phi) · Mistral AI · OpenAI (gpt-oss) · IBM Granite · Z.ai (GLM) · Moonshot AI (Kimi) · MiniMax · Cohere · Ai2 (OLMo) · Nous Research · Liquid AI · TII (Falcon) · EleutherAI · InternLM · Xiaomi (MiMo)
Prices as of 2026-06-17. We're an honest advisor: $0 markup, your own accounts, and we never resell compute. © 2026 Cynosure LLC.