How to run CogVideoX-2B locally

Zhipu AI / THUDM · Text → video · 2B params · Apache-2.0 (commercial OK)

CogVideoX-2B is the smaller, fully Apache-2.0 model in the CogVideoX family. It made open text-to-video runnable on free Colab T4 GPUs and is one of the most-used entry-level video models.

What it costs to run — $0 markup

Note: hosted services bill generative-media models per image, per second, or per minute, not per token. Running on your own hardware or a GPU you rent yourself is usually far cheaper, and running locally keeps your data on your machine.

Key facts

Does: Text → video
VRAM to run: ~4.0 GB (fp16; runs on a free T4 Colab with quantization/offload; ~12-16 GB unoptimized)
Download: ~5 GB
Parameters: 2B
License: Apache-2.0 (commercial use OK)
Run with: ComfyUI / Diffusers
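
As a rough sanity check on the VRAM row, the sketch below works out how much memory 2B parameters occupy at different precisions. It assumes the ~4.0 GB figure is about weights stored at 2 bytes per parameter (fp16). That reading is an assumption, not something this page states. The helper name `weights_size_gb` is made up for illustration, and the estimate leaves out the text encoder, the VAE, and activation memory, all of which add to the real footprint.

```python
def weights_size_gb(num_params: float, bytes_per_param: int) -> float:
    """Back-of-envelope size of model weights in decimal gigabytes.

    Counts parameters only; activations, the text encoder and the VAE
    are deliberately not included in this estimate.
    """
    return num_params * bytes_per_param / 1e9


NUM_PARAMS = 2e9  # "2B" from the key facts above

fp32_gb = weights_size_gb(NUM_PARAMS, 4)  # 4 bytes per parameter
fp16_gb = weights_size_gb(NUM_PARAMS, 2)  # 2 bytes per parameter
int8_gb = weights_size_gb(NUM_PARAMS, 1)  # 1 byte per parameter (quantized)

print(f"fp32: {fp32_gb:.1f} GB, fp16: {fp16_gb:.1f} GB, int8: {int8_gb:.1f} GB")
```

Halving the bytes per parameter halves the weight footprint, which is the basic reason quantization helps on small GPUs. Peak usage during generation will be higher than these numbers.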

Get CogVideoX-2B on Hugging Face →
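
Once the weights are available, a minimal Diffusers run might look like the sketch below. It assumes a recent `diffusers` release that ships `CogVideoXPipeline`, plus `torch` and `accelerate`, and it assumes the Hugging Face repository ID `THUDM/CogVideoX-2b`. The step count, guidance scale, frame count, and frame rate are illustrative values rather than settings taken from this page, and `generate_video` is a made-up wrapper name.

```python
def generate_video(prompt: str, out_path: str = "output.mp4") -> str:
    """Generate a short clip with CogVideoX-2B and save it as an MP4.

    Heavy imports live inside the function so that merely defining it
    does not require a GPU or trigger a model download.
    """
    import torch
    from diffusers import CogVideoXPipeline
    from diffusers.utils import export_to_video

    # Assumed repository ID; fp16 matches the precision in the key facts.
    pipe = CogVideoXPipeline.from_pretrained(
        "THUDM/CogVideoX-2b", torch_dtype=torch.float16
    )

    # Memory-saving options for small GPUs: keep submodules on the CPU
    # until they are needed, and decode the video in slices and tiles.
    pipe.enable_sequential_cpu_offload()
    pipe.vae.enable_slicing()
    pipe.vae.enable_tiling()

    # Illustrative generation settings (assumptions, not tuned values).
    frames = pipe(
        prompt=prompt,
        num_inference_steps=50,
        guidance_scale=6.0,
        num_frames=49,
    ).frames[0]

    export_to_video(frames, out_path, fps=8)
    return out_path
```

Calling `generate_video("A panda playing guitar in a bamboo forest")` would download the weights on first use and write `output.mp4`. Sequential offload trades speed for memory, so expect slow generation on a T4-class GPU.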


Open the free Spanvero advisor → · We point you to the open weights + your own accounts, $0 markup, never resell compute. © 2026 Cynosure LLC.