How to run BAGEL-7B-MoT locally

ByteDance Seed · Unified understanding + image gen + edit · 7B params · Apache-2.0 (commercial OK)

ByteDance's open Mixture-of-Transformer-Experts model (7B active, 14B total) that unifies image understanding, SD3-class text-to-image and strong free-form editing, plus emergent abilities like multiview synthesis and world navigation.

What it costs to run — $0 markup

Note: generative-media models are billed per image / per second / per minute on hosted services — not per token. Running locally or on your own rented GPU is usually far cheaper and keeps your data on your machine.

Key facts

DoesText → image, Image editing, Visual understanding, Free form manipulation, Multiview synthesis
VRAM to run~24 GB (~29GB total weights (7B active / 14B total MoT); the community INT8 build runs on a single 24GB card, BF16 wants 40GB+.)
Download~29 GB
Parameters7B
LicenseApache-2.0 (commercial use OK)
Run withTransformers (BAGEL repo)

Get BAGEL-7B-MoT on Hugging Face →

More multimodal / omni models

Browse: all media models · chat / LLM models

Open the free Spanvero advisor → · We point you to the open weights + your own accounts, $0 markup, never resell compute. © 2026 Cynosure LLC.