Image Generation Guides

Deploy GPU-accelerated text-to-image generation models on Spheron GPU instances.

VRAM requirements

Black Forest Labs text-to-image models. FLUX.1-dev delivers state-of-the-art photorealism on an RTX 4090; FLUX.2 on H100 for highest fidelity.

Hardware: RTX 4090 24GB (FLUX.1-dev) · H100 80GB (FLUX.2)

Stability AI diffusion models from SD 1.5 (6GB) through SDXL (10GB) and SD 3.5 Large (24GB). FastAPI /generate endpoint for programmatic use.

Hardware: 6–24GB VRAM depending on model variant

Node-based visual workflow server for image generation. Docker container on port 8188 with SSH tunnel setup. Supports custom workflows via JSON API.

Hardware: RTX 4090 24GB (recommended)