What is Z-Image?

Z-Image is a family of image generation models by Alibaba / Tongyi-MAI.

  • Z-Image-Base: Base model (Unreleased)
  • Z-Image-Turbo: Photo-realistic text2image distilled from Base for few-step generation
  • Z-Image-Edit: Model for editing (Unreleased)

Currently, only Z-Image-Turbo is available for local use, so this page focuses on Z-Image-Turbo.


Model Download

📂ComfyUI/
└── 📂models/
    ├── 📂diffusion_models/
    │   └── z_image_turbo_bf16.safetensors
    ├── 📂text_encoders/
    │   └── qwen_3_4b.safetensors
    └── 📂vae/
        └── ae.safetensors

text2image

Z-Image-Turbo is a distilled model of the same type as Flux.1 dev.

Z-Image-Turbo.json
  • steps ... 6-8
  • cfg ... 1.0

Z-Image-Turbo Fun ControlNet Union

A ControlNet-style patch for Z-Image-Turbo.

Model Download

📂ComfyUI/
└── 📂models/
    └── 📂model_patches/
        └── Z-Image-Turbo-Fun-Controlnet-Union-2.1.safetensors

workflow

Z-Image-Turbo-Fun-Controlnet-Union-2.1.json
  • 🟩 Add model and control image to QwenImageDiffsynthControlnet.
  • 🟩 In this workflow, Depth Anything V2 is used to create a depth map.