CivArchive
    Preview 112999068
    Preview 112999115

    🚀 Z-Image AIO | Official Workflows

    Turbo (8 Steps) & Base (28-50 Steps) • Photorealistic Generation • Bilingual Text • FP8 / FP16 / BF16

    ⚠️ Requires ComfyUI v0.11.0+Download here


    📦 Eight Official Workflows Available

    All workflows work with FP8 (~10GB), FP16 (~20GB), and BF16 (~20GB) versions!


    🆕 Z-Image-Base-AIO Workflow (ZIB-AIO-Base)

    Full foundation model with maximum creative control!

    The undistilled 6B parameter model for professional work. Features full CFG control (3.0-5.0), negative prompt support, and high output diversity. Ideal base for LoRA training and complex prompt engineering. Includes SeedVR2 upscaler and Seed Variance Enhancer as optional features.

    Key Features:

    • ✅ Full CFG control (3.0-5.0)

    • ✅ Negative prompts supported

    • ✅ High output diversity

    • ✅ Ideal for LoRA/ControlNet development

    • ✅ Optional LoRA loading via LoraManager

    • ✅ Optional SeedVR2 upscaling

    • ✅ Optional Seed Variance Enhancer

    Required Custom Nodes: 5 nodes (see below)


    🚀 Standard Workflow v1.0 & v2.0 (ZIT-AIO-v1.0 / ZIT-AIO-v2.0)

    Simple text-to-image workflow with improved upscaler and dual sampler options. Features automatic metadata saving and denoise control for upscaling. Perfect for beginners and quick generations. Requires 2 custom nodes.

    v2.0 improvements: Enhanced upscaler with denoise control, better sampler options (res_multistep or euler_ancestral), dual scheduler support (simple or beta).


    🎮 ControlNet Workflow (ZIT-AIO-Control)

    Guided generation with reference images using ControlNet Union (Canny, HED, Depth, Pose, MLSD). Uses megapixel scaling that maintains aspect ratio automatically. Perfect for sketch-to-photo, pose transfer, and precise composition control. Requires ComfyUI 3.77+ and ControlNet Union file.


    🎲 Seed Variance Enhancer Workflow (ZIT-AIO-Variance)

    Adds diversity to outputs by introducing controlled noise to text embeddings. Compensates for low seed variance - get more varied results with the same prompt. Includes manual seed control for reproducibility. Requires SeedVarianceEnhancer custom node.


    🎬 SeedVR2 Video Upscaler Workflow (ZIT-AIO-SeedVR2)

    Professional diffusion-based upscaling using DiT (Diffusion Transformer) models. Delivers superior quality with temporal consistency for videos and images. Supports multiple model variants (3B/7B with FP16/FP8/GGUF) and memory optimization options. Requires SeedVR2 custom node.


    🌊 Depth Anything V3 ControlNet Workflow (ZIT-AIO-DepthV3)

    State-of-the-art depth-guided generation with dual modes: create depth-controlled images OR preview depth as 3D point clouds. Superior multi-view depth consistency compared to traditional methods. Features toggle system for easy mode switching. Requires Depth Anything 3 custom nodes.


    🖼️ Z-Image-Turbo-Anime Workflow (ZIT-AIO-Anime)

    This workflow includes several small but meaningful adjustments and integrates multiple custom nodes. It features the Seed Variance Enhancer, which helps generate different image variations from the same prompt by increasing effective seed diversity. Additionally, SeedVR2 is included as an alternative upscaling solution.

    Z-Image-Turbo-Anime: https://civarchive.com/models/2259646/z-image-turbo-anime


    📊 Quick Comparison

    Turbo Workflows (8-9 Steps, CFG 1.0)

    🚀 Standard v1/v2 → Text-to-image → Simple & fast → 2 custom nodes

    🎮 ControlNet → Guided generation → 5 control types → 3 nodes + ControlNet file

    🎲 Seed Enhancer → Output diversity → More variations → SeedVarianceEnhancer node

    🎬 SeedVR2 → Professional upscaling → Diffusion-based → SeedVR2 node + models

    🌊 DA3 DepthV3 → Depth-guided + 3D → Dual modes → Depth Anything 3 nodes

    🖼️ Anime → Anime style → Custom merged → 5 custom nodes

    Base Workflow (28-50 Steps, CFG 3.0-5.0)

    🆕 Base-AIO → Full control → CFG + Negative prompts → 5 custom nodes


    🔄 Model Versions Available

    Z-Image-Turbo-AIO (8 Steps, CFG 1.0)

    🟡 FP8-AIO (~10GB) - Recommended for most users

    🔵 FP16-AIO (~20GB) - Wide GPU compatibility

    🌟 BF16-AIO (~20GB) - Maximum quality

    Z-Image-Base-AIO (28-50 Steps, CFG 3.0-5.0) 🆕

    🟡 FP8-AIO (~10GB) - Fast, daily use

    🔵 FP16-AIO (~20GB) - Wide GPU compatibility (RTX 2000/3000)

    🌟 BF16-AIO (~20GB) - Max quality, ideal for LoRA training

    All versions work on 8GB VRAM!


    🆚 Turbo vs Base - When to Use Which?

    Use Turbo when:

    ⚡ Speed is priority - 8 steps = 3-5 seconds

    📸 Production workflows - Consistent high quality

    💾 Quick iterations - Rapid prototyping

    🎯 Simple prompts - Less complex scenes

    Use Base when:

    🎨 Creative exploration - Higher diversity across seeds

    🔧 LoRA/ControlNet development - Undistilled foundation

    📝 Complex prompt engineering - Full CFG control

    🚫 Negative prompting needed - Remove unwanted elements

    🎯 Maximum control - Fine-tune every aspect


    ⚙️ Settings by Model Type

    Z-Image-Turbo Settings (All Turbo Workflows)

    📊 Steps: 8-9 🎚️ CFG: 1.0 (don't change!) 🚫 Negative Prompt: ❌ Not used 🎲 Sampler: res_multistep (sharp) / euler_ancestral (smooth) 📈 Scheduler: simple (clean) / beta (balanced)

    Z-Image-Base Settings (Base-AIO Workflow) 🆕

    📊 Steps: 28-50 🎚️ CFG: 3.0-5.0 🚫 Negative Prompt: ✅ Full support! 🎲 Sampler: euler ⭐ / dpmpp_2m 📈 Scheduler: normal ⭐ / karras


    💡 Prompting Tips

    Natural Language Works Best

    Professional food photography of artisan breakfast plate. 
    Golden poached eggs on sourdough toast, crispy bacon, fresh 
    avocado slices. Morning sunlight creating warm glow. Shallow 
    depth of field, magazine-quality presentation.
    

    Bilingual Text Rendering

    Neon sign reading "COFFEE SHOP" in bright blue letters
    Sign with "咖啡店" in elegant gold calligraphy
    

    Important

    Turbo:

    • ❌ NO negative prompts (model ignores them)

    • ✅ Natural language, not tags

    • ✅ Detailed (100-300 words)

    Base:

    • ✅ Negative prompts work great!

    • ✅ Natural language, not tags

    • ✅ Detailed (100-300 words)

    • ✅ Use CFG 3.0-5.0 for control


    📥 Downloads

    Main Models

    Z-Image-Turbo-AIO:

    Z-Image-Base-AIO: 🆕

    Additional Files


    📦 Custom Nodes

    Required for ALL Workflows

    rgthree-comfy https://github.com/rgthree/rgthree-comfy

    comfyui_image_metadata_extension https://github.com/edelvarden/comfyui_image_metadata_extension

    Additional per Workflow

    ZIB-AIO-Base (Base Workflow): 🆕

    ZIT-AIO-Control:

    ZIT-AIO-Variance:

    ZIT-AIO-SeedVR2:

    ZIT-AIO-DepthV3:

    ZIT-AIO-Anime:

    💡 Tip: Use ComfyUI Manager → "Install Missing Custom Nodes" for easy installation!


    🎯 Workflow-Specific Details

    🆕 Base-AIO (ZIB-AIO-Base)

    • Steps: 28-50 (more = better quality)

    • CFG: 3.0-5.0 (4.0 recommended start)

    • Sampler: euler (sharp) / dpmpp_2m (smooth)

    • Scheduler: normal (standard) / karras (refined)

    • Negative prompts: ✅ Full support!

    • Upscaler: Optional with denoise 0.35

    • LoRA: Optional via LoraManager node

    • SeedVR2: Optional for AI upscaling

    • Seed Variance: Optional for diversity

    🚀 Standard v2.0

    • Improved upscaler with denoise control (0.4-0.6)

    • Dual sampler support

    • scale_by parameter for output size

    • Perfect for everyday use

    🎮 ControlNet

    • 5 control types: Canny, HED, Depth, Pose, MLSD

    • Megapixel scaling (auto aspect ratio)

    • ControlNet strength: 0.6-0.8 recommended

    • ⚠️ Save ControlNet in: ComfyUI/models/model_patches/

    🎲 Seed Variance Enhancer

    • randomize_percent: 50

    • strength: 20-30

    • noise_insert: 'noise on beginning steps'

    • Trade-off: Diversity vs prompt adherence

    🎬 SeedVR2

    • resolution: 1536 (target for short edge)

    • batch_size: MUST be 4n+1 (1, 5, 9, 13, 17, 21...)

    • color_correction: 'lab' (recommended)

    • Models: 3B (faster) or 7B (higher quality)

    🌊 Depth Anything V3

    • Two modes: Generate images OR 3D preview

    • Models: da3_small/base/large/giant

    • 8GB VRAM: Use up to da3_large

    • ⚠️ Toggle correctly: Generate mode ≠ 3D Preview mode


    ❓ FAQ

    Q: Which workflow should I use? A: Turbo Standard v2 for quick text-to-image. Base-AIO for full control & LoRA work. Others for specific needs.

    Q: FP8, FP16, or BF16? A: FP8 for most users (10GB). FP16 for older GPUs. BF16 for maximum quality (20GB).

    Q: Turbo or Base? A: Turbo for speed (8 steps). Base for control & quality (28-50 steps, CFG, negative prompts).

    Q: Which sampler is better? A: Turbo: res_multistep (sharp) or euler_ancestral (smooth). Base: euler or dpmpp_2m.

    Q: Metadata saved automatically? A: Yes! All workflows save metadata for easy CivitAI uploads.

    Q: Works on 8GB VRAM? A: Yes! All workflows tested on RTX 4060 8GB.

    Q: Do I need all custom nodes? A: No! Only install nodes for workflows you'll use. Base requirement is 2 nodes (rgthree + metadata).

    Q: Can I use LoRAs with Base? A: Yes! Base-AIO includes LoraManager for easy LoRA loading. Turbo doesn't support LoRA training well.


    🌟 Key Features

    All Workflows

    Fast Generation - Turbo: 3-5 sec | Base: 30-60 sec 📦 All-in-One - VAE + Text Encoder integrated 📸 Photorealistic - Professional quality 📖 Bilingual - English & Chinese text rendering 💾 Metadata Auto-Save - Easy sharing 🎯 8GB VRAM Friendly - Accessible to everyone

    Base-Specific 🆕

    ⚙️ Full CFG Control - 3.0-5.0 range 🚫 Negative Prompts - Remove unwanted elements 🎨 High Diversity - More variation across seeds 🔧 LoRA Ready - Ideal for training & using LoRAs

    Turbo-Specific

    Ultra-Fast - 8-9 steps only 🎯 Consistent - Same quality every time 💾 Efficient - Lower compute needed


    🎨 Perfect For

    Base-AIO:

    • LoRA training & testing

    • Complex compositions

    • Fine-tuned control

    • Professional projects

    • Creative exploration

    Turbo Workflows:

    • Quick iterations

    • Production workflows

    • Social media content

    • Marketing materials

    • Rapid prototyping

    Both:

    • Product photography

    • Architectural visualization

    • Food photography

    • Portrait photography

    • Bilingual content


    📐 System Requirements

    Minimum:

    • VRAM: 8GB

    • RAM: 16GB

    • ComfyUI: v0.11.0+

    Recommended:

    • VRAM: 8GB+ (perfect for all workflows)

    • RAM: 32GB

    • Storage: 50GB+ (for all models)

    Tested Hardware:

    • RTX 4060 8GB @ 1920×1088

    • All FP8, FP16, and BF16 versions work perfectly


    🙏 Credits

    Original Model: Tongyi Lab (Alibaba Group) T

    ext Encoder: Qwen3-4B

    ControlNet Union: Alibaba PAI Team

    SeedVR2: ByteDance Seed Team

    Depth Anything V3: ByteDance Seed Team

    Architecture: Single-Stream DiT (6B parameters)

    License: Apache 2.0

    Workflows: Optimized for ComfyUI with metadata support

    Community: Thanks to all testers and contributors!


    📊 File Sizes

    Main Models:

    • Turbo FP8-AIO: ~10GB

    • Turbo FP16-AIO: ~20GB

    • Turbo BF16-AIO: ~20GB

    • Base FP8-AIO: ~10GB

    • Base FP16-AIO: ~20GB

    • Base BF16-AIO: ~20GB

    Additional Files:

    • ControlNet Union: ~2.5GB

    • SeedVR2 models: 10-20GB (3B-7B variants)

    • Depth Anything V3: 80MB-1.15GB (model dependent)

    Total (all workflows): ~70GB for complete setup


    🎯 Getting Started

    1️⃣ Download model (FP8, FP16, or BF16 - Turbo or Base)

    2️⃣ Install base custom nodes (rgthree + metadata)

    3️⃣ Choose workflow based on your needs

    4️⃣ Install workflow-specific nodes if needed

    5️⃣ Load workflow into ComfyUI v0.11.0+

    6️⃣ Generate!


    Updated: January 2026 Tested: RTX 4060 8GB @ 1920×1088 ComfyUI: v0.11.0+ required


    Eight powerful workflows for every creative need! 🚀

    Turbo for speed | Base for control | Choose what fits your workflow!

    Description

    Official ComfyUI ControlNet workflow for Z-Image-Turbo-AIO photorealistic

    generation with precise control guidance.

    FEATURES:

    - ControlNet Union support - 5 control types in one model

    - Canny (edge detection), HED (soft edges), Depth (3D structure)

    - Pose (human figures), MLSD (architecture/lines)

    - Megapixel scaling - maintains aspect ratio automatically

    - Automatic metadata saving - easy CivitAI uploads

    - Works with both FP8 (10GB) and BF16 (20GB) versions

    - 8GB VRAM friendly - tested on RTX 4060

    REQUIREMENTS:

    - ComfyUI 3.77+ (ModelPatchLoader compatibility)

    - Z-Image-Turbo-AIO model (FP8 or BF16)

    - Z-Image-Turbo-Fun-Controlnet-Union (~2.5GB)

    - Save ControlNet in: ComfyUI/models/model_patches/

    - Custom nodes: rgthree-comfy, comfyui_image_metadata_extension,

    comfyui_controlnet_aux

    SETTINGS:

    Steps: 9 • CFG: 1.0 • Sampler: res_multistep or euler_ancestral

    Scheduler: simple or beta • ControlNet Strength: 0.6-0.8

    Megapixels: 1.0 - 2.08 MP recommended

    PERFECT FOR:

    Sketch to photo conversion, pose transfer, scene composition control,

    architectural visualization, edge-guided generation, depth-based layouts.

    Generation time: 3-5 seconds @ 1920×1088 on RTX 4060 8GB.

    All nodes pre-configured and connected. Just upload input image, choose

    preprocessor type, write prompt, and generate!

    FAQ

    Workflows
    ZImageTurbo

    Details

    Downloads
    427
    Platform
    CivitAI
    Platform Status
    Available
    Created
    12/7/2025
    Updated
    4/26/2026
    Deleted
    -

    Files

    zImageTurboBaseAIO_zitAIOControl.zip

    Mirrors