π Z-Anime | Full Anime Fine-Tune on Z-Image Base
Full Fine-Tune β’ Rich Aesthetics β’ Strong Diversity β’ Full Negative Prompt Support
BF16 & FP8 & GGUF & AIO β’ Natural Language Prompts β’ 8GB VRAM
π€ Now also on Hugging Face: huggingface.co/SeeSee21/Z-Anime β including the full Diffusers folder for ZImagePipeline.from_pretrained() use.
β¨ What is Z-Anime?
Z-Anime is a full fine-tune of Alibaba's Z-Image (Base) architecture β not a LoRA merge, but a completely retrained model optimized for anime aesthetics from the ground up.
Built on the S3-DiT (Single-Stream Diffusion Transformer) with 6 billion parameters, Z-Anime inherits everything that makes Z-Image Base special: rich diversity, strong controllability, full negative prompt support and a high ceiling for fine-tuning β now fully tuned for anime.
This page contains the complete Z-Anime family:
π Z-Anime Base β Full quality, full control, full creativity
β‘ Z-Anime Distill-8-Step β Great results in 8 steps
π Z-Anime Distill-4-Step β Maximum speed, 4 steps
π¦ GGUF Variants β Q8_0 + Q4_K_S for low VRAM / CPU / AMD
π¦ AIO Variants β All-in-one checkpoints (Base + 4-Step + 8-Step)
Each main variant is available in BF16 (~12 GB) and FP8 (~6 GB).
π― Key Features
β Full fine-tune on Z-Image Base β not a LoRA merge
β Rich anime aesthetics with strong style diversity
β Natural language prompts β detailed descriptions, not tag lists
β High diversity across characters, poses, compositions and layouts
β LoRA training ready β perfect base for further fine-tuning
β Partially NSFW capable
β 8 GB VRAM compatible
β All variants supported by the official Z-Anime ComfyUI Workflow
πΊοΈ Z-Anime Roadmap
β Released
π Z-Anime Base β Full fine-tune on Z-Image Base, BF16 & FP8
β‘ Z-Anime Distill-8-Step β fast anime generation in 8 steps, CFG 1.0, BF16 & FP8
π Z-Anime Distill-4-Step β ultra-fast anime generation in 4 steps, CFG 1.0, BF16 & FP8
π¦ GGUF Variants β for low VRAM and AMD GPUs. Since CivitAI currently has no dedicated GGUF category, here is what the files represent:
Z-Anime-Base-Q8_0 = Pruned Model FP8 (6.73 GB)
Z-Anime-Base-Q4_K_S = Pruned Model NF4 (4.2 GB)
π¦ AIO Versions β All variants with VAE + Text Encoder integrated in a single file:
z-anime-base-aio (BF16 + FP8)
z-anime-distill-8step-aio (BF16 + FP8)
z-anime-distill-4step-aio (BF16 + FP8)
π§ Z-Anime ComfyUI Workflow β Official workflow, supports all variants (auto-detects Diffusion / GGUF / AIO loaders, optional LoRA, optional 1.5Γ upscale)
π€ Hugging Face Repo β full mirror including the Diffusers folder for Python users: huggingface.co/SeeSee21/Z-Anime
More updates coming β follow to stay notified! π
π¦ Versions Overview
π’ BF16 (~12 GB)
Maximum precision. BFloat16 format, no quality compromise. Best for professional or commercial work and LoRA training. Still runs on 8 GB VRAM.
π‘ FP8 (~6 GB)
Recommended for most users. Half the file size, much faster downloads. Excellent quality, barely distinguishable from BF16. Perfect for everyday use and testing.
π΅ GGUF
Optimized for lightweight inference setups, especially useful for low VRAM, CPU inference, or alternative backends.
π£ AIO
All-in-one checkpoints with image model + Text Encoder + VAE integrated into a single file. Single-file convenience, no extra loaders needed.
π Z-Anime Base
The foundation of the Z-Anime family. A full fine-tune with the highest quality ceiling, the widest creative range and full negative prompt support.
Recommended Settings:
Steps: 28β50
CFG: 3.0β5.0 (up to 9.0 possible)
Sampler: euler_ancestral
Scheduler: beta
Negative: strongly recommended β very responsive!
CFG Guide: 3.0β5.0 is the sweet spot for balanced quality and creativity. 5.0β7.0 gives tighter prompt adherence. 7.0β9.0 is for maximum control β watch for over-saturation. Above 9.0 is not recommended.
Negative prompts have full effect on Z-Anime Base. The official workflow ships with an optimized negative prompt ready to use.
β‘ Z-Anime Distill-8-Step
The sweet spot of the family. Distilled from Z-Anime Base, delivering strong anime results in just 8 steps. Much faster than Base while keeping most of the quality intact.
Recommended Settings:
Steps: 8
CFG: 1.0 (max ~1.5)
Sampler: euler_ancestral
Scheduler: beta
Negative: limited effect
CFG Guide: Runs best at CFG 1.0 by design. Small nudges up to 1.3β1.5 are possible for slightly tighter prompt adherence. Do not go above 1.5 β artifacts may appear.
Negative prompts have limited effect at this distillation level. Use ConditioningZeroOut (included in the workflow) instead of writing a full negative prompt.
π Z-Anime Distill-4-Step
The fastest Z-Anime variant. Built for maximum throughput β rapid prototyping, batch generation and situations where speed matters most.
Recommended Settings:
Steps: 4
CFG: 1.0 (max ~1.5)
Sampler: euler_ancestral
Scheduler: beta
Negative: limited effect
CFG Guide: At 4 steps the model has very little correction room. Stay at CFG 1.0 for the most stable results. Nudging up to 1.3β1.5 is possible but increases instability. Do not go above 1.5.
Tips for 4-Step: Be specific and front-load the most important details early in your prompt. The optional upscaler (hires fix or SeedVR2) in the workflow is especially useful here to recover fine detail.
π Resolution Guide
| Use Case | Resolution | |---|---| | β Portrait / Character art | 832 Γ 1216 | | Landscape / Scenes / Backgrounds | 1216 Γ 832 | | Square / General purpose | 1024 Γ 1024 | | Tall / Full body / Phone wallpaper | 768 Γ 1344 | | Cinematic / Wide scenes | 1920 Γ 1088 | | High quality / Detailed portraits | 1024 Γ 1536 |
Supported range: 512 Γ 512 to 2048 Γ 2048, any aspect ratio. All resolutions run on 8 GB VRAM.
π‘ Prompting Guide
Natural language β not tag lists!
β Good
A young anime girl with long silver hair and golden eyes, wearing a
traditional shrine maiden outfit with white haori and red hakama.
She stands in a sunlit bamboo forest, cherry blossoms falling softly
around her. Warm afternoon light filtering through the trees,
detailed fabric shading, expressive face, calm serene expression.
High quality anime illustration with fine line work.
β Avoid
anime girl, silver hair, shrine maiden, bamboo, cherry blossom, warm light
Character portraits
Detailed anime portrait of [character], soft rim lighting,
expressive eyes with detailed reflections, fine hair strands,
clean linework, professional anime illustration quality.
Action scenes
Dynamic anime [scene], dramatic angle, motion energy, speed lines,
particle effects, cinematic composition, detailed shading,
high quality anime art.
Backgrounds & landscapes
Anime [location] at [time of day], [lighting], [atmosphere],
Studio Ghibli inspired detail level, beautiful background art,
wallpaper quality.
π§ Installation
Step 1 β Download your version (BF16, FP8, GGUF or AIO) for the variant you want.
Step 2 β Place the files:
Standard BF16 / FP8 models:
ComfyUI/models/diffusion_models/
βββ z-anime-base-bf16.safetensors
βββ z-anime-base-fp8.safetensors
βββ z-anime-distill-8step-bf16.safetensors
βββ z-anime-distill-8step-fp8.safetensors
βββ z-anime-distill-4step-bf16.safetensors
βββ z-anime-distill-4step-fp8.safetensors
GGUF variants:
ComfyUI/models/unet/
βββ z-anime-base-q8_0.gguf
βββ z-anime-base-q4_k_s.gguf
Text Encoder & VAE (for the non-AIO variants):
ComfyUI/models/clip/
βββ qwen_3_4b.safetensors
ComfyUI/models/vae/
βββ ae.safetensors
AIO variants β single file, no extras needed:
ComfyUI/models/checkpoints/
βββ z-anime-base-aio-bf16.safetensors
βββ z-anime-base-aio-fp8.safetensors
βββ z-anime-distill-8step-aio-bf16.safetensors
βββ z-anime-distill-8step-aio-fp8.safetensors
βββ z-anime-distill-4step-aio-bf16.safetensors
βββ z-anime-distill-4step-aio-fp8.safetensors
Step 3 β Load in ComfyUI:
Use the Load Diffusion Model node for the model file, a CLIPLoader for the text encoder and a VAELoader for the VAE.
For the GGUF versions: load the GGUF model from the
models/unet/folder, use the same CLIP and VAE files as above.For the AIO versions: just use a standard Checkpoint Loader β no extra CLIP or VAE loading required.
Or use the official Z-Anime ComfyUI Workflow β it handles all variants and precisions with a built-in model switch.
π¦ Custom Nodes (for the official workflow)
rgthree-comfy
ComfyUI-Lora-Manager
ComfyUI-GGUF (only for the GGUF variants)
ComfyUI-SeedVR2_VideoUpscaler (optional, only for SeedVR2 upscale)
π€ Hugging Face Repo
The complete model family is also mirrored on Hugging Face:
π huggingface.co/SeeSee21/Z-Anime
The HF repo additionally contains:
The full Diffusers-format folder (
diffusers/) β drop-in compatible withZImagePipeline.from_pretrained()for Python usersAn alternative Text Encoder by BennyDaBall β Engineer V4 (full fine-tune of the Z-Image text encoder with SMART training, drop-in compatible β often produces more varied outputs from the same seed)
π Version History
v1.0 β Initial Release
Z-Anime Base in BF16 & FP8
Z-Anime Distill-8-Step in BF16 & FP8
Z-Anime Distill-4-Step in BF16 & FP8
GGUF Variants added:
Z-Anime-Base-Q8_0 = pruned FP8 model (6.73 GB)
Z-Anime-Base-Q4_K_S = pruned Q4_K_S / NF4-style model (4.2 GB)
AIO Variants added (all 6):
z-anime-base-aio-bf16 / -fp8
z-anime-distill-8step-aio-bf16 / -fp8
z-anime-distill-4step-aio-bf16 / -fp8
Official ComfyUI Workflow included β supports all variants
Hugging Face mirror with full Diffusers folder for Python users
Optimized for euler_ancestral + beta, simple practical use across the family
π Credits
Base Architecture: Tongyi Lab (Alibaba) β Z-Image
Fine-Tune: SeeSee21
License: Apache 2.0
Architecture: S3-DiT (Single-Stream Diffusion Transformer, 6B parameters)
Base Model: Tongyi-MAI/Z-Image
GitHub: Tongyi-MAI/Z-Image
Engineer V4 Text Encoder (HF only): BennyDaBall/Qwen3-4b-Z-Image-Engineer-V4
Z-Anime β Anime at its finest, powered by Z-Image Base. π
Description
π§ Text Encoder
Z-Anime is built for the standard Z-Image Base text encoder pipeline in ComfyUI.
Please use the matching text encoder provided for Z-Anime / Z-Image Base, together with its correct tokenizer.
Using the wrong text encoder or tokenizer can lead to:
black images
pure noise
broken prompt understanding
errors during generation
To make setup easier, I am uploading matching Text Encoder files in BF16 and FP8, so you do not have to search for the correct one yourself.