CivArchive
    Kirazuri (Anima) - v1.0 [anima-preview-1]
    NSFW
    Preview 125431505
    Preview 125431605
    Preview 125431766
    Preview 125431889
    Preview 125431739
    Preview 125431687
    Preview 125431697
    Preview 125431734
    Preview 125431745
    Preview 125432270
    Preview 125432309
    Preview 125432324
    Preview 125432359
    Preview 125432423
    Preview 125432459
    Preview 125432465
    Preview 125432514
    Preview 125432523

    Kirazuri (Anima)

    Version 2 (Latest)

    A full finetune of the Anima preview3-base predominantly trained on high-resolution 1536x1536 AR buckets.

    Expanded the dataset with more recent data and included the full dataset used for my previous model Kirazuri Lazuli (Noobai V-Pred).

    Total training dataset of 35,537 non-synthetic images manually curated including quality and aesthetic ratings with a dataset cutoff now of 2026/04/15.

    Training Details

    Main training with diffusion-pipe commit: d5b78a2c49a07db8f7d9a4c795e4cfe7ba1c3dfe

    Final stage for high-res used fix in commit: b0aa4f1e03169f3280c8518d37570a448420f8be

    • Samples seen(unbatched steps): ~680,000

    • Training time: ~220 hrs

    • Learning Rate: 4e-6 (General Training) and 2e-6 (Aesthetic)

    • LLM Adaptor Learning Rate: 8e-7 (General Training) and 2e-7 (Aesthetic)

    • Per-resolution Effective Batch size: 128 (512p), 96 (1024p), and 48 (1536p)

    • Precision: Mixed BF16

    • Optimizer: AdamW8bit with Kahan Summation

    • Weight Decay: 0.01

    • Timestep Sampling Strategy: Logit-Normal (General Training)

    • Tag Dropout: 30% with protected first 8 tags

    Additional Features used:

    • Structured dataset by resolutions and manual ratings for staged training

    • multiscale_loss_weight=0.5 and flux_shift=true for high-resolution training

    • Mixed Natural Language captions with diffusion-pipe captions.json format:

      "image_1.jpg": [
          "{tags}",
          "{first_n_tags}.\n{nl_caption}",
          "{dropout_tags1}.\n{nl_caption}",
          "{nl_caption}\n{dropout_tags2}"
      ]

    Installing and running

    Workflow:

    Reference the anima preview base instructions. The model is natively supported in ComfyUI. The above image contains a workflow; you can open it in ComfyUI or drag-and-drop to get the workflow.

    Note: Most preview images on the model card additionally use the custom comfyui-prompt-control node for schedule prompting syntax to mix concepts i.e. [word1|word2]
    This custom node is entirely optional but required to exactly recreate the outputs in ComfyUI.

    The model files go in their respective folders inside your model directory:

    Generation Settings

    Trained in mixed resolutions for the majority of training, and finished with dedicated high resolution training.

    Previews are generated mostly at 1536x1024 or 1024x1536.

    1024 resolutions. E.g. 1024x1024, 896x1152, 1152x896, etc.

    30-50 steps, CFG 4-5.

    Same samplers as recommended for the base model work, I like to use:

    • er_sde: the recommended default for 30-50 steps.

    • sa_solver_pece: can converge with good detail in 15-20 steps.

    Prompting

    Like the base model, this model is trained on Danbooru-style tags, natural language captions, and combinations of tags and captions.

    Tag order

    [quality/meta/safety tags] [character] [series] [artist] [1girl/1boy/1other etc] [general tags]

    Mostly the same order as the base model, only the [1girl/1boy/other etc] groups position is towards the end in this models dataset.

    [quality/meta/safety tags] [character] [series] [artist] tag groups are also not shuffled, so their order may have some influence on generations.

    Quality and Aesthetic tags

    Human score based: masterpiece, best quality, very aesthetic, aesthetic

    The very aesthetic and aesthetic tags are where this model diverges from the base, with the intent these can be used to guide the model toward a different aesthetic - a kind of house model bias.

    Meta tags

    absurdres, official art, etc

    Styles

    painterly, chiaroscuro, ligne claire, flat color, no lineart, blending, etc

    traditional media, oil painting \(medium\), watercolor \(medium\), etc

    Known Limitations & Issues:

    Concept Bleeding

    Some character/outfit details and concept bleeding is noticeable when using short prompts.

    Longer tag strings and natural language prompts describing appearance should help somewhat with this.

    Intent for future training is to find the right balance to converge faster on new data while preserving more of the existing knowledge.

    Recognitions

    • Thanks to CircleStone Labs for the Anima Preview base model.

    • Thanks to tdrussell of CircleStone Labs for the diffusion-pipe trainer.

    • Thanks to bluvoll for support using their fork of diffusion-pipe.

    • Thanks to narugo1992 and the deepghs team for open-sourcing various training sets, image processing tools, and models.

    License

    This model is released under the same license as the base model.

    See the base model for details of the CircleStone Labs Non-Commercial License.

    Built on NVIDIA Cosmos

    Description

    Version 1

    This is an experimental full finetune of the Anima Preview version 1 base.

    Total training dataset of 15,420 images curated with manual human quality and aesthetic ratings from 2025/07/03 to 2026/03/19, the model should have a fairly strong recency bias and be capable of generating many characters/concepts/styles prominent from that time period.

    Training Details

    • Samples seen(unbatched steps): ~700,000

    • Training time: ~85 hrs

    • Learning Rate: 5e-6 (General Training) and 2e-6 (Aesthetic)

    • Text Encoder Learning Rate: 1e-6

    • Effective Batch size: 24 (General Training) (1 Batch Size) and 32 (Aesthetic) (16x2 Batch Size)

    • Precision: Mixed BF16

    • Optimizer: AdamW8bit with Kahan Summation

    • Weight Decay: 0.01

    • Timestep Sampling Strategy: Logit-Normal, Shift 3

    • Tag Dropout: 10%

    • Uncond Dropout: 10%

    • Tag Shuffle: True with keep first 8 tags

    Additional Features used:

    • Protected Tags

    • Mixed Natural Language prompts at ratio:

      • tags 50%, nl 10%, tags+nl 20%, nl+tags 20%

    FAQ

    Comments (15)

    deitychaserMar 26, 2026· 1 reaction
    CivitAI

    I liked the noobai Kirazuri so I will def check out this anima version. Could you please upload it to your TensorArt profile?

    luckmiriwindsMar 26, 2026· 2 reactions
    CivitAI

    I love you Motimalu!!!

    empek17Mar 26, 2026
    CivitAI

    is it created on preview 1 version or preview 2?

    motimalu
    Author
    Mar 26, 2026· 2 reactions

    Trained from the preview 1 version

    RakoszMar 27, 2026
    CivitAI

    Is there a list of artists that work with this finetune?

    motimalu
    Author
    Mar 27, 2026

    Sorry, but there are legal reasons why I don't want to explicitly list that

    SfdwackysMar 29, 2026
    CivitAI

    In your example prompts you use @[arist style1|2|3]. What does that do exactly? Is that something like a dynamic prompt in Forge, or is it something only Comfy's architecture understands?

    motimalu
    Author
    Mar 29, 2026

    Hello, this is referred to as prompt scheduling. The feature is present since the early A1111 days and usable in most A1111 derivatives.
    For example afaik currently Forge Neo supports it natively and can be used with Anima.
    It is not actually natively supported in ComfyUI, my workflows using it require the custom comfyui-prompt-control node.

    empek17Apr 3, 2026· 2 reactions
    CivitAI

    Model is great and beautiful.
    I hope that we get preview 2 version as it is still better at prompt adhersion with longer/more complicated prompts.

    motimalu
    Author
    Apr 4, 2026· 3 reactions

    Thank you, a new version based off preview 2 is training now

    stygianwizard42Apr 7, 2026· 1 reaction

    @motimalu preview 3 just got released

    motimalu
    Author
    Apr 8, 2026

    @stygianwizard42 looks nice!

    motimalu
    Author
    Apr 12, 2026· 7 reactions

    Preview 2 based version collapsed before converging on small details • ᴖ •
    Going to make some adjustments for Preview 3 training and finetune it instead

    suede2031691Apr 21, 2026
    CivitAI

    关心一下,你对Preview 3的训练尝试进行得怎么样了?可以指望它最近发布吗。

    你的 背景lora 已经用上了。现在期待这边的更新。

    motimalu
    Author
    Apr 21, 2026· 2 reactions

    一切进展顺利,其中也包含了的背景lora之类的概念。
    训练的最后阶段正在进行中,敬请期待。(机器翻译)

    Checkpoint
    Anima

    Details

    Downloads
    451
    Platform
    CivitAI
    Platform Status
    Available
    Created
    3/26/2026
    Updated
    5/25/2026
    Deleted
    -

    Files

    kirazuriAnima_v10AnimaPreview1.safetensors