Introduction
For version 1.0:
This model is based on 'Illustrious XL 1.0' with some minor modifications and was trained on the Danbooru2023 along with the dataset I previously used for training my LoRA models.
For version 2.0:
This developed model is intended to allow everyone to experience the v-pred version of Illustrious XL, instead of having to spend a large amount of STARDUST to unlock the Illustrious XL v3.0 v-pred and v3.5 v-pred versions.
I independently researched and developed this version based on various existing XL model architectures. However, due to the many modifications I made, I’m not sure it can still be considered 'Illustrious XL'.
The model was trained on the danbooru2024, danbooru_newest-all datasets, as well as a custom dataset (which I collected and labeled using natural language with GPT-4.5, and later manually verified by me).
I put a lot of time and effort into developing this version, so if you don't mind, please consider bidding on it so that others can use it through the CivitAI generator. Thank you all very much!
For version 3.0:
With this version, the model was created with the purpose of adapting to as many styles as possible, while also balancing detail stability in the generated images. This model includes styles and artist styles (from Danbooru and e621).
Although it is oriented towards being a pre-trained model, you can use it normally. However, to achieve optimization, I suggest you combine it with LoRA or fine-tune it to create the style you desire.
The model was trained on the danbooru2024, danbooru_newest-all datasets, e621 as well as a custom dataset, with 40% of this data annotated using both tags and natural language.
This model is an epsilon-prediction model that can easy to use.
For version 3.1:
This version improves the issues encountered in version 3.0. In addition, it also enhances image quality related to styles and artist styles (from Danbooru and e621).
This model was trained on the same dataset as version 3.0, but I re-annotated it, added many new anime characters, and improved the quality of existing ones.
The model improves stability when generating images at a resolution of 1536x1536.
This version will have two variants: one for v-pred and one for e-pred (the e-pred version will be released first).
For version 3.2:
This model is a refined version of 3.1, incorporating hotfixes and enhancements. It features improved detailing in the eyes and more accurate anatomical proportions for the character.
Additionally, the model demonstrates enhanced creativity and a better ability to accurately understand prompts
This model is also capable of generating images at large resolutions, e.g., 1024x2048 (I tested it and found the image quality to be quite decent). (Note: during training, I only trained it with images at a resolution of 1536x1536).
For version 3.5:
This model was trained on the Danbooru dataset, updated as of May 9th, 2025, with image sizes of 1536x1536.
It fixes an important bug that appeared in version 2.0 of the v-pred variant.
The model also improves stable style, anatomy, and prompt understanding compared to the previous version.
Important Note
This is the first base model I've created, so any feedback is welcome. Feel free to share your thoughts so I can improve it in future versions.
Version 2.0 is a V-prediction model (unlike epsilon-prediction), and it requires a number of specific parameters.
Version 3.0 should be set with a low CFG value, around 2 to 4. When you encounter images generated with high contrast (I don't know why CFG affect this, i will investigate and find the solution :v)
Currently, the model is not available for use via Civitai Generation. You can visit the following website to use it:
Suggested settings:
All example images were generated using the following settings:
Positive prompt: masterpiece,best quality,amazing quality
Negative prompt: bad quality,worst quality,worst detail,sketch,censor, simple background,transparent background
CFG: 5-7 (For version 3.0 i suggest you should set this lower from 2-4 )
Clip skip: 2
Step: 20-30
Sampler: Euler a/DPM++ 2S a
Note: I don't use any post-processing and Lora to enhance the example images. I only use these settings and a custom prompt with my base model to generate.
Acknowledgments
Thanks to narugo1992 and Nyanko for sharing such valuable data.
If you'd like to support my work, you can do so through Ko-fi!
Description
FAQ
Comments (6)
Struggling with V3. Haven't gotten one of my old v2 gens come out better. Maybe it is Reforge UI? The only Schedule type that doesn't complete explode is DDIM. Any thoughts?
Hi, for Version 3, I recommended to set the guidance scale (CFG scale) between 2 and 4 (When you encounter images generated with high contrast) for optimal quality. Please make sure to keep this important detail in mind. I have an example of images with different CFG
I'm not sure why but overall 3.0 feels like a downgrade to 2.0. Somehow its harder to use, and the artist styles are worse. I'll try it more and will add my findings
P.S It feels like it has lost the clarity from 2.0, everything feels hazy and less detailed (seems like a vpred thing). Not sure about this but it feels like prompt following also worsened
"v-pred is harder to use" bunch of idiots who need to learn how to use a good v-pred model now 3.0 is ruined because of it. Skill issue always ruining good things
Turning on zsnr for V2 version seems to impair its ability to generate very bright or very dark images. Even without zsnr it still has some trouble with high brightness, would that be fixed in the next v-pred version?
When training a LoRA with the V2 version, it collapses. Even with a low learning rate, it still collapses.















