Introduction
For version 1.0:
This model is based on 'Illustrious XL 1.0' with some minor modifications and was trained on the Danbooru2023 along with the dataset I previously used for training my LoRA models.
For version 2.0:
This developed model is intended to allow everyone to experience the v-pred version of Illustrious XL, instead of having to spend a large amount of STARDUST to unlock the Illustrious XL v3.0 v-pred and v3.5 v-pred versions.
I independently researched and developed this version based on various existing XL model architectures. However, due to the many modifications I made, I’m not sure it can still be considered 'Illustrious XL'.
The model was trained on the danbooru2024, danbooru_newest-all datasets, as well as a custom dataset (which I collected and labeled using natural language with GPT-4.5, and later manually verified by me).
I put a lot of time and effort into developing this version, so if you don't mind, please consider bidding on it so that others can use it through the CivitAI generator. Thank you all very much!
For version 3.0:
With this version, the model was created with the purpose of adapting to as many styles as possible, while also balancing detail stability in the generated images. This model includes styles and artist styles (from Danbooru and e621).
Although it is oriented towards being a pre-trained model, you can use it normally. However, to achieve optimization, I suggest you combine it with LoRA or fine-tune it to create the style you desire.
The model was trained on the danbooru2024, danbooru_newest-all datasets, e621 as well as a custom dataset, with 40% of this data annotated using both tags and natural language.
This model is an epsilon-prediction model that can easy to use.
For version 3.1:
This version improves the issues encountered in version 3.0. In addition, it also enhances image quality related to styles and artist styles (from Danbooru and e621).
This model was trained on the same dataset as version 3.0, but I re-annotated it, added many new anime characters, and improved the quality of existing ones.
The model improves stability when generating images at a resolution of 1536x1536.
This version will have two variants: one for v-pred and one for e-pred (the e-pred version will be released first).
For version 3.2:
This model is a refined version of 3.1, incorporating hotfixes and enhancements. It features improved detailing in the eyes and more accurate anatomical proportions for the character.
Additionally, the model demonstrates enhanced creativity and a better ability to accurately understand prompts
This model is also capable of generating images at large resolutions, e.g., 1024x2048 (I tested it and found the image quality to be quite decent). (Note: during training, I only trained it with images at a resolution of 1536x1536).
For version 3.5:
This model was trained on the Danbooru dataset, updated as of May 9th, 2025, with image sizes of 1536x1536.
It fixes an important bug that appeared in version 2.0 of the v-pred variant.
The model also improves stable style, anatomy, and prompt understanding compared to the previous version.
Important Note
This is the first base model I've created, so any feedback is welcome. Feel free to share your thoughts so I can improve it in future versions.
Version 2.0 is a V-prediction model (unlike epsilon-prediction), and it requires a number of specific parameters.
Version 3.0 should be set with a low CFG value, around 2 to 4. When you encounter images generated with high contrast (I don't know why CFG affect this, i will investigate and find the solution :v)
Currently, the model is not available for use via Civitai Generation. You can visit the following website to use it:
Suggested settings:
All example images were generated using the following settings:
Positive prompt: masterpiece,best quality,amazing quality
Negative prompt: bad quality,worst quality,worst detail,sketch,censor, simple background,transparent background
CFG: 5-7 (For version 3.0 i suggest you should set this lower from 2-4 )
Clip skip: 2
Step: 20-30
Sampler: Euler a/DPM++ 2S a
Note: I don't use any post-processing and Lora to enhance the example images. I only use these settings and a custom prompt with my base model to generate.
Acknowledgments
Thanks to narugo1992 and Nyanko for sharing such valuable data.
If you'd like to support my work, you can do so through Ko-fi!
Description
FAQ
Comments (15)
Illustrious XL 3.5 stands out with its ability to define composition directly through prompts. For example, "A girl with red hair on the right, a girl with blue hair on the left". In almost 100% of cases, you’ll get exactly that result, which is something the SDXL architecture can’t reliably achieve.
Does this model understand prompts like that too?
Nice job!
Sir, v2 of this model is awesome!
I need time for more testing but I can see it becoming my default model. If only Noob knowledge can be added it will become perfect. ChromaYume v3 is suffering from color burn issues and represents 2D styles worse. On the other hand this model is spot on for artists styles. I am actually not sure if Noob knowledge is even needed anymore.
Great
best checkpoint
wow this one is super good, quality is best of what Im used
A little comparison of v2 with Noob vpred as a reference anime sdxl model.
* This model can't generate pure black backgrounds, Noob can. I guess conversion from eps to vpred is not complete yet?
* This model tends to generate multiple copies of things mentioned in prompt. For example "holding spoon" may place spoons in both hands. "vase" will generate 2-3 vases all over the place etc.
* Overall prompt comprehension seems a little big weaker than Noob's.
* Noob rendition of pure flat color artist styles is more faithful. This models tends to add a little bit of 2.5D shading to them, especially with the presence of quality modifiers like "masterpiece" or "extremely aesthetic".
Don't get me wrong. The model is still excellent despite these shortcomings. I can recommend it to anyone: it is incredibly easy to use (unlike Noob), aesthetic, flexible, knows many styles and have little bias. And it can generate decent, sane backgrounds quite often too! With Noob it is almost impossible. I just wanted to mention things that can be further improved.
IllumiYumeXL v2.0 is a great model!
☆ I think it’s better than many models for its high resolution and stable eyes and fingers. The fingers not needing ADetailer is amazing😍👍
A nude or semi-clothed torso tends to look a bit long, but that’s a common and tricky issue with many models🫠
I saw reports in other communities that Vpred’s ZeroSNR doesn’t work well, but it makes colors too intense when turned on, so I don’t think it’s necessary. I’m really happy with the model as is.
Thank you!
Great work, I'm blown away at all the characters it knows, the hands are also the best I have seen. Bravo!
It was almost impossible to generate 2 similar characters like Kaguya Shinomiya and Yor Briar on one picture. Now it's possible (not always, but it works).
My model is now available on CivitAI. I hope to use it and share some amazing images here. Thank you so much!
i love it
Very good model that makes my GPU spin👍
V2 is based on NoobAI isn't it? It has a ton of e6 knowledge that only exists in noobAI models, and wouldn't be added by any of the additional training you claim to have done. Please label your models correctly.
Hello everyone, I've completed version 3.0. I also have a comparison table showing the effectiveness of the new version compared to 2.0 and some other models. Please note that I have switched this model to the epred format instead of vpred (since some people mentioned that vpred was harder to use than epred). This model is a pre-trained model, which contains many artist styles, styles, etc.... For easy to use i suggest you should prompt correctly or just tune this model to satisfy your style












