Model Introduction
This image generation model, based on Laxhar/noobai-XL_v1.0, leverages full Danbooru and e621 datasets with native tags and natural language captioning.
Implemented as a v-prediction model (distinct from eps-prediction), it requires specific parameter configurations - detailed in following sections.
Special thanks to my teammate euge for the coding work, and we're grateful for the technical support from many helpful community members.
⚠️ IMPORTANT NOTICE ⚠️
THIS MODEL WORKS DIFFERENT FROM EPS MODELS!
PLEASE READ THE GUIDE CAREFULLY!
Model Details
Developed by: Laxhar Lab
Model Type: Diffusion-based text-to-image generative model
Fine-tuned from: Laxhar/noobai-XL_v1.0
Sponsored by from:
Collaborative testing:
How to Use the Model.
Guidebook for NoobAI XL:
ENG:
https://civarchive.com/articles/8962
CHS:
https://fcnk27d6mpa5.feishu.cn/wiki/S8Z4wy7fSiePNRksiBXcyrUenOh
Recommended LoRa List for NoobAI XL:
https://fcnk27d6mpa5.feishu.cn/wiki/IBVGwvVGViazLYkMgVEcvbklnge
Method I: reForge
(If you haven't installed reForge) Install reForge by following the instructions in the repository;
Launch WebUI and use the model as usual!
Method II: ComfyUI
SAMLPLE with NODES
Method III: WebUI
Note that dev branch is not stable and may contain bugs.
1. (If you haven't installed WebUI) Install WebUI by following the instructions in the repository. For simp
2.Switch to dev branch:
git switch dev
3. Pull latest updates:
git pull
4. Launch WebUI and use the model as usual!
Method IV: Diffusers
import torch
from diffusers import StableDiffusionXLPipeline
from diffusers import EulerDiscreteScheduler
ckpt_path = "/path/to/model.safetensors"
pipe = StableDiffusionXLPipeline.from_single_file(
ckpt_path,
use_safetensors=True,
torch_dtype=torch.float16,
)
scheduler_args = {"prediction_type": "v_prediction", "rescale_betas_zero_snr": True}
pipe.scheduler = EulerDiscreteScheduler.from_config(pipe.scheduler.config, **scheduler_args)
pipe.enable_xformers_memory_efficient_attention()
pipe = pipe.to("cuda")
prompt = """masterpiece, best quality,artist:john_kafka,artist:nixeu,artist:quasarcake, chromatic aberration, film grain, horror \(theme\), limited palette, x-shaped pupils, high contrast, color contrast, cold colors, arlecchino \(genshin impact\), black theme, gritty, graphite \(medium\)"""
negative_prompt = "nsfw, worst quality, old, early, low quality, lowres, signature, username, logo, bad hands, mutated hands, mammal, anthro, furry, ambiguous form, feral, semi-anthro"
image = pipe(
prompt=prompt,
negative_prompt=negative_prompt,
width=832,
height=1216,
num_inference_steps=28,
guidance_scale=5,
generator=torch.Generator().manual_seed(42),
).images[0]
image.save("output.png")
Note: Please make sure Git is installed and environment is properly configured on your machine.
Recommended Settings
Parameters
CFG: 4 ~ 5
Steps: 28 ~ 35
Sampling Method: Euler (⚠️ Other samplers will not work properly)
Resolution: Total area around 1024x1024. Best to choose from: 768x1344, 832x1216, 896x1152, 1024x1024, 1152x896, 1216x832, 1344x768
Prompts
Prompt Prefix:
masterpiece, best quality, newest, absurdres, highres, safe,
Negative Prompt:
nsfw, worst quality, old, early, low quality, lowres, signature, username, logo, bad hands, mutated hands, mammal, anthro, furry, ambiguous form, feral, semi-anthro
Usage Guidelines
Caption
<1girl/1boy/1other/...>, <character>, <series>, <artists>, <special tags>, <general tags>, <other tags>
Quality Tags
For quality tags, we evaluated image popularity through the following process:
Data normalization based on various sources and ratings.
Application of time-based decay coefficients according to date recency.
Ranking of images within the entire dataset based on this processing.
Our ultimate goal is to ensure that quality tags effectively track user preferences in recent years.
Percentile RangeQuality Tags> 95thmasterpiece> 85th, <= 95thbest quality> 60th, <= 85thgood quality> 30th, <= 60thnormal quality<= 30thworst quality
Aesthetic Tags
TagDescriptionvery awaTop 5% of images in terms of aesthetic score by waifu-scorerworst aestheticAll the bottom 5% of images in terms of aesthetic score by waifu-scorer and aesthetic-shadow-v2......
Date Tags
There are two types of date tags: year tags and period tags. For year tags, use year xxxx format, i.e., year 2021. For period tags, please refer to the following table:
Year RangePeriod tag2005-2010old2011-2014early2014-2017mid2018-2020recent2021-2024newest
Dataset
The latest Danbooru images up to the training date (approximately before 2024-10-23)
E621 images e621-2024-webp-4Mpixel dataset on Hugging Face
Communication
QQ Groups:
427280545
677964513
852429527
914818692
635772191
870086562
Discord: Laxhar Dream Lab SDXL NOOB
How to train a LoRA on v-pred SDXL model
A tutorial is intended for LoRA trainers based on sd-scripts.
article link: https://civarchive.com/articles/8723
Utility Tool
Laxhar Lab is training a dedicated ControlNet model for NoobXL, and the models are being released progressively. So far, the normal, depth, and canny have been released.
Model link: https://civarchive.com/models/929685
Model License
This model's license inherits from https://huggingface.co/OnomaAIResearch/Illustrious-xl-early-release-v0 fair-ai-public-license-1.0-sd and adds the following terms. Any use of this model and its variants is bound by this license.
I. Usage Restrictions
Prohibited use for harmful, malicious, or illegal activities, including but not limited to harassment, threats, and spreading misinformation.
Prohibited generation of unethical or offensive content.
Prohibited violation of laws and regulations in the user's jurisdiction.
II. Commercial Prohibition
We prohibit any form of commercialization, including but not limited to monetization or commercial use of the model, derivative models, or model-generated products.
III. Open Source Community
To foster a thriving open-source community,users MUST comply with the following requirements:
Open source derivative models, merged models, LoRAs, and products based on the above models.
Share work details such as synthesis formulas, prompts, and workflows.
Follow the fair-ai-public-license to ensure derivative works remain open source.
IV. Disclaimer
Generated models may produce unexpected or harmful outputs. Users must assume all risks and potential consequences of usage.
Participants and Contributors
Participants
L_A_X: Civitai | Liblib.art | Huggingface
li_li: Civitai | Huggingface
nebulae: Civitai | Huggingface
Chenkin: Civitai | Huggingface
Euge: Civitai | Huggingface | Github
Contributors
Narugo1992: Thanks to narugo1992 and the deepghs team for open-sourcing various training sets, image processing tools, and models.
Onommai: Thanks to OnommAI for open-sourcing a powerful base model.
V-Prediction: Thanks to the following individuals for their detailed instructions and experiments.
adsfssdf
madmanfourohfour
Community: aria1th261, neggles, sdtana, chewing, irldoggo, reoe, kblueleaf, Yidhar, ageless, 白玲可, Creeper, KaerMorh, 吟游诗人, SeASnAkE, zwh20081, Wenaka~喵, 稀里哗啦, 幸运二副, 昨日の約, 445, EBIX, Sopp, Y_X, Minthybasis, Rakosz, 孤辰NULL, 汤人烂, 沅月弯刀,David, 年糕特工队,
Description
亲爱的朋友们,
Dear NoobAI fans,
我们收到了许多反馈,大家都在期待一个更出色的EPS版本。因此,本周我们带来了一些新变化。我们对训练逻辑进行了优化,并继续训练了EPS 1.0版本额外的2个epoch。针对1.0版本中发现的问题,我们进行了针对性的优化,提高了优质图像的产出率和对极端色彩的表现力。
We have received a lot of feedback from you all, expressing a desire for an even better EPS version. Therefore, this week we are introducing some new changes. We have optimized our training logic and continued to train the Epsilon-pred 1.0-Version for an additional 2 epochs. We have addressed the issues identified in the 1.0 version, improving the production rate of high-quality images and the representation of extreme colors.
同时,我们按照V预测版本的标准,增强了自然语言训练,并扩充了额外的训练集。另外,IPA的初始版本已上传至Hugging Face,后续我们也会将其发布到模型站。敬请期待!
At the same time, we have enhanced natural language training and expanded our additional training sets according to the standards of the V prediction version. Additionally, the initial version of IPA has been uploaded to Hugging Face, and we will also release it on our model station later. Stay tuned!
FAQ
Comments (183)
Didn't expect an update for the EPS version, nice and thank you
biggus
NICEE
It was better than Pony, now it's leaving pony on the goddam mud, congrats
Yup this model is amazing
Adding "light particles" to negative really improved one of my gens.
SD用不了吗?怎么生成全是色块
需要开v预测
怎么开启啊,有点没看懂说明
我也是醉了,网上找不到任何关于开sd V预测相关的东西。
作者就不能好好做个说明?
中文说明就轻描淡写的一句需要额外插件。
具体什么插件也没说。
搜dev 和 v-prediction 什么都没有。
难道现在已经进入全民comfyui时代,就我在玩webUI
@shoree 方法三就是给webui写的,定位到仓库目录然后git切换到dev分支
Hi, 很抱歉V预测不易上手带来麻烦。这是用户手册,其中包含了V预测的详细使用方式:https://d0xb9r3fg5h.feishu.cn/docx/WWOHdr6RMoQZxQxCZRGc5KlEnUi?from=from_copylink
如果您仍然遇到了任何困难,请继续联系我!
我自己的电脑上的dev已经切好了。
云端的还没搞定,autodl要如何打开终端切换dev?
相比于Pony1.0,pred1.1在多人构图与细节把控上更加稳定(:3[____]
After some comparison testing, v1.1 does seem less saturated than v1.0 (perhaps a little too much in some cases), but prompt adherence and anatomy are still pretty low compared to cyberfix.
still cant get it: i use Forge, Euler, 4-5cfg, all recomended settings in general, but my gens are very bad, why?
Same here. I'm using SwarmUI and nothing good is coming out of this model. I don't know what's the hype about, this is not even close to Pony.
@k1logr4m it knows far more knowledge than Pony, it is probably something to do with your guys settings. Try copying someone's output that displays on civitai and work from there.
Hi, could you please share your generation parameters? Maybe I can help you take a look. Just copy the content below the image generated by forge.
@Euge_ Steps: 35, Sampler: Euler, Schedule type: Automatic, CFG scale: 4, Seed: 4280384847, Size: 1152x896, Model hash: dfdae6eac2, Model: noobaiXLNAIXL_epsilonPred075, ADetailer model: face_yolov8n.pt, ADetailer confidence: 0.3, ADetailer dilate erode: 4, ADetailer mask blur: 4, ADetailer denoising strength: 0.4, ADetailer inpaint only masked: True, ADetailer inpaint padding: 32, ADetailer version: 24.6.0, Version: f2.0.1v1.10.1-previous-626-ga332f7cc, Module 1: sdxl_vae
It seems to really struggle without artist and character specified in prompt? Like reaaally struggle.
@jaimuh731354 Since you're using epsilon prediction version, I recommend to use Euler A with 5~7 cfg scale, instead of Euler.
In addition, since you didn't share the prompt information. If the aesthetic quality of the generated image is low, you may need to add the quality tag like "masterpiece" and the aesthetic tag "very awa" into your prompt (you can think it as pony's score_9, etc.)
@AkioAI I did notice it understands a ton of concepts from booru tags, like Illustrious does. But in the quality department it falls short. I will keep playing around with it, maybe I just don't know how to use it.
@k1logr4m Yeah, at first that was me for Illustrious, it took me some time to figure out how to use the model. All you need to do is use other people's settings and change it to feel right for you.
If you want a specific style, you can find a lot of amazing creators who have made styles based on NoobAI (you can probably use artist tags before using a style lora, it is likely the model already knows the artist). NoobAI knows better concepts and composition than Illustrious, at least when I was comparing the two models. You just need to find the right prompts and negative prompts to use the full potential of the model. Good luck 🙏
@AkioAI Alright, thanks for the advice.
I'm tech illiterate, what is the difference between V and Epsilon versions?
V-Pred offers higher contrast (brighter/darker) and impressive lighting, but it requires limited set of samplers/schedulers and different parameters.
v1.1 is amazing. Huge improvement. I feel no need to "fix" it by merging with IterComp anymore. I hope you can afford to continue training. One month more to learn low-count tags and this model become god-tier.
Here's my comparison test between EPS 1.1 and V-pred 0.65
Comparison test between EPS 1.1 and V-pred 0.65
EPS version by nature has worse understandings of light, color and composition compared to V-pred version. The only obvious pros I found is it's much clearer than V-pred series in terms of line art and complicated eye styles, maybe using EPS model as a tool for detail fix is a considerable choice?
I still recommend the combination of V-pred 0.65 (base image) + V-pred 0.5 (detail fix) to generate images.
Highly depends on sampler. You are stuck with euler for vpred and it gives mushy texture and messy lines, while eps produces clear and clean image. So, depending on artstyle you want to get eps may be way better.
LEEETS GOO EPS UPDATE 🔥🔥🔥🙏🙏
the provided images look insane, ill give it a shot today
Can anyone tell me what tags to use to set the low light in the room? Simple night does not help, sticking all sorts of lights that are not in the prompt. I don't want to clog up the negative...
try adding "dark" to positives
@SakanakoChan Periodic success
@anifibous try "chiaroscuro"
dim lighting, night time, dark,
you can also put tags in brackets for emphasis:
(dim lighting, night time:1.2),
For outdoor scenes you can mention tags like:
stars, moon, dark sky,
Would recommend experimenting.
Noob 1.1 is all well and good, but vpred 1.0 when?
>Load NoobAI.
>Load 10 LoRas.
>Write an essay.
>Load 1 other checkpoint for Hires fix.
>Load 1 other checkpoint for tiled upscaling.
>Load 1 other checkpoint for inpainting.
>Takes ~10 mins.
>Realized I can get similar quality by just using the *cough* model, a short prompt, and it took me 20 seconds.
isn't that the pony experience?
skill issue
cough model? what model?
You know that both models work different, right? Illustrious has a much better sentence understanding and is not locked to tags. Not to mention we toy with trained version of 0.1, so as soon as the model really comes out, it will blast the current PONY models away.
There's no native clown reaction, so you'll have to put up with the inline version, here you go - 🤡
That's what he's saying... OP is trying to say "I did a bunch of work to try to get good results. This model is so good that I didn't need to do that extra work, I just didn't know when I started."
3/10 rage bait
you mean pony that was too scared to have artist tags?
It takes 5 tags, 1 artist tag, 2 negatives. skill issue lil bro :)
There are three significant upsides with NAIXL over PDXL.
Firstly, you don't have to bloat your prompt with that lengthy score_9, score_8_up, score_over9000, score_score_over_score bullshit in two prompts. These mandatory tokens are a HUGE issue for PDXL because it eats away a lot of tokens and dissects model guidance way too much, making any attempt to prompt in natural language futile. Even if you only use tags, the shorter your prompt is the better it's going to be followed. NAIXL on the other hand is prompted precisely like AnimagineXL, so it doesn't require a lot of slop and it has decent natural language understanding. Not FLUX.1D level of natural language understanding ofc, but it's among the best for SDXL anime models.
Secondly, NAIXL, especially the V-pred series, handles drawings significantly better than Pony. It's not like Pony doesn't understand styles, it does, but it takes more to explain that, and the foundational base of the new NAIXL model clearly is more capable for that purpose. Illustrious just diffuses illustrations much better.
And finally, Pony is biased to ponies. Who would have thought! I mean, I don't judge, but that's really not my thing, and it takes a lot of tinkering with negative prompt to make sure it doesn't summon some banging furries at the background when you want a mere SFW illustration. Thankfully it obeys to the prompt well enough, but it only aggravates the issue with mandatory prompt even further.
@Erilaz Idk, last point is applicable to this model too. Not pony, but it clearly loves generating animalistic stuff on certain prompts. Especially with no humans tag and no artist tags.
Also you forgot to mention black screens for pony, because it's latent is more like a minefield
@Kellenok not even there lil bro. 99% of people are just borderline 1/10 at prompting.
Hello! Can you tell me a little bit about this? How do I set up a prompt to create two or three characters with different traits in the same scene?
I am currently creating it this way, but the character traits get mixed up and there are two of the same character.
I am creating them like this.
Number of people, character A, character A's clothing and physical characteristics, character B, character B's clothing and physical characteristics,
scene/environment/camera angle, action, expression, items, quality prompts
Translated with DeepL.com (free version)
This is sdxl based, it struggles with more than two. Easier to get ok composition then inpaint.
@Volnovik Thanks for the reply.
If you know the composition, please let me know.
Would it be better to use something like Latent Couple?
@kansann11 Composition is up to you :) Just roll until you get something satisfactory. Also you can use controlnet for that.
Check bunch of extension that I typically use, they generarly increase prompt adherence. They are explained in my article. Also be sure to use my detailer lora trained specifically on noob, I'll load eps1.1 version today.
I can't get this to play nice with onsite gen. I just get jank:
https://orchestration.civitai.com/v2/consumer/blobs/AGPR36G8TZBVJK23RHGD8S4ZF0
Tried high cfg, low cfg, lots of steps, fewer steps, Euler, Karras... What am i doing wrong?
Hi, could you please share your generation parameter? Such as prompt, sampler, scheduler type, steps, etc. So that I can help you.
@Euge_ Thank you. I was running this through the civitai generator - Rating_explicit,(masterpiece, best quality, highly detailed), very detailed face, close up, (UHD quality details), 1girl, elf, depth of field, 8k uhd, absurdres, volumetric lighting, tack sharp focus, beautiful composition, intricate detail, professional illustration
Negative prompt: (extra fingers), (bad hand), (worst quality:1.2), (low quality:1.2), simple background, censored, lowres, low detail, blurry, watercolor, monochrome, bad anatomy, bad proportions, deformed, deformed anatomy, deformed fingers, text, caption, 3d, cgi
Steps: 40, baseModel: SDXL, quantity: 3, engine: undefined, width: 832, height: 1216, Seed: 956470018, draft: false, nsfw: true, workflow: txt2img, Clip skip: 2, CFG scale: 4, Sampler: Euler, fluxMode: undefined, fluxUltraRaw: false
Having experimented a bit more, it does seem to give better results with a more complex prompt. Still get a weird smeary visual effect though. Am i overcooking the steps? 40 for euler has generally worked in the past...
@SimianHose Hi, which version are you using as the base model? Also, could you try different prompts? For example, different positive/negative prompts.
Also, you can try adding "very awa" to the quality prompts and adding "worst aesthetic" to the negative prompts.
@Euge_ I'm using v-pred-0.65S - it's the only version available on-site. Thanks for the advice, i'll try playing around with the prompt a bit more, as it sounds like my settings are basically ok.
@SimianHose Sorry I overlooked that only 0.65 works in on-site generation. With that in mind, you can try the Euler A sampler with 4~6 cfg.
@Euge_ Ok, thank you! I'll give that a try. Thanks again for your assistance with this, much appreciated.
Cannot download it in stability matrix or use onsite, only old versions are available. Something is missing in permissions.
damn bro, miquella!!! really!!! From this position I see his kind side.. very well. You blackmailed me, well take my money, my pants, underwear, you can take everything... great job bro... great job
Very impressive model. I'm glad we're finally moving past the Pony era and into more flexible anime-focused models.
I hope more focus can be put towards fixing the issues with vpred, epsilon is just ugly to see in comparison now
Don't worry, I'm doing my best to experiment and research. The latest version has solved the residual noise problem and most of the oversaturation problems. It will be released after testing.
@Euge_ thank you euge <3 I look forward to your improvements
Looking forward to vpred 1.0
I have trained LORA on more than 15 characters with vpred 0.6, and I can’t go back to epsilon.
我已經變成了vpred的形狀了
These models work differently from regular SDXL and it's confusing when the Noob Loras are muddled with the regular SDXL ones. I tried my non-Noob LoRAs (I haven't trained any on Noob) with one of the V-Pred versions out of curiosity and the results were weird and broken. I heard some people say this should get its own category like Pony and Illustrious did, any chance of that happening?
It's up to Civitai to decide whether or not it gets its own category. I think Illustrious got its own fairly quickly so I don't see why they wouldn't do it soon
It probably will once the full release is out
Man, what the fuck do "Epsilon," "Pred," "V," and the number versions even mean? What the fuck is this pretentious naming convention bullshit? Just tell me straight: which one is good for nsfw?, and which one should I use as a base model if I want to train a LoRA on it?
https://rentry.org/wtfvpred
they're all good for nsfw and you can use either as a base model for lora, but loras for one may not work well for the other since epsilon and vpred are distinctly different things that require different settings in both generation and lora training. its important they named them this way because you need to know if a model is vpred or not to use it correctly.
epsilon is what almost every other local model you've probably used is, and its why you get "sepia" "yellow/brown tinge" and grey instead of black/white. vpred is what NAI (as in the SaaS model, not noobai) uses. current noob vpred has some issues and may not be as stable, they are being worked on. I wouldn't bother with loras for vpred unless you plan to re-do them once the final version comes out. you should always bake a lora for the model you intend to use the lora with; bake it on the version of epsilon you plan to use if you're using epsilon, the version of vpred you plan to use if you're using vpred. wait til training is finished and the ecosystem settles if this is still too much for you to process
These are technical terms that refer to the way the model produces the image. Epsilon is an "older" method that predicts the noise added to the image during the original forward diffusion process. When the model is trained, it learns how much noise is added to a latent during each timestep, so it can estimate how much it should remove during inference to produce the desired image. This is literally "stable diffusion".
V-pred is velocity prediction. Instead of only predicting noise, it predicts velocity, which is a combination of the original image used during the training process plus the noise added to it. This increases stability during inference, which results in higher quality images and better convergence (less steps needed when using non-ancestral samplers), as well as better noise schedulers and sampling methods. This is why the sample images provided by NAI for these models have improved contrast, richer colors and more coherent image composition compared to Epsilon examples.
@scamm i don't think he is going to give a shi what u said 🤣
@Y_X maybe not, but hopefully others that want to know will read it or Hasoo's comment. I tried lol
Hi, you may refer to this: https://d0xb9r3fg5h.feishu.cn/docx/YpOQdtHTDoetcZxIO9fc33onnee?from=from_copylink
vpred - better colors and other things
epred - speia pred makes model yellow and grey
by pred they mean prediction of noise in training v does velocity of noise and e(epsilon) does noise
for now vpred is not stable and will be updated each week, their naming is version of model (its bad yeah) basically 1 is final model and 1.1 is aesthetic tuned in, 0.<number> basically means what percent of training is done 1 is fully done.
Might learn to read lil bro.
"pretentious" what is pretentious is believing everyone is as dumb as you
Hi, everyone! I drafted the NoobAI-XL User Manual , which you can browse here: https://d0xb9r3fg5h.feishu.cn/docx/YpOQdtHTDoetcZxIO9fc33onnee?from=from_copylink
Please note that this manual is still in the writing stage and may contain errors. Thank you for your understanding!
not sure if appropriate to ask here but is there somewhere to sign up for access to the gated models like the "Laxhar/noobai-XL-Vpred-experiments" or other similar ones that have been locked for a week+ before releasing in an open state?
@fizalpher experiment models are early models only testers (people helping euge with vpred) have access to
rest are uploaded on civit already or you can access without any permission (like vpred 6.5 without mix)
@fizalpher Thanks for your enthusiasm! Actually, it's just a repo for sharing very experimental checkpoints with a few people who are related to the recent V prediction experiments. So please just ignore it lol.
Hi, thank you for the document. I was wondering about the aesthetic tag "very awa", was it trained from the very start of the model's training?
I would love to have some help on which nodes I should be used in comfyui for v pred
@ViperF ModelSamplingDiscrete, set it to v-pred and turn on ztsnr
@kgl57944453 Thank you. Got it. I'm assuming after the Checkpoint but before the LORAs and Sampler.
@FondantViper Yep, make sure it connects to everything that needs the model node from it, if you're using facedetailer for example
@kgl57944453 It was introduced from 0.75.
gotta keep the legacy of asking civit to give this model its own category every noobai version release
how about to add Sankaku Channel/gelbooru data?
I kinda agree. Danbooru has been removing images/tags due to complaints. Its filter is just more restrictive in general (though that can be a good thing for example to filter out double postings). In terms of tagging, gelbooru has somewhat more diverse vocab (though maybe less accurate?)
Though I think to use gelbooru dataset you can't use danbooru dataset as there is significant overlap.
Agreed, danbooru is a little outdated.
Sankaku Channel and Gelbooru are definitely more up-to-date and standard.
@sonnenkind Gelbooru is basically Danbooru but gives you a worse training dataset. Sure it's less restrictive, but it was literally always seen as the second site to dump images on if your stuff doesn't get approved on Dan to begin with, or can even be posted on Dan. Yes sure, you're able to have access to more artists, but unless if they are a banned artist that actually has a really good artstyle, expect to have a lot of garbage mixed in. I'd recommend just training a LoRA on banned artists if you're interested in their styles instead of needing to pull in all the crap with Gel.
@softtaco1712 Honestly man... I sincerely hope you're joking when you suggest Sankaku Channel. There is literally no image moderation at all on Sankaku, so anyone can dump anything onto there. They literally changed the post number system so that people wouldn't be able to see how many posts go unapproved (not that it matters because posts are never autodeleted when not approved unlike Dan) so that people wouldn't see how much goes unapproved, not to mention the tagless posts that may as well not exist. It's worse than Gelbooru to train stuff with. If you want a garbage dataset, go right ahead and train a finetune with Sankaku and see how well that goes, especially because AI people dump a lot of pure shite onto there.
Sankaku does have a pretty decent scoring system though, but the problem I see is the severe rate limiting on their API which would likely hamper any attempts to pull images/tags on this sort of scale and I'm still not sure if it's the best option for an additional data source either way.
That said, the issue I see with Danbooru is that it generally does not allow content that isn't broadly defined as 'anime', and while Noob mixes e621 to give furry data to the model, we're still missing 3D and other more western content so it'd be nice to have some other data source to cover that in the same way Pony did.
I never heard of Sankaku before so I went to check it out. I have 2 adblocks and I still see NSFW ads, while I can't view ANY artwork, innocent pictures are rated R15+. Who built it like that?
@Stinkek I haven’t used it actually in awhile. Wow. They are money hungry to say the least.
If you log in with a free account, you can still view most work.
@lopi9999 you’re right tbh, haven’t used the site in awhile but wow they’re really hungry for ai content now huh? It looks like that’s what they’re trying to get into the business of, at least.
@DrA_263 That is exactly the reason I originally suggested Sankaku. They have a pretty decent ranking system (by quality) and a more wide and diverse pool of artstyles than danbooru. Scraping data off of it would likely be a nightmare though as it has lots of paid requirements. The site seems much worse off than a couple years ago, tbh.
Any VAE recomendations for EPS 1.1 guys? Or it doesn't need VAE at all?
NO NEED ,BAKED VAE IS OK.
Evening sir, if you don't mind a simple inquire: how does the last model fairs against Ipadapter and Controlnet while using their respective SDXL models?
Hy
Wow
Is it correct to understand that Epsilon pred 1.1 is a model for SDXL and illustrious uses Epsilon pred 0.75?
both are SDXL. V-pred & epsilon pred are just different ways of dealing with noise. In theory when V-Prediction is more stable/better that it should output better results, but for now Epsilon Pred 1.1 is better... And to emphasize, all of them use Illustrious as a base, finetuned off of that to start.
No they all based illustrious
@zeseren and illustrious is on SDXL.
I have noticed the model struggles with some concepts, things that could have easily been reproduced with SDXL, for example: "1950s Diner". I wonder if it's because it relies mostly on danbooru tags that it doesn't have as much SDXL data in it? Or maybe I haven't found the right tags? Still I never had issues creating some scenarios before.
Either way, I wanted to point that out, some concepts and creativity could be limited due to danbooru tags, still tho, I have been able to create some great illustrations overall. I wonder if the model can improve on that in the future, plus updated characters/series/styles data. 🤔
Nonetheless, this model is definitely fun to use and it's worth it for so many other reasons!
What characters/series/styles data are missing? I've done some pretty intensive testing across the NoobAI models and they all have been more versatile than most Pony or SDXL models I have used. This most recent iteration seems even more so. Seems to handle natural language prompting much better than previous versions, so I am not sure that the danbooru tag training is empirical evidence of lack.
@mewtsy Don't get me wrong, the amount of data this model has is up there with the best if not the best atm. But for what I read, the Databased used it's like 6 months old aprox. It has a lot of characters I wasn't expecting like Wuthering Wave characters, but it's missing some characters from ZZZ like Piper, which was a launch character, for example.
It's just a comment, that it would be cool if the characters/series/styles database could keep updating, making this model the model go for and also that would require less Lora production, since based on some peoples mentions, it seems it doesn't play well with SDXL, Pony or even some Illustrious Loras.
Regarding Natural Language use, I haven't had that much luck getting exactly what I want many of the times. I feel I get better results just using Danbooru tags instead mostly. But it could be just the way I wrote it, even so, I feel some concepts are missing in general, which is a shame. I gave the example of an 1950s Diner, I can replicate it with Danbooru to some extent, but I need to name a lot of things, like the checkered floor, stools, etc etc.
It's good
Neither the "v-pred" nor the "eps" version seems to recognize the tag of the artist "mochi_(circle_rin)".
mochi \(circle rin\) <- Try this format
don't use underscores. as stated above the correct format is mochi \circle rin\), euge's user manual would be good to read through: https://d0xb9r3fg5h.feishu.cn/docx/YpOQdtHTDoetcZxIO9fc33onnee
"Prompts should not contain any underscores "_". Influenced by websites such as Danbooru, the use of underscores "_" instead of spaces between words as tags has been circulated, which is actually a misuse and will cause the generated results to be different from using spaces. Most models, including NoobAI-XL, do not recommend including any underscores in prompts. This misuse can affect the quality of generation at best, and even make the trigger words completely invalid at worst."
@Hannibal_Lecter very thankful
@fizalpher understand. It seems that I am not careful enough. Thank you for your patient answer.
@Hannibal_Lecter @fizalpher Just tried, according to the rules in the manual after adjustment is still not recognized.
@Black130516 the artist needs to have at least 80 high quality artworks typically, also unsure if the "loli" content was trained, most of mochi's artworks aren't visible to check quality on danbooru without gold so can't confirm.
@fizalpher What about hyphens? I usually use those instead of underscores to keep unrelated words that form a phrase from being individually processed, or are spaces still better for this?
@bigredguy2 Use spaces unless specifically the danbooru tag uses a hyphen. Words shouldn't combine, use commas to separate and go full tagging. NLP begins with a capital letter and ends with a period if you add that in.
muk_(monsieur) will not work either, this tag will force the muk of the pokemon to be incorporated.
Should the base model be set to XL? It makes using Illustrious loras impossible when using onsite gen.
Have you tried to see if your concept exists without LORA? NoobAI has full danbooru training for 34 epochs and you shouldn't really need to use a LORA. If it's the case you still need to, then the LORA needs to be trained for NoobAI as it's gone through a lot more training and has a bit different formatting than Illustrious.
@fizalpher This was just testing a LoRA I made of a synthetic, consistent character(s). Mostly for me to learn how IL works at all. Formatting I don't know at all yet, alas, hence me trying these things
I need help with this model. How to prompt it? I know it's more prompt sensitive but how much? Most pony models just do "ok" without a lot of prompts so its new for me. Also no "score" tangs. I was trying to do something but nothing "good". Im using 1050 ti and I can't test a lot of things at the same time.
Best practice, is to read the model guide, it includes recommended starting prompt, schedulers, dimensions, etc. Also avoid using underscores _ . Search Danbooru and e261 for tags similar to the concept you are trying to build. In my experience, avoiding natural language (sentences) works best. Other than that, I guess just look at other people's prompts, loras, etc. . . . Kind of difficult to gen without experimentation, considering that's the nature of these models (near limitless combinations of possible images).
@mewtsy The "natural language captioning" seems to be a scam from the word diarrhea tag salad prompts I've found so far on decent gens... there's also esoteric quality tags like "very awa" or "year 2024" strewn everywhere. The generation guides I've found so far are by and large useless and uninformative on actual prompts. Basically, this model seems to have been trained by someone who was stroking out, good fucking luck prompting it without a ton of experimentation.
Hi, for your reference: https://d0xb9r3fg5h.feishu.cn/docx/YpOQdtHTDoetcZxIO9fc33onnee?from=from_copylink
Wonder why you trained the tagging in \(XXX\) format?
Parenthesis in WebUI is basically a multiplier, makes the prompt take more attention, so (text) = text x 1.1
Adding / helps keep it as just how we use parenthesis in normal text
@darionk You're supposed to use \(backslashes\) to escape parentheses, not forward slashes. But otherwise you are correct. One thing to note though is that when training a lora, image captions with parentheses do not need backslashes, they are only used for escaping in prompts.
@CosmicElement thanks! That makes a lot sense. But why they didn't use other marks since Parenthesis is a multiplier
@XDDDAI Because they are using Danbooru database and tags, the site has existed for a long time, changing all () will take way too long, so using \ is faster. Also, this is more of a software thing, they decided to use () as a multiplier, they didn't thought people would use Danbooru tags to create prompts at the time, well that is my assumption.
And yeah @CosmicElement I made a typo, but since you corrected me, I don't see the need to edit my comment, the conversation wouldn't make sense lol
@darionk Thanks for your patient reply! Yeah that makes sense, however I feel it's not a hard thing to change that in tagging automatically since they are all text files...I had this complaint is because \ is also a special mark in coding...yeah but nvm, they already did a great job training this checkpoint
Anyone know how to do furries for this on site? Recently its been like 1/9th to 2/9th of my gens are actually a furry even the ones I have uploaded are lucky shots. Even though this should be trained on e621 tags trying to get something like a furry horse seems impossible. Removing anthro/furry/mammal from the suggested negs provides nothing and removing the loras in case of lora influence but no dice.
You can use the Keyword " BREAK " follow by the animal you prefer.
Example:
masterpiece, best quality, newest, very aesthetic, source_furry, Equine, horse, anthro, park background, BREAK furry female, horse, big breast, wearing a dress, in the city
if your prompt is long, you may prefer to use the example below:
masterpiece, best quality, newest, very aesthetic, (source_furry, Equine, horse, anthro), beautiful background, BREAK (furry female, horse), big breast, wearing a dress, lost in the city
Avoid all danbooru tags like 1girl which signal to the model "this is anime." Instead use e6 tags like solo, female. Adding a furry artist lora of any kind even on low strength will also be enough.
I can't help you :D I have the furry tag banned globally on this website lol
@Technoyote This. Putting typical danbooru tags in negatives can also help. Whether or not furry as a tag even works in negatives or positive is hit and miss because danboorus own furry tag has quite a lot. I've put furry in negatives before and actually had humans turn into furries.
@SexyUmbreon "source_x" won't do anything in particular, that's pony specific.
@Speedhoven Good to know lol I usually run test "source_furry" on different checkpoint that I often forget, that not all checkpoint needs it. Not just that, but I'm also a noobie playing around and learning new things about the different versions too
This is my favorite model I’ve ever used.
Its versatility is endless, and the need for Loras is so small due to how adequate the prompt adherence is. There are a few issues though as it’s still in alpha!
1. Danbooru tags seem to be weighted more heavily than e621 tags. If you use danbooru tags, it feels like the model tries really hard to omit the e621 dataset. Idk if it’s not fully incorporated yet, but it’s definitely something I’ve noticed. It would be nice if the danbooru and e621 tags could be merged where appropriate, too. A trick I’ve been doing to force adherence is using alternating prompts for any tags that are different between e621 and danbooru. Example: [furry male|anthro male]. It seems to help somewhat.
2. Character concept bleeding. In particular, because this model prefers to use both the character and the series/copyright together to generate characters, there is often concept bleeding ESPECIALLY with the E621 versions. When a less popular character is prompted for, the more popular characters from that series will start bleeding into the character you’re prompting for. This can sometimes be fixed by putting other characters from the series into the negative prompt, but it can lead to image burning/artifacts quickly.
3. The CFG is something I’m still struggling to get, it feels like black magic. I think part of it is that the e621 and Danbooru datasets are not weighted evenly, so sometimes low cfg actually grants higher prompt adherence with danbooru only tags. I’m still trying to figure it out. I still recommend a cfg rescale to help your output images.
Altogether though, I’m very excited for this model to fully release. The V Pred model, even as it is, is preferable to me over the others.
yes ive realized that too, it was a thing with pony as well. for example "bocchi the rock!" is way more biased towards generating gotoh hitori rather than any of the other characters.
tagging "bocchi the rock!, yamada ryo" sometimes generates ryo with pink hair, unless "blue hair" is added to the prompt, etc, etc
also, less popular artists do required loras to accurately replicate
@Hugs288 That's interesting! I usually use character Loras for Pony because it doesn't seem like Pony has super great adherence to a lot of characters I had tried to prompt for.
For sure, loras will always have a place! With Pony though, I feel like loras are essential especially for art styles. That said, nothing will quite compare to a Lora to really get the best effect. But, if you can simply get a character from a prompt with ~90% accuracy, that's definitely a more desirable method for me personally than needing to download loras.
@softtaco1712 yeah most characters dont need a lora or anything, usually just the character name is enough, if it isnt strong enough you can add copyright/descriptive tags etc. but only about a couple hundred artists can be replicated somewhat accurately by just tagging.
pony was so insanely overcooked even with lora it struggled sometimes lol
Great model, im surprised with how it can understand my prompts
Would it be possible to write some kind of explanation of the different versions? I'm new to this model and I'm a bit confused as to which version is best / what I should use
from my experience with these AI models 99% of people creating these models have no idea what they are doing, no one knows actually not even the people from big tech AI companies know, and here 90% of creators will spam the website to be on the recent list adding the same model with minimal changes. the best approach is to look at the images OP added, and those from other users. If you can afford the time and the bandwidth download and test the models you are interested in. The result may be either the model keeps ignoring your prompt and just generates the same images because it was "burned" to death by OP or the model will generate amazing images that neither OP or other users got as result.
Hi, you may refer this:
https://d0xb9r3fg5h.feishu.cn/docx/YpOQdtHTDoetcZxIO9fc33onnee?from=from_copylink
@Euge_ Thank you, this is just what I needed!
I have a problem: When using an upscaler model to enlarge an image, the image will look gray. Is this normal? Or is there a better way to enlarge the image and improve the details?
for me it's look gray when i using hires. when i using tiling upscale all is ok
It’s not very serious, the picture looks a little gray, loses some saturation, and has a smeared feel. It only feels like a 10% change.
@Flange yeah, it’s about the same for me
Yeah. Sometimes jacking up CFG a little bit on 1st pass only or 2nd pass only helps.
I also use the "vivid" tag to try and squeeze more colour out sometimes.
Hey, I've been trying to get the v-pred version to work. Using SD.Next (automatic1111 fork). When manually changing the prediction method to v_prediction the results look almost right. But end up blurry, and/or with weird discoloration like a gas puddle. I've copied the settings in description, rest are defaults.
經過數天的通宵嘗試我放棄了⋯嘗試的結果告訴我eps1.0要更好於eps1.1版本。
首先1.1版本不知道什麼原因 在光線上會更偏向正面打光,即使使用背光tag加強,還是會出現一樣的結果。即使想辦法強制變成背光構圖,陰影部位也偶爾會畫壞掉,變成和臉的陰影完全不一樣。另外背光偶爾也會變成奇怪的輪廓光圍繞在角色身上。然後就是,1.1的膚色感覺也明顯比1.0更白,或則說更亮一些。這個可能是喜好問題?不過我覺得偏黃一點的感覺會更好一些。
其次,出圖成功率似乎變低了。這邊講的出圖是一些複雜的場景,例如2個角色在同框畫面內性愛。在1.0版本同樣的動作tag下,出圖率是很好的。例如,每5*5張可能才會出現幾張稍微奇怪的圖。而在1.1使用同樣動作tag的情況下,會出現包括但不限於消失的手腳,甚至是整個手臂大腿消失。這裏我猜測,或許是因為我使用了x-ray的寫法,但是x-ray在tag中是參考了danbooru的陰道被插入時橫切面tag。1.1版本增加了對自然語言理解的權重後,可能誤把x-ray當成透視?
另外還有的就是莫名其妙位移的胸部,甚至是人物上半身和下半身相反,以及奇怪的姿勢。還有2個角色的接近的部位融合在一起,像是手在放在頭髮附近的圖,整個前臂有機會和頭髮混合在一起。這些問題在1.0是我較少會遇到的,甚至可以說沒遇過。感覺就像1.1刻意為了完成tag的任務,為生而生,導致畫面出現奇奇怪怪的獵奇填充效果。
上面這些情況⋯我不知道是不是單純我個人使用問題,不過我暫時應該還是會繼續使用1.0版本,直至下次更新才會再嘗試了。也辛苦noob的技術人員一直以來的辛勞,非常感謝你們提供了如此優異的模型給大家。
awesome model, wish we had some artist templates and examples to use
Everything is here:
https://docs.qq.com/sheet/DZGxRSXhvcmNmeHFv?tab=8ztba1
I've been flipping through this guy's amazing style LORAs because he has lots of image previews.
https://civitai.com/user/SasakiNsfw/models
I use his LORAs for regular Illustrious but they're not working well for me in NOOB. I just type the artist name in when I'm using NOOB and try it without a LORA.
For someone more familiar with this model: Is there any way to get use out of less (~150 image) tagged characters? They show up in the csv lists, but entering them as a tag doesn't seem to get any recognition from the model, even in terms of things like species/general colours/etc. I can understand not being able to get things exact, but it feels like the tags just aren't doing anything at all once the image count drops below 400 or so.
It actually works for some characters with less than 100 illustrations but only if they've been on danbooru for a solid year or two. I've found that with characters where it only has a vague idea what they look like you can improve the results by putting as many descriptive words of their appearance as possible into the prompt. Also make sure you use spaces (not underscores) in the characters name since that's what the model is actually trained on.
You may add the core tags of the character. You could find them in this csv file: danbooru_character_webui.csv · Laxhar/noob-wiki at main (huggingface.co)
Also, you may refer to this document for more information about the model:
https://d0xb9r3fg5h.feishu.cn/docx/YpOQdtHTDoetcZxIO9fc33onnee?from=from_copylink
@NVGE Yeah, was using the tags straight out of the csv/spaces/etc, or even just testing on one-word names. I'll see if adding in some additional help works out and the tag just refines from that though; thanks!
Are Illustrious and Noob checkpoints the same thing or nah? If you train a LoRA with one, would it be compatible with the other? I can't seem to keep up or keep track of all these new generation anime models lol
Illustrious is the base model for noob. Due to extensive training, noob differs significantly from Illustrious. Although LoRAs trained on Illustrious can work on noob, the results are not perfect. Personally, I believe that character LoRAs in Illustrious perform better than art style LoRAs
translated from:Illustrious是noob的基础模型,noob由于大刀阔斧的训练,和Illustrious有许多不同,训练在Illustrious上的lora虽然可以在noob上工作,但是效果不完美,我个人认为Illustrious的lora中角色Lora的效果好于画风Lora
NoobAI-XL is what Illustrious should've been in the beginning.
Nice model
Can you use Gelbooru instead of Danbooru for training the dataset on? Danbooru removes tags and Gelbooru is more up-to-date.
I agree with you very much! I think it's possible to use Danbooru as basic training, but it's not a good idea to train tag. The current Danbooru community shows a serious aversion to artists or styles that do not conform to the preferences of their moderators. As a result, many artists' images or styles were not allowed on Danbooru's website because they did not meet the preferences of their auditors. This is not good for training AI.
@yukiyanagiba Exactly. Hopefully they have already changed to Gelbooru by now.
@AkioAI They would have to train the entire model from scratch if they wanted to train on Gelbooru instead of Danbooru, which is probably very costly. I, too, wish they'd use Gelbooru, though, due to so many styles on there that aren't represented on Danbooru. I'd also vote to train using the Rule34 website as there are tons of artists on there that aren't even represented on Danbooru or Gelbooru. However, the Rule34 website has and allows ai artists and art on it so you'd get those styles trained in the model as well as the other stuff.
@madaraxuchiha88 Idk how they train the model or the details on finetuning/training on a full model but couldn't they just switch to Gelbooru or a better website in the middle of training? I am pretty sure you can do blacklist of "AI-assisted" and "AI-generated" tags to avoid training on A.I or really bad images that was created by A.I if they used Rule34 as well. Unless they are using an A.I to grab all of the images?
The quality control and tagging on Gelbooru is much worse than Danbooru. 80% of images on Gelbooru are pulled straight from Danbooru, the rest are duplicates or of varying quality.
@HDiffusion I see. What is a better database to grab tags and images then? if you know any
@AkioAI For anime I think danbooru is already the best. Everything else requires a ton of filtering and deduplication. I think instead the focus should be on types of anime data that isn't common on image boards, like screencaps. They've already done this starting with v0.65. I think the noob dataset is already big enough.
Good model
I gave NoobAI another try with this version. First results look intriguing, especially the style variations. I guess it's time to say good bye to Pony and hello to Illustrious!
very good, but I can't seem to get males. Whenever I try generating a male character, it just ends up giving me a girl with short hair or a tomboy, but overall the model is pretty good
Try adding "male focus" after 1boy/male or adding "otoko no ko, femboy, girly, crossdressing" in negative. And it could also be cause of artist styles/loras, cause for me it gives me a solid gigachad/faceless dojin man with just "1boy, solo" without any loras.
Best tip I can provide is to make sure the male character has around 100 images in Danbooru, avoid artist that tend to draw only females and if you select one, lower the intensity of the artist by writing it as (Artist:0.7) or a lower number. I haven't needed to use 1girl, woman, girl, in negatives at all so far, but I can see those helping. Either way, good luck
i alway use
1boy,fat man/old man,short hair......
1boy,Asian man,......
It can be used perfectly with 1girl, and there is no problem that men can’t appear at all.
If you only provide "1boy," there is no reaction. However, if you add any related male prompt words, it works perfectly.
Additionally, please note that if there are any "1girl" prompts with "solo" in front, please remove them.
For the problem where there is no specific male character, I chose to train lora myself to solve it.
thank you all for the tips! I have switched a couple artists and now it makes me male characters flawlessly!
Details
Files
noobaiXLNAIXL_epsilonPred11Version.safetensors
Mirrors
noobaiXLNAIXL_epsilonPred11Version.safetensors
NoobAI-XL-v1.1.safetensors
noobaiReplaceForLora_v10.safetensors
noobaiXLNAIXL_epsilonPred11Version.safetensors
noobaiXLNAIXL_epsilonPred11Version.safetensors
noobaiXLNAIXL_epsilonPred11Version.safetensors
noobaiXLNAIXL_epsilonPred11Version.safetensors
noobaiXLNAIXL_epsilonPred11Version.safetensors
noobaiXLNAIXL_epsilonPred11Version.safetensors
noobaiXLNAIXL_epsilonPred11Version.safetensors
noobaiXLNAIXL_epsilonPred11Version.safetensors
noobaiXLNAIXL_epsilonPred11Version.safetensors
NoobAI-XL-v1.1.safetensors
noobaiXLNAIXL_epsilonPred11Version.safetensors
noobaiXLNAIXL_epsilonPred11Version.safetensors
noobaiXLNAIXL_epsilonPred11Version.safetensors
noobaiXLNAIXL_epsilonPred11Version.safetensors
noobaiXLNAIXL_epsilonPred11Version.safetensors
noobaiXLNAIXL_epsilonPred11Version.safetensors
noobaiXLNAIXL_epsilonPred11Version.safetensors
noobaiXLNAIXL_epsilonPred11Version.safetensors
NoobAI-XL-v1.1.safetensors
NoobAI-XL-v1.1.safetensors
noobaiXLNAIXL_epsilonPred11Version.safetensors
noobaiXLNAIXL_epsilonPred11Version.safetensors
noobaiXLNAIXL_epsilonPred11Version.safetensors
noobaiXLNAIXL_epsilonPred11Version.safetensors
noobaiXLNAIXL_epsilonPred11Version.safetensors
noobaiXLNAIXL_epsilonPred11Version.safetensors
noobaiXLNAIXL_epsilonPred11Version.safetensors
NoobAI-XL-v1.1.safetensors
noobaiXLNAIXL_epsilonPred11Version.safetensors
noobaiXLNAIXL_epsilonPred11Version.safetensors
noobaiXLNAIXL_epsilonPred11Version.safetensors
noobaiXLNAIXL_epsilonPred11Version.safetensors
noobaiXLNAIXL_epsilonPred11Version.safetensors
Available On (10 platforms)
Same model published on other platforms. May have additional downloads or version variants.


















