Kohaku XL beta
An anime SDXL model trained on 1.5M images.
Notice: Further experiments have showed that beta7 is generally better than beta7pro. Beta7pro have some mosaic artifact (more significant than beta7) and beta7 have better txt-img alignment. So I rearrange the order of models.
Introduction
This model is resumed from [Kohaku-XL alpha](Kohaku-XL alpha - nyan | Stable Diffusion Checkpoint | Civitai) with 1.5M images and then merged with other models.
Usage Details
This model is very flexible on resolution, you can use the resolution you used in sd1.x/2.x to get normal result(like 512x768), you can also use the resolution that is more native for sdxl(like 896*1280) or even bigger (1024x1536 also ok for t2i).
recommended negative prompt for anime style:
photorealistic, 3d model, bad, worse, worst, ugly, bad anatomy, blurry, close-up, disembodied limbphotorealistic, text, icon, artist name, signature, twitter username, naked, nude, monochrome, blurry, bad anatomy, watercolor, oil paintingwatercolor, oil painting, photo, deformed, realism, disfigured, lowres, bad anatomy, bad hands, text, error, missing fingers, extra digit, fewer digits, cropped, worst quality, low quality, normal quality, jpeg artifacts, signature, watermark, username, blurry
Training Details
Kohaku-XL beta5
This model is trained on new-danbooru (danbooru images with id from 5,000,000~6,600,000) which have 1.48M images.
This model is resumed from kohaku-xl alpha7 and then merged with NekoRayXL.
Kohaku-XL base4 (haven't publish yet)
This model is trained on new-danbooru (danbooru images with id from 5,000,000~6,600,000) which have 1.48M images.
Thie model is resumed from sdxl-0.9 (due to some bad property in sdxl-1.0 which will affect finetuning). In the plan this model will be trained with 2epoch (about 94.5k steps).
I haven't published this pretrained model yet.
Kohaku-XL beta7
This model is merged with base4 and beta 5, the formula is:
beta(5+n) = beta5 + (n/4) * (base4 - sdxl0.9)So beta7 is beta5 + 0.5 * (base4 - sdxl0.9)
Kohaku-XL beta7.1(7pro)
as same as beta7 but use the finished base4 and 0.25 weight.
Note: the base4 here is the 50k step version!!
Future Plan
I will run training on base4(after it finish) in Mynefactory dataset or CyberMeow(alea31415)/Narugo1992's reg dataset.
Acknowledgements
Models
Description
new danbooru (danbooru id 5,000,000~6,600,000) 1.5M images.
50k steps (32 batch size) from sdxl-0.9.
Merged from kohaku-xl beta5 (beta7 = beta5 + 0.5 * (new_dan - sdxl0.9))
FAQ
Comments (6)
这手指的生成效果超级好啊
Pretty awesome.
I'm a bit confused... First images i got with it i was like hmm, this could be the kind of rendering i could get with a SD 1.5 model, but then i (re) read description, ok it accepts SD 1.5 sizes, figures.
I'm not saying it's bad, on the contrary, i got stunning -and completely unexpected- results with a rather simple prompt. But this kinda defeats the purpose of an XL model ? You don't take the benefits from XL general quality and details, especially faces. Of course you can summon utility loras here but this tends to change general composition.
My 2 cents: split up model in two versions, one dedicated for SD 1.5 with a 512x512 dataset, one for SDXL with a 1024x1024 dataset. Seems to me it's an uncomfortable position to try and get the best of both worlds in a single place.
You know what
I don't have any low res dataset
It just "accept" the low res
I don't know why
@kblueleaf Ha that defies logic... I like the oddities it can produce !
This model is amazing, thanks so much!
Details
Files
kohakuXLBeta_beta7.safetensors
Mirrors
kohakuXLBeta_beta7.safetensors
kohakuXLBeta_beta7.safetensors
kohakuXLBeta_beta7.safetensors
kohakuXLBeta_beta7.safetensors
KohakuXL_beta7.safetensors
kohakuXLBeta_beta7.safetensors
kohakuXLBeta_beta7.safetensors
kohakuXLBeta_beta7.safetensors
kohakuXLBeta_beta7.safetensors
kohakuXLBeta_beta7.safetensors
Kohaku-XL_beta7.safetensors
Available On (3 platforms)
Same model published on other platforms. May have additional downloads or version variants.




