Kohaku-XL beta - beta7

NSFW

Kohaku XL beta

An anime SDXL model trained on 1.5M images.

Notice: Further experiments have showed that beta7 is generally better than beta7pro. Beta7pro have some mosaic artifact (more significant than beta7) and beta7 have better txt-img alignment. So I rearrange the order of models.

Introduction

This model is resumed from [Kohaku-XL alpha](Kohaku-XL alpha - nyan | Stable Diffusion Checkpoint | Civitai) with 1.5M images and then merged with other models.

Usage Details

This model is very flexible on resolution, you can use the resolution you used in sd1.x/2.x to get normal result(like 512x768), you can also use the resolution that is more native for sdxl(like 896*1280) or even bigger (1024x1536 also ok for t2i).

recommended negative prompt for anime style:

photorealistic, 3d model, bad, worse, worst, ugly, bad anatomy, blurry, close-up, disembodied limb
photorealistic, text, icon, artist name, signature, twitter username, naked, nude, monochrome, blurry, bad anatomy, watercolor, oil painting
watercolor, oil painting, photo, deformed, realism, disfigured, lowres, bad anatomy, bad hands, text, error, missing fingers, extra digit, fewer digits, cropped, worst quality, low quality, normal quality, jpeg artifacts, signature, watermark, username, blurry

Training Details

Kohaku-XL beta5

This model is trained on new-danbooru (danbooru images with id from 5,000,000~6,600,000) which have 1.48M images.
This model is resumed from kohaku-xl alpha7 and then merged with NekoRayXL.

Kohaku-XL base4 (haven't publish yet)

This model is trained on new-danbooru (danbooru images with id from 5,000,000~6,600,000) which have 1.48M images.
Thie model is resumed from sdxl-0.9 (due to some bad property in sdxl-1.0 which will affect finetuning). In the plan this model will be trained with 2epoch (about 94.5k steps).

I haven't published this pretrained model yet.

Kohaku-XL beta7

This model is merged with base4 and beta 5, the formula is:

beta(5+n) = beta5 + (n/4) * (base4 - sdxl0.9)

So beta7 is beta5 + 0.5 * (base4 - sdxl0.9)

Kohaku-XL beta7.1(7pro)

as same as beta7 but use the finished base4 and 0.25 weight.

Note: the base4 here is the 50k step version!!

Future Plan

I will run training on base4(after it finish) in Mynefactory dataset or CyberMeow(alea31415)/Narugo1992's reg dataset.

Acknowledgements

Models

NekoRayXL

Description

new danbooru (danbooru id 5,000,000~6,600,000) 1.5M images.
50k steps (32 batch size) from sdxl-0.9.
Merged from kohaku-xl beta5 (beta7 = beta5 + 0.5 * (new_dan - sdxl0.9))

FAQ

Comments (6)

second222Oct 22, 2023· 2 reactions

CivitAI

这手指的生成效果超级好啊

Euge_Oct 23, 2023· 3 reactions

CivitAI

Pretty awesome.

ze_thrillerOct 23, 2023· 3 reactions

CivitAI

I'm a bit confused... First images i got with it i was like hmm, this could be the kind of rendering i could get with a SD 1.5 model, but then i (re) read description, ok it accepts SD 1.5 sizes, figures.

I'm not saying it's bad, on the contrary, i got stunning -and completely unexpected- results with a rather simple prompt. But this kinda defeats the purpose of an XL model ? You don't take the benefits from XL general quality and details, especially faces. Of course you can summon utility loras here but this tends to change general composition.

My 2 cents: split up model in two versions, one dedicated for SD 1.5 with a 512x512 dataset, one for SDXL with a 1024x1024 dataset. Seems to me it's an uncomfortable position to try and get the best of both worlds in a single place.

kblueleaf

Author

Oct 23, 2023

You know what

I don't have any low res dataset

It just "accept" the low res

I don't know why

ze_thrillerOct 23, 2023· 1 reaction

@kblueleaf Ha that defies logic... I like the oddities it can produce !

bloatedcowOct 25, 2023· 1 reaction

CivitAI

This model is amazing, thanks so much!