This model is different from the past. If you use it,please read the contents carefully:,
This model is a lightning big model pair (text emphasis/CFG Scale) comparison. The larger the inscription value,the higher the saturation. According to the official recommend and the CFG value "2" tested by myself,the expressive force is the most prominent and true,
The recommend range of the sampling step (Sampling steps) is between 10 and 30,and the best value "25" is obtained after the multi-vocabulary test ",
This model fixes and integrates VAE,keeping the VAE status as (None/None) in the running graph,
The amplification algorithm can use 4x-UltraSharp (high-score iteration step number "15"),
Sampling method recommend DPM 2S a,
This model is a multi-class large model can generate a variety of different styles,you can be bold innovation,
Must Read: V8.3 (Dark Photography Edition) belongs to the fine-tuning version, the training of the light shadow texture effect and the whole body hand crash problem to optimize and adjust with the big model fine-tuning synthesis, this model of the sampler to adjust the attention of the sampling ## Restart ##
If you like gray photography texture, you can use this version.
(This version can be used as a base film for training to understand the hands much better, and greatly reduce the crashing problem)
Support for image production (headshots, portraits, half-body, full-body)
Suggested sizes: 768, 1024, 1280, 1536 (2048*2048 images will have multi-body problems, if you want better texture, you can use the amplifier to realize large size images 4x-UltraSharp)
Iterative deployment: (25-30 steps)
Cue word guidance coefficient CFG scale: (7)
Recommended sampling method: Restart
Recommended Zoom Algorithm: 4x-UltraSharp (or) 4x_NMKD-Superscale-SP_178000_G
(Redraw amplitude recommended around 4) (High score iteration steps: 25 steps - 30 steps)
(Local map running - zoomable Recommended 1.5-2x)
About this version
Must Read: In V3, Retraining Using Cog Natural Language TAG Tagging Training
In the large model training set, extract some pictures for LOHR training and integrate them into the large model.
Output image support (headshot, portrait, half-body, full-body)
Iterative deployment: (25-30 steps)
Cue word guidance coefficient CFG scale: (7)
Recommended sampling method: DPM++ 2S a
In the V2 version, the character texture has been further optimized to enhance the detail and realism of the texture. At the same time, the light effect has also been optimized, so that the model can present a more realistic light and shadow effect under different lighting conditions.
This update makes changes to the Sampler. Please note the following recommended parameters
Iterative deployment: 25 -30
Recommended Sampling Method: DPM SDE Karras
Recommended magnification algorithm: 4x-UltraSharp (or) SwinIR_4x (redrawing amplitude is recommended to be around 0.3-0.5) (local operation chart magnification is recommended to be 1.5-2 times)
It is recommended to open the ADetailer run chart.
This model photography light and shadow sense has been fine-tuned, realistic visual effects
Iterative deployment: 25 -30 (30 details recommended skin is very good)
Recommended Sampling Method: DPM 2S a Karras
Recommended magnification algorithm: 4x-UltraSharp (or) SwinIR_4x (redrawing amplitude is recommended to be around 0.4-0.6) (local running map-magnification can directly open 2 times with good picture quality)
It is recommended to open AD running chart.
Description
Must Read: In V3, Retraining Using Cog Natural Language TAG Tagging Training
In the large model training set, extract some pictures for LOHR training and integrate them into the large model.
Output image support (headshot, portrait, half-body, full-body)
Iterative deployment: (25-30 steps)
Cue word guidance coefficient CFG scale: (7)
Recommended sampling method: DPM++ 2S a
FAQ
Comments (6)
Very nice! Have you considered making this a lightning model at all? I've been experimenting with this an a lightning lora, and the accuracy of prompt to image is astounding.
Can I ask how many sample images you used to do the CLIP CogVLM tagging training? If you wouldn't mind sharing the details I'd appreciate it. I'm trying a similar approach, but with LlaVA 1.6 at the moment...
Nice, hoping you'll train this over SD3 soon!!!
fairly good model, cryengine 2.0 (no longer for download) is better at fine details, especially cleaner eyes but has more artificial skin. If it was a bit cleaner it could get up next to artUniverse XL - keep going
This is probably the most realistic model I've used, however it has a long way to go. It doesn't follow prompts very well and NSFW is very hard to generate. Not even porn I mean things like a woman wearing a low cut shirt takes a ton of generations to get something. If you can change those things for the next version you'll have a great model on your hands.
Say what you want, but if you use this model combined with video game lora's and turn them realistic this is a sickly amazing realistic model maybe the best I have used. So I can overlook the NSFW part as I don't think he intended it for NSFW
Details
Files
Available On (1 platform)
Same model published on other platforms. May have additional downloads or version variants.





