CivArchive
    Preview 50403094

    ***Work in Progress***

    Hyper Flux Dedistilled models combine the Flux-dev-de-distill checkpoint by nyanko7 with the Hyper-SD LoRA by ByteDance into single-file models in q4_KM GGUF format for easy use on mid-tier hardware.

    The Hyper 16 Steps version provides the same generation time as Flux-dev while producing better results. It approaches Flux Pro in terms of quality and prompt understanding. The Hyper 8 Steps version is twice as fast and offers a balance between speed and quality.

    Various weights for the Hyper-SD LoRA were tested, and these models were created using the ones that provided the best results. This ensures that the models maintain consistent performance and converge as expected with the recommended step values, even after quality loss from quantization.


    Hyper 8 Steps:

    • CFG: 1 to 2 (not distilled guidance)

    • Steps: 8 to 12 (more is better, best at 12 steps)

    • Sampler: DPM-2 (for high detail) or DPM++ 2M (for faster generation)

    • Scheduler: Beta

    Best for quick generations while keeping image quality above the standard Flux-dev level.

    Hyper 16 Steps:

    • CFG: 2 to 3

    • Steps: 16 to 24 (more is better, but 20 gives great results)

    • Sampler: DEIS (for stylized, artistic outputs) or DPM++ 2M (for realistic raw images)

    • Scheduler: Beta

    Ideal for refined, detailed images with excellent prompt understanding, similar to Flux Pro.


    This is a pruned model, so you will need to download the following :

    Text Decoders: t5_xxl and clip_l from Flux Text Encoders (the FP8 version is recommended for mid to low tier pc).

    VAE: ae.safetensors from Black Forest Labs VAE.

    I also recommend trying the following clip_l model for better text and details :

    CLIP Text Encoder (alternative option): CLIP-GmP-ViT-L-14 (HiT GmP TE).


    No need to use Flux guidance—this model uses real CFG, just like any Stable Diffusion model, making it possible to use negative prompting effectively. For Hyper-16, use the same CFG value you would normally use for distilled guidance in Flux-dev (e.g., if you usually use 2.5, stick with that). For Hyper-8, I recommend using half of that value.


    Future Versions:
    This release is in q4_KM format. Other formats, such as FP8, q5_KM or q8_0, may be made available upon request for users with different hardware needs.


    This project builds upon the incredible work of the original authors:

    • Flux-dev-de-distill by nyanko7: An advanced checkpoint implementing true classifier-free guidance.

    • Hyper-SD by ByteDance: A powerful LoRA solution enabling faster generation time.

    The creation of these models is solely attributed to their respective authors. I take no credit for their work or development. The Hyper Flux Dedistilled models simply integrate and format these into a combined, accessible solution.


    Both versions havw been tested to run smoothly on mid-tier GPUs, including a RTX 3070 and a RTX 2060 Super with 8GB VRAM, with excellent performance.

    Description

    Ideal for refined, detailed images with excellent prompt understanding, similar to Flux Pro.

    Recommended Settings for Hyper 16 Steps:

    • CFG: 2 to 3 (real CFG)

    • Steps: 16 to 24 (more is better, but 20 gives great results)

    • Sampler: DPM-2 (best quality), DEIS (for stylized outputs) or DPM++ 2M (for realistic raw images)

    • Scheduler: Beta

    FAQ

    Comments (50)

    dio011Jan 8, 2025
    CivitAI

    I Wait for 8 steps of fp16 and fp8.

    C47HERINE
    Author
    Jan 8, 2025· 1 reaction

    The FP8 version has been uploaded!

    dio011Jan 9, 2025

    @C47HERINE thank you~

    totesJan 8, 2025
    CivitAI

    Its speedy and good with art, but it doesn't do the loras I made to well. I'm not sure if thats on me or if its the type of model. Either way, as far as non-lora goes, awesome.

    C47HERINE
    Author
    Jan 8, 2025

    Try lowering the weight of your LoRAs. I’ve found that this model understands LoRAs better than any other model I’ve used, it’s epic. However, sometimes it understands them too well, making any defects or signs of overtraining very obvious.

    What works best for me is either decreasing the weight, increasing the CFG value (if the LoRAs seem to have no effect), or lowering the CFG if the effect is too strong. If you have an earlier training epoch, that might work better too.

    Some of my LoRAs require half the weight I was using with regular Flux-dev, and for some style LoRAs, I even need to lower it as much as possible (sometimes down from a weight of 1 to 0.1) to see the intended effect.

    You’ll need to relearn how to work with LoRAs for this model, but it’s definitely worth it. Also, make sure you’re not using Flux guidance.

    SirColinJan 8, 2025· 1 reaction

    With de-distilled models you have to train your loras on Flux Dev2Pro checkpoint, not on flux dev. The loras I trained on dev looks terrible on this model, but if you train using the Dev2Pro checkpoint, they are exceptionally working well with this model.

    C47HERINE
    Author
    Jan 8, 2025

    @sparkleboy599 Well yes and no, the real answer is that flux-dev is not so good at training and capturing details. It's better than most, but it does mistake. Sadly we're only finding that now that dedistilled models are coming out. Dev2Pro is one of those and it will do an amazing job compared to dev. But, does it mean we'll have to redo everything? Probably not, some stuff will be salvagable... some is just plain better, and some will have to be retrained.

    totesJan 8, 2025· 1 reaction

    @C47HERINE Hey there, I turned down the weight on the LoRA a bit and changed the sampler to simple and it seems to have worked well. So all in all, great model. Awesome work. Keep it up! Also thanks for the tips.

    SirColinJan 9, 2025

    @C47HERINE Interesting, does it mean I should try to train my Loras straight on your de-distilled checkpoint, without using the dev2pro model?

    C47HERINE
    Author
    Jan 9, 2025

    @sparkleboy599 Personally I would use the Dev2pro. My model is an hyper version based on the same idea. Mine limits the number of steps needed for the model to converge; but, for training I prefer more steps with a lower learning rate.

    valentinkognito365Jan 8, 2025· 1 reaction
    CivitAI

    I can't wait to test that :-)

    valentinkognito365Jan 13, 2025

    I've done some comparative tests on a fixed seed picture and my conclusions were :
    - Hyper8 at 8 or 10 steps is definitely not quality enough for me, but at say 12 steps, it's an OK quick-job
    - Hyper8 at 30 steps is 50% time of dedistilled 60 steps but around 70% its quality
    - Hyper8 at 40 steps is 66% time of dedistilled 60 steps but around 90% its quality
    - Hyper16 I found NOT good at all every time I've used it, at any steps number. I won't use it in its current form.

    Thank you for these models and I hope you can make them even better

    wregiszter632Jan 8, 2025
    CivitAI

    I don't use dedistilled models before, so I have a question.

    What should I set up for Distilled CFG if I use this model? Or does it have no effect on the generation in this case?

    C47HERINE
    Author
    Jan 8, 2025

    what are you using, comfy, forge or something else? From what I’ve seen so far, sometimes it doesn’t do anything and sometimes it affects the image in a bad way. I believe that it is best to disable it, but if you do use it, it won’t change much.

    wregiszter632Jan 8, 2025

    @C47HERINE I use Forge. Thank you, I set it to 0.

    C47HERINE
    Author
    Jan 8, 2025

    @wregiszter632 I haven't used forge in a while, but I think you can load it as flux schnell and still use Loras. I believed doing that it will simply remove the flux guidance slider. If you try it can you please confirm if it worked for you and if loras are still working ?

    wregiszter632Jan 8, 2025· 1 reaction

    @C47HERINE Yes, Forge disable distilled CFG for Schnell models. I try some LoRAs, and I don't have problems with them.

    C47HERINE
    Author
    Jan 9, 2025

    @wregiszter632 Then that’s the way to use it in forge, thanks for the reply! 🙂

    wregiszter632Jan 9, 2025· 1 reaction

    @C47HERINE I don't know where, but I saw that Diffusion in Low Bits has to be set to Automatic (fp16 LoRA) in Forge, and I've been using it that way ever since. On Automatic not all LoRA was correct.

    Tozi_WhiteJan 9, 2025· 1 reaction
    CivitAI

    This all works without setting up CLIP-L etc? Because you don't mention anything about it in the description.
    I ask because I don't have to set anything related to it in SwarmUI and it works, only that the quality is poor.

    C47HERINE
    Author
    Jan 9, 2025

    Swarm auto autodetects and downloads/load the clip and vae models. That’s why.

    This is a pruned model, so no it’s not included. I will add a link for the T5, my favourite clip_l, and the vae from Black Forest Lab in the description. Now for quality, disable the flux guidance, and use a cfg value of about 1.5 for the 8 steps model and a cfg value of 2.5 for the 16 steps model. Those are only recommendations you can always adjust those values to your liking.

    As with Stable Diffusion models, lower values give more realistic results and higher values outputs more artistic results.

    Satsuma9977Jan 9, 2025
    CivitAI

    Doesnt work on Forge

    even with the 2 text encoders and the ae VAE wich isnt even mentioned in this post (doesnt work without them)

    C47HERINE
    Author
    Jan 9, 2025· 2 reactions

    Use fp8 and load it as schnell. If you do so, the only difference is that it won’t load the flux guidance function, which you don’t need. This model is based on Flux-Dev but its limitations have been removed. Which means it takes more time to generate better images if you put time into it. The reason behind that is that it needs more computing power with a CFG value greater than one (the guidance was implemented to counter this).

    Now for the text encoders and VAE, I will surely add this to the description. Thanks for the idea! Not everyone knows what a pruned model is and I should have think of that.

    Do you believe it would help other Forge users to have a full model with everything baked in? Personally I find that limiting in comfy, but if it helps you I will upload one. :)

    Satsuma9977Jan 10, 2025

    @C47HERINE No worries and thanks.

    Personally i would prefer it if i only had to download one flux file and had fast generation times with good quality

    C47HERINE
    Author
    Jan 10, 2025

    @Satsuma9977 I will add the fp8 models with everything baked in during the weekend :) Have you tried to make it work as schnell ? Another user reported that it worked well for him this way in another comment. One more tip, you should lower the weight of your loras if they don’t work well anymore. The model understands them better and signs of overtraining might show up at their normal value. Lower that and you should get awesome results!

    SirColinJan 10, 2025· 1 reaction

    I have no problem making it work beautifully in Forge WebUI. The trick is to use DPM2 sampling with Beta scheduler. Set your Distilled CFG Scale to 0 and use CFG Scale instead, with values between 1 and 5. I use the hyper8Q4KM without problem on my 4070TI Super 16GB VRAM, with VAE: ae.safetensor, clip_l.safetensors and t5xxl_fp16.safetensors and I set Diffusion in low bits to Automatic (fp16 LoRA)

    MaleeFantasyArtJan 11, 2025· 2 reactions
    CivitAI

    I'm absolutely new to Flux... Can I use loras with this model? Tnks.

    C47HERINE
    Author
    Jan 11, 2025· 1 reaction

    Yes you can use LoRa as with any flux-dev models!

    amandalevresJan 15, 2025· 1 reaction
    CivitAI

    Hi,

    I'm using the FP8 16 step version in forge (GTX1070 8Gb) as you have mentioned Lora strength seems to need toning down so it is taking some getting used to, but I love the prompt adherence. I'll probably continue to use distilled for generic work but if I want a precise piece I'll definitely use this. Excellent work, and I look forward to a full NSFW version in the future. But just wanted to thank you for your work on this, it really is appreciated!

    C47HERINE
    Author
    Jan 15, 2025· 2 reactions

    @amandalevres I totally agree! Usually, I use the regular Flux-Dev for rough drafts, and once I get the prompt right along with the settings, I switch to Hyper-16 dedistilled for the final draft. Interestingly, with Hyper-16, you can use the same parameters for everything except for the guidance. This last part is somewhat equivalent, though, so distilled guidance ≈ CFG value!

    amandalevresJan 15, 2025· 1 reaction

    @C47HERINE I normally use a Distilled CFG of 3.5 but using that with this checkpoint for CFG is too much, mu best results came between 2.5 and 3

    AlexatrJan 17, 2025
    CivitAI

    Thank you for the models.

    I have a question about using them in Forge.

    With your model I don't understand how to use the CFG Scale and Distilled CFG Scale parameters.

    C47HERINE
    Author
    Jan 17, 2025· 1 reaction

    @Alexatr use CFG instead of distilled CFG if you can. Otherwise you have two options, load the model as schnell or leave the distilled cfg to any low value. It doesn't do much anyway.

    Depending on what you use : chenge the nodes you use for comfyui, disable distilled cfg in swarmui, or load as schnell in forge

    amandalevresJan 27, 2025
    CivitAI

    Hi,

    Is there any chance of BnBNF4 version, current I'm restricted to using 768x512 rather than my normal 1024x768 I only have an 8Gb very old GTX 1070

    Thank you, again

    Amanda

    C47HERINE
    Author
    Feb 18, 2025

    I've manage to get great results with an RTX 2060s with 8gb of ram. I've found a way to generate at 2k with 1 to 2s/it There is some quality loss but most of the time it's barely noticeable. PM me if you're interested!

    benzstationFeb 18, 2025
    CivitAI

    any updates on q5_KM ?

    C47HERINE
    Author
    Feb 18, 2025

    I'll try to upload it as soon as I can! but for whatever reason uploading on civit fails most of the time.. It will be there eventually.

    benzstationFeb 24, 2025

    @C47HERINE Thanks! looking forward to use it

    pipgamemyrran187Mar 24, 2025
    CivitAI

    Can you give us a comfy workflow? Will help a lot of us who are new to comfy.

    C47HERINE
    Author
    Mar 25, 2025

    I've stopped usinf gguf models lately (updated my video card). However if you havea ggug loader installed, just drag and drop the following image in Comfy https://civitai.com/images/65675938. Replace the "load diffusion model" node with the gguf model loader node, then set the proper parameters (using the recommendations depending if you use 8 or 16 steps model).

    C47HERINE
    Author
    Mar 25, 2025

    Personally I use SwarmUI, it's simpler to set up, it installs all the requirements for you and you still have acces to comfyui for more diffucult task.

    pipgamemyrran187Mar 30, 2025· 1 reaction

    @C47HERINE 

    Thanks, I just moved to ComfyUI because I upgraded my computer setup too and was finally able to move from sd1.5 to everything else. Couldn't get Forge to work on my new setup, but comfy was no problem, well other than using it. I've never heard of swarm, gotta check that out.

    zzeke31954Jul 15, 2025
    CivitAI

    Hi, I have tried and tried really wanting it to work, but just hasn't... I followed all your exact settings (yes DCFG is 0 and CFG is 1.5-2 on Hyper8, DPM2 & Beta, got the CLIP, ae, and FP8 T5), on Forge with 2070 Super, I am getting triple the time with 14s per iteration, compared to Flux1D Q8 models that I was getting 4.5s per iteration...whole generation takes about 3 times as long...I saw your comment saying 16 steps is about as fast as Flux1D and I'm mind blown... Do you have any idea why it's so slow for me? And the results are extremely blurry/artifacty as well

    C47HERINE
    Author
    Jul 15, 2025

    Try to load it as schnell in forge.

    A lot of people had issues with that ui and dedistilled models. I don’t know why, but it seems to be forcing distilled cfg on anything loaded as dev (comfy and swarm ui handles this way better) but that simple workaround should do.

    zzeke31954Jul 15, 2025

    @C47HERINE Thanks, but dumb question: how do you load it as Schnell on Forge? In the cmd window it already gave me the message "Distilled CFG Scale will be ignored for Schnell", so I'm confused...thanks again!

    C47HERINE
    Author
    Jul 15, 2025

    As for the number of steps, it’s been a really long while since I’ve used this model but try using 24 steps (using more steps will not take necessarily make it longer) and a cfg of 2.5 with dpm++ 2m and beta. That set up without Lora’s should do wonders and you can always adjust the setting slightly once you have something that looks good (less cfg=lower prompt understanding but more realism).

    Then if you want to use Loras, add one at the time. The main issue with and distilled models is that it understands Lora’s way too well. That means that any defect, overtraining, or weird stuff included in a Lora, which would work normally with dev with some occasional hiccup, will be horrible when paired with a dedistilled model. It will pick up any defect and expose them.

    The solution is super simple add one Lora at a time, use the recommended strength, and if you run into issues lower the strength by increments of 0.1. Generate your image and then repeat if necessary. Distilled is smarter than dev it doesn’t need the same strength than dev.

    C47HERINE
    Author
    Jul 15, 2025

    @jj05690569159 perhaps it already is, but I believe it’s close to the drop down menu where you load your model along with t5 and clip_l at the top of the ui. Or edit the metadata of the model within the ui, if that’s possible.

    sorry if I can’t help you more than that with this part… as I previously mentioned I’m a swarm/comfyui kinda girl ahah

    The best advice I could give you is to try out swarm, it’s super simple and a lot of people who tried distilled migrated this ui since it’s way better at handling complicated task like using dedistilled (and many more things, it’s like the beginning of what you can do with diffusion models and you will not be able to go much farther with forge at some point).

    zzeke31954Jul 15, 2025

    @C47HERINE Thank you so much for all the help and explanation! I will certainly look at Swarm! It also looks like I'm getting better images with your hyper models, just needs a little lower CFG for my machine. You said you don't use this model anymore, what de-distilled or distilled models do you use?

    C47HERINE
    Author
    Jul 15, 2025

    @jj05690569159 I only use my other model Pandora https://civitai.com/models/1143800/pandora-or-flux , I’ve made it for myself! And I’m sharing it here. It’s wildly different but much much better. There’s two version Dev and Pro, the pro version is a dedistilled model. However at this point, the latest dev (beta) version gives results as good as my latest Pro version (alpha)! I’ll update the later soon, but I’m still testing it.

    zzeke31954Jul 15, 2025

    @C47HERINE Downloading it! Thanks!

    Checkpoint
    Flux.1 D

    Details

    Downloads
    530
    Platform
    CivitAI
    Platform Status
    Available
    Created
    1/8/2025
    Updated
    5/20/2026
    Deleted
    -

    Files

    hyperFluxDedistilled_hyper16StepsFP8.safetensors

    Available On (1 platform)

    Same model published on other platforms. May have additional downloads or version variants.