CivArchive
    SD XL - DPO Finetune (Direct Preference Optimization) - v1.0
    NSFW
    Preview 4734888
    Preview 4734886
    Preview 4734887
    Preview 4734879
    Preview 4734881
    Preview 4734878
    Preview 4734895
    Preview 4734893
    Preview 4734901
    Preview 4734889

    SD XL finetuned with Direct Preference Optimization. Tested and working with both ComfyUI and Automatic1111 WebUI.

    Credit goes to the fine-tune here: https://huggingface.co/mhdang/dpo-sdxl-text2image-v1. See the Huggingface page for more details.

    Text Encoder and VAE from SD XL v1.0 VAE Fix [E6BB9EA85B] https://civarchive.com/models/101055/sd-xl

    The U-Net of this model (ComfyUI-only model here: https://civarchive.com/models/237681/dpo-sdxl-fp16) has been converted into a format compatible with both A1111 and ComfyUI using this tool: https://github.com/arenasys/stable-diffusion-webui-model-toolkit

    Description

    FAQ

    Comments (9)

    Agent_SmthDec 20, 2023
    CivitAI

    thanks for sharing! may i ask what is this preference optimization about?

    adempotent
    Author
    Dec 20, 2023

    From https://huggingface.co/mhdang/dpo-sdxl-text2image-v1

    "Direct Preference Optimization (DPO) for text-to-image diffusion models is a method to align diffusion models to text human preferences by directly optimizing on human comparison data."

    pandaasylum404354Dec 23, 2023

    @adempotent In caveman please?

    AiliotAldersonDec 24, 2023· 2 reactions

    @pandaasylum404354 Most models are capable of generating excellent or very poor images, right? From what I understand, DPO basically means a human steps in and says, "Hey, I like image A better than B. Let's lean toward A". So it's fine-tuned with user intervention.

    This is how I interpret it at least so take that with a grain of salt.

    sevenof9247Dec 20, 2023
    CivitAI

    seems usual loras dont work

    looks allways overtrained also with weight 0.1

    adempotent
    Author
    Dec 20, 2023

    Lora trained for SD XL Base are working for me. ComfyUI seems to output better images than A1111 with this checkpoint. Unsure why though...

    DanrisiDec 26, 2023

    I've just checked and loras are working perfect. Don't know maybe smth wrong with a1111. I use comfy

    HikariasJan 13, 2024· 1 reaction
    CivitAI

    im sorry but whats the difference with this?

    https://civitai.com/models/237681/dpo-sdxl-fp16

    phil866Jan 29, 2024

    there seem to be some rendering bugs in this one compared to the other one. I recommend using other one.
    See https://postimg.cc/9DrJZTFp

    Checkpoint
    SDXL 1.0

    Details

    Downloads
    792
    Platform
    CivitAI
    Platform Status
    Available
    Created
    12/20/2023
    Updated
    5/13/2026
    Deleted
    -

    Files

    sdXLDPOFinetuneDirect_v10.safetensors

    Available On (1 platform)

    Same model published on other platforms. May have additional downloads or version variants.