CivArchive
    Long CLIP (Distilled) - Distilled_Predict_CLIP-L
    NSFW
    Preview 98647332
    Preview 98639545
    Preview 98639100
    Preview 98639099
    Preview 98645144
    Preview 98639995
    Preview 98639997

    Long CLIP (Distilled)

    • Teacher/Student Distillation from 248/218 token length to projected 77

    • Pruned for use in SDXL, FLUX, SD 1.5, SD3, Hunyaun Video

    • DO NOT USE IN HI-DREAM, PONY (In most cases), or iLLustrious

    • Some of the top onsite models built with FP32 Distilled CLIP/FP32 VAE

    • Forcing FP32 CLIP recommended for Comfy, Forge, Auto1111


    HiDream CLIP has been trained on a distillation set and the 248 and 218 token lengths reduced to 77 based on the pooled vision/text model output.

    Description

    FAQ

    Comments (8)

    vhpSep 6, 2025
    CivitAI

    Did you know you can use ViT-L-14 instead of clip-L with SDXL? I didn't. It's great!

    Felldude
    Author
    Sep 6, 2025

    Vit-Large-14 is the source of the vision for any of the clip-l models I have trained other then Gemma

    vhpSep 6, 2025

    @Felldude Oh. Interesting, well i am just dabbling here and found it interesting because i thought the vit-l was just for flux.

    Felldude
    Author
    Sep 6, 2025

    @vhp Even going back to SD 1.5 the CLIP has used Vit-L-14 but before mixed precision was handled well the entire model was saved in FP16 rather then mixed for SDXL it has a version of LAION G also. https://huggingface.co/stable-diffusion-v1-5/stable-diffusion-v1-5/blob/main/text_encoder/config.json

    ElectricDreamsOct 28, 2025
    CivitAI

    Sorry i'm a noob with comfy. Can anyone guide me to a workflow to merge clip L on a checkpoint? thanks.

    Felldude
    Author
    Oct 28, 2025

    It depends on the model in question but SDXL is dual clip loader with CLIP-G and CLIP-L

    coochieNov 29, 2025
    CivitAI

    Regarding the merged linked checkpoints, wouldn't that nullify any of the custom training or concepts introduced by the model? I would think checkpoint merges also merge CLIP weights but I'm not very knowledgeable about this stuff.

    Felldude
    Author
    Jan 30, 2026

    None of the models listed above are merges they are distillation using MSE comparative contrast loss of clip teacher/student

    Checkpoint
    SDXL 1.0

    Details

    Downloads
    381
    Platform
    CivitAI
    Platform Status
    Available
    Created
    9/6/2025
    Updated
    5/12/2026
    Deleted
    -