CivArchive
    Captioning With Extra Steps - Workflow (ComfyUI) - v1.0
    NSFW
    Preview 93646491
    Preview 93646490
    Preview 93646488
    Preview 93646486
    Preview 93646494
    Preview 93646485
    Preview 93646489
    Preview 93646496
    Preview 93646487
    Preview 93646493

    📌 Optimal Setup Guidelines

    📝Overview

    This ComfyUI workflow seamlessly combines WD14, JoyCaption, and Ollama to generate detailed and accurate captions from images. While it is not 100% accurate, it gets the job done well for most cases. Some images may require running the workflow 2-3 times to achieve the best caption results.

    📦 Required Packages - To ensure seamless functionality, please install the following ComfyUI custom nodes:

    1. ComfyUI-WD14-Tagger

    2. ComfyUI-Ollama

    3. ComfyUI-Prompt-Reader-Node

    4. ComfyUI-JoyCaption

    5. Python Interpreter Node

    6. ComfyUI-Various

    7. Dolphin3

    8. Two more inside zip file

    ⚙️ How to Use: Key Nodes Explained

    This workflow centers around three main nodes that control captioning and tag processing. Here's how to use each:

    1. JoyCaption Instruction

    • Purpose: Generates initial image captions using the JoyCaption node.

    • Usage:

      • Feed the image input into this node.

      • Adjust style or detail level if available in the node parameters to customize caption output.

      • The generated captions serve as a base for further refinement.

    2. Ollama Instruction

    • Purpose: Refines and enhances captions by leveraging Ollama’s language model.

    • Usage:

      • Take the output text from JoyCaption as input here.

      • Use Ollama Instruction to add context, clean up, or expand on the caption text.

      • You can tweak prompt templates or parameters to better match your desired caption style.

    3. Remove Tags

    • Purpose: Cleans up unwanted or irrelevant tags from generated captions to improve prompt quality.

    • Usage:

      • You can configure the tag list to remove certain words or phrases.

      • Ensures the final caption is concise and useful for downstream tasks (e.g., text-to-image generation).


    Note:

    • Depending on the instructions provided to these nodes, the output may vary greatly. Experiment with different prompts and parameters to achieve the best results for your use case.

    • The current instructions are optimized to work well for both SFW (safe-for-work) and NSFW (not-safe-for-work) content.

    Description

    FAQ

    Comments (6)

    supernova_aiAug 11, 2025
    CivitAI

    What model or API do you use for comfyui-ollama?

    UnholyDesiresStudio
    Author
    Aug 11, 2025

    Latest you need to directly download it from their GitHub page

    UnholyDesiresStudio
    Author
    Aug 11, 2025· 1 reaction
    supernova_aiAug 11, 2025· 1 reaction

    UnholyDesiresStudio Thanks a lot.

    UnholyDesiresStudio
    Author
    Aug 11, 2025· 1 reaction
    CivitAI

    Just uploaded a new file with recent fixes.

    AnonymouspseudonymFeb 9, 2026· 1 reaction
    CivitAI

    Hi, there! There are broken nodes in the workflow that cannot be updated by ComfyUI Manager. I assume that these are custom nodes created by you(?) and that they are related to the two extra files in the zip file. Where do you place these extra files? I've tried throwing them in the same folder as the workflow file (default folder), as well as in the custom nodes folder to no avail. What am I missing?

    Affected nodes:

    JC_ExtraOptionsToString

    UDS_TextConcatenate

    Prompt Toggles: SD Prompt Reader / WD14

    Workflows
    Other

    Details

    Downloads
    755
    Platform
    CivitAI
    Platform Status
    Available
    Created
    8/11/2025
    Updated
    5/13/2026
    Deleted
    -

    Files

    captioningWithExtraSteps_v10.zip

    Mirrors