CivArchive
    Preview 25650259Preview 25645281Preview 25648791Preview 25650276Preview 25783955Preview 26281196
    • Workflow is in the showcase images

    • My recommendation is to use the "EasyCache" node and let it skip 2 steps. It good with very little quality loss

    • I never tested it without lightning lora so I don't know how many steps to use or the CFG. You have to experiment yourself

    • 880x1184 or 1328x1776 works best

    • Distilled CFG Scale: 3.5

    • Sampling method and Schedule type: DEIS with DDIM

    • Steps: 20

    • Resolution: 896x1152

      • Hires fix model: 4x_NMKD-Superscale-SP_178000_G

      • Steps: 10

      • Denoise: 0.3

      • Upscale by: 1.5

    • Lora Weight: 0.8. You have to experiment based on your prompts

    Description

    GPT4o Prompt:

    I am planning to train a LoRA for the Stable Diffusion text-to-image model, which uses the T5XXL transformer in its architecture. The prompts should be in natural language and follow a specific format. I will upload images and need you to help me create detailed prompts based on those images. The prompts should start with "Amateur photography of" and end with "on flickr in 2007, 2005 blog, 2007 blog." Always give me the prompt in a single paragraph.
    The format should be:
    Subject Description: Start by describing all the people in the image in detail. It is very important to include their race and ethnicity, physical attributes (such as height, build, skin tone, and hair color), facial features, attire, and any expressions or poses they are making. Be as specific as possible. Make sure to always include the build of the subjects (e.g., plus size, slim, petite) without missing it.
    Scene Description: Accurately convey what exactly the people are doing in the picture. Describe the setting, background elements, any objects they are interacting with, and the overall environment (urban, rural, indoor, outdoor, etc.).
    Image Quality Tags: Include descriptive tags that highlight the quality of the image. Use terms like slight motion blur, cluttered background, warm tones, bright natural light, high contrast, vivid colors, etc. These tags should reflect the mood and feel of the image as well.
    The final output should combine all these elements into a cohesive, detailed prompt that accurately reflects the image.