HiDream-O1-Image is a natively unified image generative foundation model built on a Pixel-level Unified Transformer (UiT) without external VAEs or disjoint text encoders, which natively encodes raw pixels, text, and task-specific conditions in a single shared token space - supporting text-to-image, image editing, and subject-driven personalization at up to 2048 × 2048.
every model deserves a decent workflow that can be a good starting point to play with it ...
download this: https://huggingface.co/Comfy-Org/HiDream-O1-Image/tree/main/checkpoints
Description
You'll be surprised...
