Update your ComfyUI to the latest (Github version) => cd to ComfyUI directory -> terminal -> git pull -> restart ComfyUI
The upscale workflow file contains Wan FlashVSR, which is the fastest but resource-heavy at 10GB VRAM. The Hunyuan SR workflow is time-consuming but VRAM-friendly
Lightx2v 4steps LoRA: https://civitai.com/models/2162543
Prompt guide: https://civitai.com/articles/22889/hunyuan-15-sudio-prompt-generator-and-guide
CFG 1 Steps 30-50 [ with no 4steps LoRA ]
Workflows in the zip files (i2v & t2v) + download links (text encoders, vae, clip vision, upscalers)
Type: Lightweight, open-source video generation model (Diffusion Transformer, 8.3B parameters)
Capabilities: High-quality text-to-video (T2V) and image-to-video (I2V) synthesis
Efficiency Features: Selective and Sliding Tile Attention for faster inference on consumer GPUs
Additional Support: Bilingual prompts, integrated super-resolution to 1080p
Performance: State-of-the-art visual quality and motion coherence among open-source models
These models are redistributed here for the sake of convenience.
Description
480p i2v CFG Distilled FP8 Scaled
Details
Files
Available On (1 platform)
Same model published on other platforms. May have additional downloads or version variants.