Wan Video
Note: There are other Wan Video files hosted on Civitai - these may be duplicates, but this model card is primarily to host the files used by Wan Video in the Civitai Generator.
👍 SOTA Performance: Wan2.1 consistently outperforms existing open-source models and state-of-the-art commercial solutions across multiple benchmarks.
👍 Supports Consumer-grade GPUs: The T2V-1.3B model requires only 8.19 GB VRAM, making it compatible with almost all consumer-grade GPUs. It can generate a 5-second 480P video on an RTX 4090 in about 4 minutes (without optimization techniques like quantization). Its performance is even comparable to some closed-source models.
👍 Multiple Tasks: Wan2.1 excels in Text-to-Video, Image-to-Video, Video Editing, Text-to-Image, and Video-to-Audio, advancing the field of video generation.
👍 Visual Text Generation: Wan2.1 is the first video model capable of generating both Chinese and English text, featuring robust text generation that enhances its practical applications.
👍 Powerful Video VAE: Wan-VAE delivers exceptional efficiency and performance, encoding and decoding 1080P videos of any length while preserving temporal information, making it an ideal foundation for video and image generation.
Original HuggingFace Repo: https://huggingface.co/Wan-AI
Description
FAQ
Comments (21)
Forge?
literally impossible. the only way to run video models is with comfyui
Literally possible, but it's not WAN, it's based on Hunyuan: https://github.com/lllyasviel/FramePack
We need some sort of Forge/1111 type UI for video. I absolutely hate using Comfy.
Experienced the online generator of WAN2.1 and LoRA training! It's so shocking. Although the video model has caused some trouble, it cannot be denied that it is the future ! =)
can u add also the I2V Version from
14b 480p and 720p fp8_e4m3fn
COMFY? How do we use it on PC?
Here is a good guide to start
https://www.reddit.com/r/StableDiffusion/comments/1j209oq/comfyui_wan21_14b_image_to_video_example_workflow/
https://comfyanonymous.github.io/ComfyUI_examples/wan/
Also this? about upsampling
https://comfyui.org/en/boost-video-creation-with-rife-upsampling
You need lots of RAM though, additionally to 8gb of VRAM at least, it seems.
@Lostcut Thank you very much! I have 12gb of VRAM.
Now I can try it out.
@qinglv46782 good luck then)
How do I achieve the smooth quality videos?
It's currently at 24fps. Do I just increase it to 60 fps? Also how do I increase the length?
Just use another tool on the video like Video2X and use the frame interpolation feature to add more frames.
After some testing I can say Wan 2.1 is almost certainly the best IMAGE model I have ever used, super good prompt following and realism, every image is a banger, it seems flexible on resolutions and samples, nice and fast, I am going to have fun with this!
Same. Who would have thought that a video model would be such a banger for still images. It's a nice refresher after using Flux Dev for a while. It does some really nice images that in a lot of cases rival and even surpass Flux Dev. And paired with the RES4LYF ClownSharK sampler and nodes, it really starts to shine.
And the fp8_e4m3fn quant isn't even the best one. There's "wan2.1-t2v-14b-F16.gguf" by city96 which is FP16 quality converted from the full-fledged FP32 model, and it produces even better quality images and videos, with even more detail and textures. I use it instead of the fp8_e4m3fn version and I'm really pleased with the results.
mmdd2543 I will try that out, thanks . The only think Flux does a lot better is text, I will have to see what WAN 2.2 is like when it release today.
Not sure if im doing it right, im using 720p , wan21 vae, latent video 720*720, clip wan umt5, and im getting a green blurry mess on comfy for the output.
Im using the prebuilt comfy wan workflow.
Wan 2.2 is missing
how to download it to forge?
whete put the file? IDK, on one video on YT
Details
Files
wanVideo21_wan21T2v14BFp8E4m3fn.safetensors
Mirrors
wan2.1_t2v_14B_fp8_e4m3fn.safetensors
wanVideo_wan21T2v14BFp8E4m3fn.safetensors
wan2.1_t2v_14B_fp8_e4m3fn.safetensors
wanVideo_wan21T2v14BFp8E4m3fn.safetensors
wan2.1_t2v_14B_fp8_e4m3fn.safetensors
wanVideo_wan21T2v14BFp8E4m3fn.safetensors
wanVideo_wan21T2v14BFp8E4m3fn.safetensors
wan2.1_t2v_14B_fp8_e4m3fn.safetensors
wan2.1_t2v_14B_fp8_e4m3fn.safetensors
wan2.1_t2v_14B_fp8_e4m3fn.safetensors
wan2.1_t2v_14B_fp8_e4m3fn.safetensors
wan2.1_t2v_14B_fp8_e4m3fn.safetensors
wan2.1_t2v_14B_fp8_e4m3fn.safetensors
wan2.1_t2v_14B_fp8_e4m3fn.safetensors
wan2.1_t2v_14B_fp8_e4m3fn.safetensors
wan2.1-t2v-14b-fp8_e4m3fn.safetensors
wan2.1_t2v_14B_fp8_e4m3fn.safetensors

