Hunyuan Video (Safetensors) - New Uncensored Llama - Kijai-Diffusion-FP8

NSFW

Hunyuan Video

Kijai marked files only for use with Kijai Nodes You do not need them for Comfy Native

Full Guide to picking the correct file above
Workflow for 8GB Card users
Uncensored llama will work with COMFY Native

Using the Kajai marked models on COMFY native will cause rainbow or black output.

I do not recommend the the FP8 VAE unless you are trying to fit all models into GPU, see the guide for 4090 full GPU launch commands.

Technical details regarding "Uncensored"

The model used for Hunyuan was based on llava-llama-3 8 billion parameter LLM. The Intel vision tuned model was used to refine the tokenized model restoring over 5 million values.

Description

FAQ

Comments (118)

A_Friendly_SpiderDec 18, 2024

CivitAI

I love the effort you put in to all this, but what exactly does this do entirely? Video2Video?

Felldude

Author

Dec 18, 2024· 1 reaction

It is a multi use model with text2img, txt2video, and video2video, it might have text to 3D also

sc2protection718466Dec 18, 2024

CivitAI

Hey, so could you help me understand what this means. I already have a running workflow using hunyuanvideo wrapper.

I use the previously existing 8fp.

I have a gtx1080ti (11gb vram)

And now I have this "scaled" instead but I don't notice any difference.

Any ideas what this does in practice ?

lee864Dec 18, 2024· 1 reaction

according to the Tencent site, it can save ~10gb of vram (https://github.com/Tencent/HunyuanVideo/commit/5ab5edef9ac734966d5358e793470900ba7db189). Now, that's assuming you configure it appropriately, which i personally haven't figured out how to do yet ;-) i think we'll find out more soon

Felldude

Author

Dec 18, 2024· 1 reaction

The stored size is still a factor when it comes to loading the model, right now they do not have dynamic block swapping but that may come in the future. - For those with a ADA generation card FP8 with FP8 attention could be faster then BF16 but for most that is not the case assuming the model can fit into VRAM

3427221Dec 20, 2024· 1 reaction

CivitAI

no idea what's wrong with the fp8 on my end, can't make it work, only output video noise and give error : unet unexpected: ['double_blocks.0.img_attn_proj', 'double_blocks.0.img_attn_qkv', 'double_blocks.0.txt_attn_proj', 'double_blocks.0.txt_attn_qkv', 'double_blocks.0.img_mod.linear', 'double_blocks.0.img_mlp.fc1', 'double_blocks.0.img_mlp.fc2', 'double_blocks.0.txt_mod.linear', 'double_blocks.0.txt_mlp.fc1', 'double_blocks.0.txt_mlp.fc2', 'double_blocks.1.img_attn_proj', 'double_blocks.1.img_attn_qkv', 'double_blocks.1.txt_attn_proj', 'double_blocks.1.txt_attn_qkv', 'double_blocks.1.img_mo (about a full page of it, just a sample here) anyway bf16 work (very slowly but heh)

Felldude

Author

Dec 21, 2024

Did you make sure scaled FP8 was selected, is comfy updated

3427221Dec 21, 2024

I updated it internally, I'm not sure about fp8 scaled though, where does this appear ? Maybe I need to download a new COMFY from the site (I use portable version, so i'm not sure the update is absolute latest when I do)

Felldude

Author

Dec 21, 2024

@NoArtifact Im not sure if https://github.com/kijai/ComfyUI-HunyuanVideoWrapper works with portable

3427221Dec 21, 2024

@Felldude Yeah I think y comfyUI version maybe the problem, i'll check with other version later, thanks for the info

edit : And I'm lucky, the latest portable version is only 7 hours ago fresh ;)

4809817Dec 31, 2024

@Felldude you keep mentioning this but you never specify which node you're talking about, I mean.. I see that I have a llava_llama_3_FP8_scaled.safetensors selected, but other than that I have no idea what you mean.

MrReclusive666Dec 21, 2024

CivitAI

any chance you could explain why the need for such a massive llm for this?
There is smaller llm's, ones based on Llava llama 3b just like the one they provide, but much smaller, i tried to look at the wrapper to see if I could force it to use a smaller llm, but yeah, over my head.

Felldude

Author

Dec 21, 2024

They use a math equation but simply the parameter size of the LLM needs to roughly match the model

NephilimDec 22, 2024

CivitAI

there's any way to use the fp8 version with the comfy native node(load diffusion model)?

Felldude

Author

Dec 23, 2024· 1 reaction

Only when comfy adds the architecture definition to the main git

2770379Dec 23, 2024· 5 reactions

CivitAI

I honestly cannot tell a difference in quality between the FP8 and BF16. Both put out really good video.

tylerburden100Dec 23, 2024· 1 reaction

It seems like you can get more motion "jitter" with FP8, at least in my experience.

2770379Dec 24, 2024· 1 reaction

@tylerburden100 Possibly, after more testing I have noticed what appears to be... I guess just less general movement and detail. Minor, minor details, but less.

psspsspsspssspssDec 24, 2024

CivitAI

I can't get the fp8 model to work with comyui native, is this version specifically for kijai's nodes?

Felldude

Author

Dec 24, 2024· 1 reaction

You need to set FP8 Scaled and right now I think only Kijai's nodes work

4809817Dec 31, 2024· 5 reactions

ok.. nobody is describing what is happening. I am getting nothing but noise on my output, is that the same as you?

ValuedRenderJan 2, 2025· 1 reaction

CivitAI

I'm just getting noice, do you have workflow example ?

Felldude

Author

Jan 3, 2025

It uses Kijai nodes

ReelaiJan 3, 2025

CivitAI

Unfortunetly, i am getting really bad quality whatever i do :( Nodes, models are good and intact. Output is like took a blur brush shower. :D

FrenzyXJan 3, 2025

I have the same with the 25GB model, going to try the FP8 now

ReelaiJan 3, 2025

@FrenzyX fp8 is the same for me. :(

FrenzyXJan 4, 2025

Ended up downloading the models for the Kijai wrapper nodes, those seem to work well for me.

ReelaiJan 4, 2025

@FrenzyX Aren't they embedded on comfyui or did you download it from github, again ?

FrenzyXJan 4, 2025

@Reelai I think his checkpoints are packed differently, tried a mixed approach first, but ended up using all the checkpoints provided for the nodes from github. Also for me the encoder didn't automatically download, so I manually downloaded that as well. Might have been caused by me not having git lfs in advance, which I do now. Any way, took some tinkering and problemsolving but I am up and running now.

TurboCoomerJan 6, 2025

increase sampling steps count

ReelaiJan 10, 2025

@FrenzyX I found out what is the real problem. If you have 3 seconds of video ( no more than 3 seconds) details will be okay while denoise is 1.0. However, if you have more than 3 seconds of video, you need to lower denoise gradually.

FrenzyXJan 10, 2025· 1 reaction

@Reelai thanks, that's good to know, might try it out again in the future, as it might enable some workflows that I can't get to work atm

bhoppingJan 4, 2025· 11 reactions

CivitAI

We're gonna get img2vid locally before gta6

throwawayacc123Jan 4, 2025· 1 reaction

CivitAI

Hi,
I used the 'hunyuan video fp8' vae, and the model I downloaded, but when I try running the example workflow from the hunyuan Video Wrapper in my custom nodes for comfyui (hyvideo_t2v_example_01.json), the output is a static... where is my mistake ? :(

Unhing3dJan 22, 2025

Did you ever manage to fix this? I am having the same issue, and only with this model

throwawayacc123Jan 24, 2025

@Unhing3d yea, the problem was with the clip, i was using bf16 in configs for fp8 model, i downloaded the other model for the right config (fp16 or bf16, i forgot) and it worked

YinsenJan 5, 2025

CivitAI

can you upload this to a platfom on browser? I have nowhere near the GPU needed :(

Felldude

Author

Jan 10, 2025

I have no clue if it is hosted cloud, unless block loading or Q4 quant is supported I don’t have the card to run it either

Hamsome_SkidwordJan 5, 2025

CivitAI

is there an extantion for webUI Forge that will let me run this without ComfyUI?

bhoppingJan 5, 2025

Not yet, if you've been using forge for awhile, you can learn comfyui relatively easily. The layout is different but you'll recognize things and get use to it. Highly recommend.

Hamsome_SkidwordJan 5, 2025

@bhopping I've used Comfy in the past, and other node based systems like Blender's and UE5's. the thing i didn't get about it was trying other peoples workflows, many nodes weren't available for some reason. but i might as well try it again for this

bhoppingJan 5, 2025

@Hamsome_Skidword yeah that def makes it more confusing. Make sure you got custom node manager so it can detect what nodes to install when using other ppls workflows

Hamsome_SkidwordJan 6, 2025

@bhopping well... my 6gb card wasn't enough, guess i'll wait for either a lower cost model, or until i can buy a super computer

bhoppingJan 6, 2025· 1 reaction

@Hamsome_Skidword Hmm, I found a reddit post saying someone did it with only 6gb here on this reddit thread. https://www.reddit.com/r/StableDiffusion/comments/1ho2elu/all_in_one_custom_workflow_vid2vid_and_txt2vid/ I can't confirm if this works but he seems to share his workflow and has a youtube video on it as well in the thread.

When I started, I got a lot of vram errors. Turning the res down, vae decode tile and length helps with that. Hope this works out for you.

bhoppingJan 6, 2025

@Hamsome_Skidword Also which hunyuan video model were you using?

Hamsome_SkidwordJan 6, 2025

@bhopping i was using this model, but maybe that was the problem? maybe i should use the FP8 model from GitHub. and thanks for the help

bhoppingJan 6, 2025

@Hamsome_Skidword Np. The bf16 model is pretty resource heavy for the very slight quality improvement (if any) and theres even a fastvideo version of bf8 on github/huggingface which lets you get away with 7 steps from what I believe.

Hamsome_SkidwordJan 6, 2025

@bhopping i'll definitely have to check those out

DroneMeOutJan 6, 2025

CivitAI

do I need this if I am running a RTX 6000 Ada? I have plenty of VRAM so far. Will this be faster? Also, I am using ComfyUI portable.

SpockeJan 6, 2025

CivitAI

is 4gb gpu good enough?

testlh123611Jan 7, 2025

no,4gb is too small

nogoJan 9, 2025

I believe you need a lot. Something like 24GB +

Felldude

Author

Jan 10, 2025· 1 reaction

@nogo 16Gb is possible with FP8 - 8Gb should be possible with Q4 and if the model doesn’t break it might work in 4GB with Q2

nogoJan 10, 2025

cheapest nvidia 16GB card I can find is about 650€+
not a cheap endeavour

nogoJan 10, 2025

@Felldude ca. how long does the average low res text to video generation take for a short clip?

Felldude

Author

Jan 11, 2025

@nogo If your trying to do CPU something like 30 hours

nogoJan 12, 2025

@Felldude I mean how long if you have a capable GPU

Felldude

Author

Jan 12, 2025

@nogo I have not seen generation times for a 4090 I would guess an average of 15 seconds per it

RandomPeachJan 7, 2025

CivitAI

Are there any tips to prevent the video from rendering slo-mo video? I've always got the output to be at 24fps, but the actual motion of the characters is often in clow motion, sometimes not.

_reptiliano_Jan 7, 2025

CivitAI

good job!!! is it done with a workflow?

DocueiJan 7, 2025· 13 reactions

CivitAI

You: Alright everyone, lets all have some fun with a video gen model.

Us: Seeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeexxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!

GeekModeDadJan 8, 2025· 5 reactions

CivitAI

New to trying out Hunyuan Video, can someone explain exactly where I need to put this file or if there is something else I need also?

daily_insightJan 10, 2025· 1 reaction

If you're using ComfyUI, save it on the checkpoints folder

adsazamet679Jan 14, 2025

@daily_insight after we need to select it from "CogVideoX Model Loader, PyramidFlow Model Loader or on LTX Load Checkpoint"?

yuinyan490363Jan 9, 2025· 7 reactions

CivitAI

Can it run on Forge

redlittlerabbitJan 12, 2025

I'd like to know, too. Any good articles on what it is?

AlienPleasuresJan 14, 2025

yeah..how can we run video on forge?

ygmdirFeb 20, 2025

Don't believe so... only ComfyUI atm

BerlobupisJan 9, 2025· 14 reactions

CivitAI

When I try using this, the output ends up being rainbow pixels that looks like a static image on a TV. I am using the correct VAE. I am also using clip l and llava_llama3_fp8_scaled

sueanpina1Jan 10, 2025

yeah, me either

HappyTrAIlsJan 10, 2025

Same. Trying some other settings and will report back if I get it to work.

HappyTrAIlsJan 10, 2025

No dice. Tried changing every setting and still just getting static.

2885872Jan 10, 2025

Are you using the Comfy Native Nodes? I experienced what you're describing with this Model Card's FP8 file when using the native node workflow from Comfy's site/blog. I switched to the BF16 (25.6 GB) file from HuggingFace and was then able to generate actual video. FYI, though, I have a GPU w/ 20GB VRAM myself, and have no way of knowing whether you might face lower VRAM limits. If you do have lower VRAM, you can indicate one of the FP8 representations in the drop-down in the native node loader, and then I think it does a 'cast' to that lower precision. YMMV. Good luck!

bearfrogcatJan 12, 2025

Same here.

BloodsugaJan 12, 2025· 1 reaction

Same, looks like I can only generate with the full bf16 model.

2885872Jan 12, 2025· 1 reaction

Since I first made my finding about BF16 rectifying the issue of generating just static with Comfy's officla workflow and only native nodes, I found some similar troubleshooting discussion in a comment on a different Workflow posted here: https://civitai.com/models/1081086/comfyui-hunyuan-text-to-video-using-loras . The issue (and a 'static'-filled generated video example) were posted there by a user 'supersuika'. This other Workflow's creator noted that the one supersuika had used (I think the official Comfy org one) ontained the 'Flux Guidance node and sd3 node', which his workflow do not, and I take it his does not succumb to that problem whewn using FP8 (I haven't tried his yet myself). So you might either fiddle w/ removing those nodes, or trying his workflow instead. I haven't turned my attention back to HYV as yet, so I can't say. Hope this helps peopel conclusively figure out why this combination bombs out like it does.

2885872Jan 12, 2025

Oh, I should add that I don't know what side effect might result (in functionality, or quality of outcome) from omitting either or both of those nodes. I assume they're there for some reason. I just haven't gotten around to experimenting yet. Forgot to point that out specific possibility of something else breaking when I made my last posting.

2984179239880Jan 13, 2025· 1 reaction

me either,why fp8 will happen this ？

k3softJan 16, 2025

same here, Why is FP8 like this? It seems that only the BF16 model can be used

sensdiffJan 16, 2025

Yeh me too, any ideas what is going on?

JhaikJan 16, 2025

Same here, can't get it working

6793031Jan 17, 2025

@Jhaik did you find a fix ?

Felldude

Author

Jan 20, 2025

@Baskets521 If it is any consolation I get rainbow also, when trying to use the COMFY native nodes, the kaji nodes work for me but they don't have CPU offloading so I go from 6-15 seconds per IT to 60-90 on the CPU

DomDomTomTomJan 21, 2025· 1 reaction

I have the same issue. You're better off using the FP8 model from Kijai if you want to be able to use the native Comfy nodes: https://huggingface.co/Kijai/HunyuanVideo_comfy/tree/main

Just get: hunyuan_video_720_cfgdistill_fp8_e4m3fn.safetensors

sensdiffJan 22, 2025· 1 reaction

@DomDomTomTom Yes that one works, but what's the difference? Is it worse than this other fp8 model?

Felldude

Author

Jan 22, 2025

@sensdiff I serialized in the timestamps provided by tencent for the vision tree but COMFY prunes those, I still don't know why it appears to be fine for some and not for others

sensdiffJan 22, 2025

@Felldude What?

Felldude

Author

Jan 22, 2025

@sensdiff The ignored block message you see when using COMFY native, same thing with the VAE, the comfy UI native trims a considerable number of blocks from all models including the VAE, kaiji nodes do not

sensdiffJan 23, 2025

@Felldude It works when using it with the kijai nodes, just make sure to select "fp8_scaled" as the quantization option.

Minase460Jan 25, 2025

Same problem. What is the fix?

darkdJan 10, 2025

CivitAI

Hi. Maybe it works on a 12gb nvidia geforce rtx 3060? And which of these models could work? Thanks for the help.

bhoppingJan 11, 2025· 3 reactions

fp8 model should work people have gotten away with just 8gb of vram. there's workflows suited for exactly 12gb here https://civitai.com/models/1048302/hunyuanvideo-12gb-vram-workflow
There could be better ones out there tho

scooter_deJan 14, 2025· 2 reactions

I run it on that GPU.

darkdJan 14, 2025

@scooter_de That's great. What workflow did you make it work with? and anything to keep in mind in configuration?

scooter_deJan 15, 2025· 1 reaction

@darkd I used this workflow https://civitai.com/models/1079810?modelVersionId=1212334

darkdJan 16, 2025

@scooter_de Hello thanks for your help, but I have a problem generating the video, the video is just noise, maybe that happened to you and you were able to solve it?

scooter_deJan 17, 2025· 1 reaction

@darkd I'm trying to publish my workflow. I reduced what I had found here to the bare minimum. That way a user could start with it and extend from there. I found many example here too complicated if one only wants to try the basic functionality.

scooter_deJan 17, 2025· 1 reaction

I just posted my workflow here: https://civitai.green/user/scooter_de/models?section=published

darkdJan 17, 2025

@scooter_de Thanks, I'll check it

Fonx104Jan 10, 2025

CivitAI

Guys the model load stuck at 35% with "model_type FLOW" stuck on cmd, anyone know how to fix it ?

TheKnightsWhoSayNIJan 11, 2025

CivitAI

I'm new here and I have doubts about using it in comfyui

I had downloaded the full version (but it doesn't seem to run on my 4080 super with 2x8gb ram, any tips?

Full> hunyuan_video_vae_bf16 (400mb) + llava_llama3_fp8_scaled (8.8gb) + hunyuan_video_t2v_720p_bf16 (25gb), is that right? https://comfyanonymous.github.io/ComfyUI_examples/hunyuan_video

Now I downloaded the Full Model fp8 version (12.8gb) but my program still can't run.

The Llava Llama TE version available here (Full Model fp16) weighs 16gb, while llava_llama3_fp8_scaled weighs only 8GB, which one should I use? use?

The VAE version available here is also twice the size (Full Model fp32 (940.27 MB)...

My problem is the RAM Memory I believe, I'm trying to use the workflow available for 12GB Vram but I'm not able to configure it correctly

Can you help me choose the correct Moddel, Text encoder and Vae?

I'll be able to run with only 16gb ram + 16gb Vram?

Felldude

Author

Jan 12, 2025· 1 reaction

You might have a system ram issue, if you have a solid state or nvmec try setting the virtual memory to 50GB

maxbob555Jan 14, 2025· 6 reactions

CivitAI

it works but took me 5673.56 seconds haha a little long buuut it works

yuinyan490363Jan 17, 2025

This speed has no value anymore

6793031Jan 17, 2025

can you show me a screenshot of the workflow. i keep getting static rainbows.

azeliJan 21, 2025

You're doing something wrong.

chrisss1Jan 14, 2025· 7 reactions

CivitAI

Does anyone tried it in Forge? is it works at all?

1246388Jan 20, 2025· 2 reactions

Was wondering that too. Forge is my UI of choice.

chrisss1Jan 20, 2025· 1 reaction

@Dzban Same, i just can't use any other. Sucks the devs kinda bit abandoned the project.

bhoppingJan 22, 2025· 1 reaction

If you're familiar with web1111/forge I highly recommend trying out comfy. You'll recognize a lot of things but ofc the UI is different but it'll click no more than a day. I was too impatient for it to come to forge so I switched and also noticed quality improvements in my images as well

chrisss1Jan 22, 2025

@bhopping Sounds interesting. What kind of quality improvements you get with comfy? isn't it just another UI for the stable diffusion? I'm pretty sure my PC wouldn't handle comfy, that's why i'm using Forge.

bhoppingJan 22, 2025

@chrisss1 I noticed the faces at lower resolutions looked a lot clearer, so I didn't have to waist time upscaling. It could've just been bc i was using karras instead of uniform for the schedular but I'm pretty sure it's bc the optimization come at a cost of slight quality? Other than that it's pretty much the same tho, I still like using forge every now and then

chrisss1Jan 22, 2025

@bhopping Yeah it's probably due to samplers. I am getting pretty great results in Forge and forge also have many options to improve quality at a cost of performance.

NorrbJan 22, 2025

Using auto1111. Thinking of installing comfy. But don´t want to break the auto1111 setup in case I want to use it after the installing comfy. Think it is possible to run both?

bhoppingJan 22, 2025· 1 reaction

@Norrb ComfyUI is a whole separate installation and won't affect your web1111. You can technically have them both open though, but they're two different programs. Also you can configure comfyui to share models, loras, and etc with your web1111 so you don't clog up your space. Give comfy a shot, the learning pays off

bhoppingJan 22, 2025· 1 reaction

@chrisss1 Yeah once hunyuans open-source plan is all done, I'd assume forge would finally get hunyuan support bc of how popular it is

chrisss1Jan 22, 2025

@bhopping Hopefully!

NorrbJan 23, 2025

@bhopping Thanks for your answer! I´ll give it a try then.

Checkpoint

Hunyuan Video

by Felldude

Download (Beta) View on CivitAI

hunyuan

tencent