Hunyuan Video MultiGPU Setup - CivArchive (CivitAI Archive)

Hunyuan Video MultiGPU Setup - HunyuanVideo Wrapper File

Based on hunyuan video wrapper by kijai.

Update 1/1/25

This adds device selection nodes to hunyuan video wrapper.

I have included the workflow.

AND there is a download for the changed files if you want to just manually replace the 2 files in /custom_nodes/Comfyui-HunyuanVideoWrapper/

Or you can download the git from here.

https://github.com/MrReclusive/ComfyUI-HunyuanVWMultiGPU

This has be recoded from the last version to make this all optional, so I could get the pull request approved.

so this IS coming to base hunyuanvideowrapper.

I've modified the nodes to allow selection of device for those with multiple gpu's, this was necessary for me as my system has 2 4090's on a 1x pcie switch, so unloading and loading was slower then generating, with 2 4090's and the 4080 I don't have to unload anything during generation anymore.

Description

This has the 2 files needed just to update hunyuan video wrapper to work with this.
overwrites nodes and nodes_rf_inversion.

Will remove this once pull request is complete.

FAQ

Comments (12)

martossienJan 8, 2025

CivitAI

Hello, i have 2 gpu , 2 RTX 3090 and other
i put the two file and load your workflow without error.
difficult to change cuda device in cuda device selector
and
if i stay with default parameters , i have one error :
HyVideoVAELoader.loadmodel() got an unexpected keyword argument 'cuda_device' is:issue

sorry :
other question , lm studio or other loader can make load sharing on multi gpu with gguf llm , do you know if it can make it in futur on Hunyuan video, or in the roadmap of someone ?

MrReclusive666

Author

Jan 9, 2025

not sure how its difficult to change, its a drop down list, my only guess is what "other" is in your system.. and for the error, its all optional so with nothing connected to the cuda input on the loader it should just be ignored.

I do know the hunyuan guys are working on a multigpu version, but where that is in terms of timing unsure, looks like image to video is next in the roadmap.

ZephyredJan 12, 2025

CivitAI

Any idea when you will merge with the latest commits? Looking for TeaCache. Thanks!

MrReclusive666

Author

Jan 12, 2025· 1 reaction

the version I have on github has teacache and dpm scheduler, I haven't looked today if anything changed, but every time I updated they would update again like a few hours later so I was waiting for that to chill out a bit before I did a full merge again.

softwareweaverJan 13, 2025

CivitAI

I have 4 GPUs but cuda device selector shows only cuda:0 when I click on it. Is there some other configuration that needs to be done?

MrReclusive666

Author

Jan 14, 2025

what gpu's do you have? and are any of them in nvlink?

softwareweaverJan 14, 2025

@mrreclusive3545 4 RTX 6000 Ada. No Nvlink. Transformers and other python programs recognize them.

softwareweaverJan 14, 2025· 3 reactions

Solved it. I was running ComfyUI in a batch file with
python main.py --cuda-device 0 --listen 0.0.0.0 --port 8180

which was restricting the GPUs visible
Running it without the cuda device parameter, it works.
python main.py --listen 0.0.0.0 --port 8180

MrReclusive666

Author

Jan 17, 2025

@softwareweaver glad you solved it, do you by chance have the capability of nvlink? i am curious how comfy and all this sees nvlinked devices, I've been thinking of picking up refurbished 3090's specifically for nvlink and AI video.

softwareweaverJan 19, 2025

@mrreclusive3545 Unfortunately. I don't have access to any devices with NVLink. Best bet would be someone with couple of 3090s.

omnisaaFeb 9, 2025

@mrreclusive3545 if you dont mind use linux, there is a hack driver can use pcie to do a "fake nvlink" p2p mode for multi-gpu, its seem work for llm but im not sure for sd

MrReclusive666

Author

Feb 17, 2025· 1 reaction

@omnisaa not sure that would be of any use in my setup, my 2 4090's share a 1x pcie slot(on 4 port pcie riser), which is the reason I had to create this in the first place was because the block swap in the wrapper was so slow because of that, it would actually run faster on the 4080 I have on the board as it has a full x16.
but now with hunyuan being fully integrated in comfy I no longer use this as the other multigpu addons work for it, and with teacache and all that i don't even use the fast models anymore.

Workflows

Hunyuan Video

by MrReclusive666

Download (Beta) View on CivitAI

tool

Details

Downloads

Platform

CivitAI

Platform Status

Available

Created

1/1/2025

Updated

7/25/2026

Deleted

Files

hunyuanVideoMultigpu_hunyuanvideoWrapper.zip

Size:

18.64 KB

SHA256:

126c98ada3bb6f989610646e49c175d614fadab145fcc2bb9234128faaaaffc9

Mirrors

HuggingFace (1 mirrors)

hunyuanVideoMultigpu_hunyuanvideoWrapper.zip

CivitAI (1 mirrors)

hunyuanVideoMultigpu_hunyuanvideoWrapper.zip

Description

FAQ

What is Hunyuan Video MultiGPU Setup?

What files are available and where can I download them?

Comments (12)

Details

Files

hunyuanVideoMultigpu_hunyuanvideoWrapper.zip

Mirrors