LTX2.3 FP4 Models
Because no one uploads FP4 here, and they should (FP4 works on 3xxx and above for VRAM savings).
LTX2.3 Distilled FP4ME - Distilled FP4 Mixed Extreme - 14.1GB**
LTX2.3 Full Dev FP4ME - Full Dev FP4 Mixed Extreme - 14.1GB**
LTX2.3 Official NVFP4 - From Lightricks - Updated 3/17/2026 - 21GB
LTX2.3 Dev NVFP4 - from Hippotes / LTX-2.3-various-formats - 18.15GB
LTX2.3 Dev FP4 STFO - from Kijai's Transformers only Scaled FP8 - 16.63GB**
LTX2.3 Distilled FP4 - My first FP4 - Transformers Only - 18.15GB**
** These are transformers only, or I broke the VAE/text projection :) They need separate VAE and text projection downloads.
I have a fix for lora issue on FP4ME models. Comfy Bathroom, custom lora loader with presets as well as custom config. https://huggingface.co/MrReclusive/LTX-2.3-FP4
The diffusion model loader/UNet loader seems to get confused by these; use the checkpoint loader even though the file has no CLIP/VAE.
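For a rough feel of where the file sizes above come from, here is a minimal sketch of the arithmetic. It assumes the NVFP4 layout (4 bits per weight plus one 8-bit scale per 16-weight block, so about 4.5 bits per weight) and that the remaining layers stay at 16 bits. The function name, the 20-billion parameter count and the FP4 fractions are made-up illustration values, not measurements of any model listed here.

```python
def estimate_size_gb(num_params: float, fp4_fraction: float,
                     fp4_bits: float = 4.5, other_bits: float = 16.0) -> float:
    """Rough checkpoint size in GB (10**9 bytes) for a mixed-precision model.

    fp4_fraction is the share of weights stored as FP4; the rest are kept
    at other_bits. fp4_bits defaults to 4.5, which assumes 4-bit values
    plus one 8-bit scale shared by every block of 16 weights.
    """
    if not 0.0 <= fp4_fraction <= 1.0:
        raise ValueError("fp4_fraction must be between 0 and 1")
    avg_bits = fp4_fraction * fp4_bits + (1.0 - fp4_fraction) * other_bits
    return num_params * avg_bits / 8 / 1e9


# Hypothetical 20-billion-parameter model, purely for illustration.
params = 20e9
for frac in (0.0, 0.8, 1.0):
    print(f"{frac:.0%} of weights in FP4: {estimate_size_gb(params, frac):.1f} GB")
```

The more layers a "mixed" quant leaves at higher precision, the further the file drifts above the pure 4.5-bit figure, which is one plausible reason FP4 files of the same model differ in size.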
Description
Official FP4 from Lightricks, updated 3/17
Comments (41)
Is there no FP8 model at all?
Distilled Q8 is better than FP8 for me: it's faster and gives better results. For LTX-2.3 specifically, I haven't tested all models.
yes, but other people are uploading those, I just started sharing the fp4's because no one shares fp4's on civit
Groans, i already had the nvfp4 version from March 7, guess it's time for a huge download again :/
lol, yeah, my hard drive is mad at me, not only do i have all these different fp4's, i have the full bf16's for working on my custom fp4's.. 250gb just in ltx2.3 models.
@MrReclusive666 woah, and here i was complaining because my Comfy folder is 950GB already xD
@artificialotaku yeah, add in all the other stuff like hunyuan video, ltx 2, flux2, etc.. yeah, this 2tb nvme is just for comfy...
@MrReclusive666 agreed, i will buy a new 2TB SSD just for comfyui, I hadn't planned ahead before i started using it.
Can't load Vae and Text projection in default LTX 2.3 workflow. Custom workflows are exploding as usual, I can't hit download/install custom nodes 1000 times, restart, try updating comfyUI etc, nothing changes.
i never used "default" ltx workflows, they never work for me because multigpu.
i use vaeloader kj audio for the audio vae, vaeloader multigpu for video vae, and dualcliploader for text projection + gemma 3.
i don't even think I have seen the default workflows for ltx2.3, i just learned from the ltx2 ones they don't work at all for me.
oh and updating comfyui, not sure what version you are using, but i know on the portable comfy, the update inside the program itself doesn't work right on these major updates, you need to use the updates inside the update folder, there is an option for update everything that will update comfy and all python crap, but, be careful, a lot of shit broke in that update.
@MrReclusive666 Thanks man. I'm using the ComfyUI Desktop app. I tried ComfyUI portable a couple of times in 2024/2025, and it's always been completely incomprehensible to me. The new ComfyUI Desktop app is actually super useful, with an internal "app store" (free and open source) for loading workflows and downloading things. But... when I just drop a workflow in and try to download the nodes, it rarely works.
@Foxdude yeah, i haven't used the desktop app yet, but i have been building my workflows from now on with the "app" thing configured. i like the app thing really, kind of nice when i'm laying in bed playing with it on my phone.
@MrReclusive666 It's very cool tbh, and they'll fix the issues. It's gone from nothing to impressive in a very short time. I'm primarily going to use LTX for a visual novel. I wish it could do NSFW, it's vastly superior to Wan imho.
@Foxdude lol, that is EXACTLY why i got into ai video, i make a visual novel, 3d render everything, animations were always day long processes, but, with ai video, i can just render key frames now and let ai interpolate.
@Foxdude biggest problem was though, i got everything setup and ready to go in ltx2. then 2.3 dropped. like really?! damnit, start all over.
@MrReclusive666 Hah that's so cool man!
ComfyUI portable has been getting broken by ComfyUI updates too. Comfy Manager is totally fucked most of the time and has had the same update message for around 5 months now, but obviously changes have been happening. Downloading updates constantly fails too due to security settings, even if it's an official node that the Manager has listed as verified. Basically anything that is a nightly update fails. This also means that workflows with a bunch of custom nodes won't work, and workflow uploaders rarely post links or information on these custom nodes.
@LetTheBassDrop yeah, comfy is a huge mess atm, i refused to update for the longest time, but kind of had to for ltx2.3. 16 hours to get it running again, and i still haven't fixed everything, just what i needed to get it running. and if i remove --disable-dynamic-vram from my batch file, it won't even start, it just crashes with no message. the dynamic vram should have been optional, not forced on everyone, it broke sooo sooo much.
Do you have to use the distilled lora with these and if so, is there a model that doesn't require the distilled lora?
you don't need the distilled lora on the distilled model, you don't need the distilled on full either if you want to run full steps.
@MrReclusive666 Thanks for the reply Mr. Reclusive but I've tried every available workflow I could find and tried altering stuff (raising steps to 50. cfg of 5+) and the end result is always distorted without the lora. I've tried bypassing the upscale section of workflows that integrates the lora and upscaler. I'm at a loss on how to pull it off. It only seems to work with the lora. Any suggestions or a simple workflow you can share/ describe?
Maybe I should add I'm trying with t2v?
@BigSad11 ok, so im not the only one, lol, i tried and never got ltx2 or 2.3 to work without distilled lora or the distilled model, i just thought it was me.
i never got the base model to work in either ltx2 or 2.3 without distilled, fp4 or fp8, so i'm not sure what I am missing. i tried 25 steps, 50 steps, 75 steps, 100 steps, 200 steps.. never got a clean output without the distilled lora. honestly i found running the distilled lora at low strength gave great results at 25 steps, but the highest step count i could get results from was like 32 steps; no matter the strength of distilled, or with it off, i could never get results over 32 steps.
@MrReclusive666 thanks for the reply! I tried everything too. Not quite 200 steps though 😅. I did do 60. It really stinks too because I noticed the output would be better without the distilled lora. It seems the distilled lora changed the output quite a bit. For example if I run a different lora without the distilled lora, the preview of the first sampling process matches my prompt perfectly but it comes out blurry of course. If I run the distilled lora, especially at 100%, to get a clear image, it seems to change initial video from the first pass quite a bit. That's why I was wondering because the base model seems to understand prompts and loras better but doesn't provide a clear image.
The audio also seems more robotic without the lora. Anyhow, thanks again. I'll keep experimenting. Keep me posted if you discover anything!
@BigSad11 set distilled lora to about 65%, thats kind of where i found the happy point (in ltx2 anyways), was good at 8-10 steps. if you want to push to higher steps and let base model come out more, try about 30%, thats where I was hitting 24-30 steps.
and yeah, distilled inherently snaps the model into a quick output, so less likely to follow prompt that well.
what i tend to do, is kind of "undistill" with lora's, i will inject a bunch of lora's at low strength, it returns some of the creativity lost by the distillation.
if im doing general outputs ill inject a bunch of things like vhs, galaxyace, etc, any sfw loras between 5 and 15% (the more lora's, the less strength)
and of course, for nsfw, yeah, a bunch of nsfw stuff (except pov ones, those snap hard even at low strength).
this just injects creativity back into the distilled model allowing you to raise step count a bit and prompt better.
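The numbers in the comments above can be written down as a small sketch. The two presets are the strengths and step ranges quoted for LTX2; the even split of a fixed "budget" across extra LoRAs is only one assumed way to express "the more loras, the less strength", and the 0.45 budget, the preset names and the function name are all made up for illustration.

```python
# Starting points quoted in the thread (found on LTX2, not re-tested here).
DISTILLED_LORA_PRESETS = {
    "fast": {"distilled_strength": 0.65, "steps": (8, 10)},
    "more_base_model": {"distilled_strength": 0.30, "steps": (24, 30)},
}


def extra_lora_strength(count: int, lo: float = 0.05, hi: float = 0.15) -> float:
    """Per-LoRA strength for `count` extra LoRAs, clamped to [lo, hi].

    Assumption: a fixed total budget is split evenly, so adding more
    LoRAs lowers the strength of each one. The budget is a made-up value.
    """
    if count < 1:
        raise ValueError("count must be at least 1")
    budget = 0.45
    return max(lo, min(hi, budget / count))


for n in (2, 3, 6, 12):
    print(n, "extra loras ->", round(extra_lora_strength(n), 3), "each")
```

Treat the output as a first guess to adjust by eye; LoRAs that "snap hard" even at low strength would need a lower value than this gives.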
@MrReclusive666 Got it. Thanks Mr. Reclusive. I'll try your suggestions tonight and everything you said makes total sense. I really like this model but (at least from my experience) it seems to be one of the most complicated ones made so far 😅. I'm sure that'll change with time as people like you and others work with it. I appreciate all the feedback and responses. You've got a new follower on here now ❤️
@BigSad11 yeah, its been a bumpy road with this model, and here in a few days, may not matter as much anymore. Magihuman is out, its pretty much wan with audio, so knowing how much everyone loves wan, lol
i'm waiting on kijai to finish the comfy wrapper for it, then i'll be making fp4's. it's a smaller model, so it should be around 8-10gb if i can maintain the same level of quantization.
i probably won't use magihuman as much since it is basically just wan with audio. wan has that 8 second limit and it doesn't look like magihuman changed that, so i'll stick with ltx2.3, i like my 30-60 second generations.
one weird thing im investigating with magihuman, is how i watched it generate a 5 second clip, in 3 minutes, on cpu only..
@MrReclusive666 Oh wow that sounds intriguing. I haven't heard of that model yet. I'll have to look into it. Man, things are moving at light speed with these models. Seemed like Hunyuan video just came yesterday and I had high hopes for Hunyuan at the time. It's hard to keep up. I do really like the audio and the longer videos with LTX. It seems very quick to generate too (at least on my GPU.) I don't think I could get a working workflow though if they weren't provided 😅. I'll keep an eye out for your magihuman drop on here and do some research on it in the mean time. Good talking to you brother and thanks for everything again. Cheers! 🍻
@BigSad11 yeah, magi human just dropped like 4-5 days ago, and its architecture, from what I have read, is basically wan and stable audio glued together.. it's more complicated than that, but that's the easy way to put it. i'm just waiting for kijai to finish the wrapper before i dig into it more, i'll be curious to see if wan lora's work on it.
and yeah, i had a lot of hopes for hunyuan, i liked it better than wan. then hunyuan 1.5 dropped, it was great, i was training lora's for it to add all the adult stuff for people, then ltx2 dropped like the next month, and that killed hyv1.5. it's a good video model, but now we have opensource video+audio, video only was kind of dead after that.
crazy thing is, my fp4 of hyv1.5 is 5gb.
i still feel that hyv1.5 has superior video, but that lack of audio now really hurt it, lets hope that tencent is working hard to add audio to hyv2.
RuntimeError: mat1 and mat2 shapes cannot be multiplied (4180x4096 and 2048x4096)
comfy up to date? and i personally have to load this with a checkpoint loader instead of unet loader because of the same issue.
same, everything up to date, but not a single node supports these nvfp4s.
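One possible reading of the shape error above, offered as an assumption rather than a diagnosis: FP4 weights are stored two values per byte, so a 4096x4096 layer sits on disk as 4096x2048 bytes. A loader that does not recognise the packed layout would pass that half-width tensor straight into the matmul, which would match the 2048 in the message. The sketch below is plain Python; the helper names and the low-nibble-first order are illustrative choices, not ComfyUI's or PyTorch's actual code.

```python
def pack_fp4_row(codes):
    """Pack a row of 4-bit codes (ints 0..15) two per byte, low nibble first."""
    if len(codes) % 2:
        raise ValueError("row length must be even")
    return bytes((hi << 4) | lo for lo, hi in zip(codes[0::2], codes[1::2]))


def unpack_fp4_row(packed):
    """Inverse of pack_fp4_row: each byte becomes two 4-bit codes."""
    out = []
    for byte in packed:
        out.append(byte & 0x0F)
        out.append(byte >> 4)
    return out


def can_matmul(a_shape, b_shape):
    """True when an (m, k) x (k, n) product is well-formed."""
    return a_shape[1] == b_shape[0]


logical_weight = (4096, 4096)  # (out_features, in_features)
packed_weight = (logical_weight[0], logical_weight[1] // 2)  # as stored

activations = (4180, 4096)
# A linear layer multiplies the activations by the transposed weight.
print("packed shape used as-is:", can_matmul(activations, packed_weight[::-1]))
print("after unpacking:", can_matmul(activations, logical_weight[::-1]))
```

If this is what is happening, the fix is on the loader side (one that knows the tensors are packed FP4), which would fit the report that the checkpoint loader works where the UNet loader does not.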
@Renessance is Eros a distill? i just genned my first run with it (and an nvfp4 quanted gemma) and it got ghosty/pixelated, symptom of not enough steps but i figured its a distilled model?
@sneedingonmyligma420 https://civitai.com/posts/27286913
@sneedingonmyligma420 I'm doing okay with it)
AttributeError: module 'torch' has no attribute 'float4_e2m1fn_x2' How can i fix this
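That AttributeError usually means the PyTorch build ComfyUI is running does not define the packed FP4 dtype at all, so a first step is to check for it directly. A minimal sketch, assuming it is run with the same Python interpreter ComfyUI uses (for the portable build, the embedded one); the function names are made up, and only `hasattr` and the `torch.__version__` attribute are relied on.

```python
import importlib

FP4_DTYPE_NAME = "float4_e2m1fn_x2"


def has_fp4_dtype(module) -> bool:
    """True if the given module exposes the packed FP4 dtype by name."""
    return hasattr(module, FP4_DTYPE_NAME)


def check_torch() -> str:
    """Describe whether the installed PyTorch can name the FP4 dtype."""
    try:
        torch = importlib.import_module("torch")
    except ImportError:
        return "torch is not installed in this Python environment"
    version = getattr(torch, "__version__", "unknown")
    if has_fp4_dtype(torch):
        return f"torch {version} has {FP4_DTYPE_NAME}"
    return (f"torch {version} lacks {FP4_DTYPE_NAME}; "
            "a newer PyTorch build is probably needed")


print(check_torch())
```

If the dtype is missing, updating PyTorch inside that same environment is the likely fix, though which version first ships it is not stated in this thread.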
do I need custom nodes for this, or can it be done by the default workflow?
My video quality tends to vary a lot when using the Full Dev FP4ME version. Audio tends to come out garbled at times, and the videos are sometimes slowed down, but the picture quality is adequate. Has anyone also noticed these issues?