10S Nodes in my Workflow - v1.2 WF coming with nodes soon.
Ver. 1.2
Additional adjustments. Basically a more polished version of v1. This may be the best LTX2.3 can do tbh. It has a bit more tensors. This FP8 is larger, I left all connectors on exclude because that was one small part of identity drift. Full bf16 for quants and training is on HuggingFace backup. It can do anything v1 can with more 'stability' -- less tendency to do errant or odd moves and reaching more stable outputs at lower steps (8-10). Still best used to just animate sex images. Complex storytelling and prompts still requires additional guided frames and high effort, everyone knows LTX 2.3 is very clunky by now. Lighttricks seems to be 4 years behind everything, they just now decided to implement experts on the next version. I guess the totally true-proven method of autoregressive is too futuristic for them (literally the best method for video models - Seedance2, Grok, Kling).
This has slightly better initial character and likeness keep, but prompts can remove it unless care is taken. Expressions, dialogue, movement, and camera/pose change will all attack the guided character's look. Best to always train the likeness as a lora or have a guided endframe to give the model a future target.
Same bad tendencies in certain scenarios, same unrefined concepts.
Ver 1.0
This is not the official Sulphur2 model, that is also here. They are the team that trained and manage the official model. I am only affiliated as an i2v tester and consultant. This is my personal merge of the actual data and it uses different training steps and layer scaling to create a consistent flexible I2V focused version. Future versions (v2+) will be actually fine tuned by me personally aimed at filling in gaps and weak concepts.
Quality is entirely prompt, image, and workflow dependent. Aside for systemic issues with LTX: motion smear, artifacts, and identity loss, it still behaves like 2.3 requiring extensive prompt refinement, extra sampling, and resolution targeting for best results.
This probably means nothing legally, but I just want to make it clear that you waive any right to blame or use the host site, me, or sulphur in your defense if you download and misuse this. It's ethical artistic intended use is clearly stated:
By downloading, accessing, or using the 10Eros Model (the “Model”), you (“User”) agree to the following terms:
1. Permitted Use Only The Model may only be used with inputs of fictional or non-real characters. Any use involving real persons, celebrities, public figures, or identifiable individuals is strictly prohibited (“Out-of-Scope Use”).
2. Disclaimer of Liability TenStrip (the creator of the Model), together with its affiliates (CivitAI.com / CivitAI.red, Lightricks Ltd., and Sulphur Team), collectively and individually disclaim all liability for any direct, indirect, incidental, consequential, special, or punitive damages arising from your use or misuse of the Model.
This includes, without limitation, any claims related to copyright infringement, right of publicity violations, defamation, privacy violations, deepfakes, illegal activities, or any harm caused by outputs generated by the Model.
3. Out-of-Scope Uses Forbidden All Out-of-Scope Uses are explicitly forbidden. The creator and affiliates bear no responsibility for any outputs generated from prohibited inputs. Your actions with this generative Model are solely your own.
4. Assumption of Risk You assume all risks associated with the use of the Model. Any use outside the permitted fictional/non-real character inputs further releases TenStrip and all affiliates from any and all liability.
5. Indemnification You agree to indemnify, defend, and hold harmless TenStrip and its affiliates from any and all claims, losses, liabilities, damages, and expenses (including reasonable attorneys’ fees) arising from your use or misuse of the Model.
6. No Warranties The Model is provided “AS IS” with no warranties of any kind, express or implied.
This agreement is governed by the laws of the United States. If you do not agree to these terms, do not download or use the Model.
Description
Extremely rough Beta version.
FAQ
Comments (106)
Does anyone know where I can find the "SamplerSwitcherWrap" node? I just can't find it on github.
@tenstrip Thanks!
Impressive! I didnt think LTXV would outperform wan so soon, but here we are, very close or perhaps even beyond that👀
The next version of this is gonna be way better--better and more data and actually 2.3 trained.
@tenstrip how long do you think it's going to take? Also happy to provide compute if needed
@jeremiahomolewa56846 Only at 1800 steps last update I got. Aiming for well over 15k I think but also needs an i2v training run.
Dear Sir, is it possible to make a video longer than 10 seconds in your workflow -i2v 10 Eros Workflow?
Sure for simple things it can continue and evolve motions but longer stuff usually needs a lot of prompting. If the motion doesn't work that good maybe lower frame rate to compensate for the added length, and interpolate it back up afterwards.
Can be used to I2V?
I think yes
I mean every example I posted is i2v, so maybe.
@tenstrip Hahahaha ok ok, got it, thanks :)
Nice model! Does anyone know of a workflow that can use this model to extend videos? I found LTX-2.3_-_V2V_Extend_Any_Video.json but wasn't able to make it work with checkpoint models.
Any image2video workflow can be turned into v2v extend. My workflow has a video loader node all you need to do is set frame rate and cap and then use that video instead of the image on the latent combines. I usually set the initial video frame load cap to 48 or 73 frames and then you get a 10 second extension of that, works great on nsfw videos or anything.
In wan2gp I get error
"The generation of the video has encountered an error, please check your terminal for more information. 'Cannot set version_counter for inference tensor'"
Any ideas?
I've never used wan2gp but next version I'll see about compatibility and what I can do to integrate for it. That specific bug is pytorch related though something isn't using torch correctly system or program side, or a dependency issue. See if normal ltx2.3 dev undsitilled works first, if you can get that working this will just replace it.
@tenstrip Normal ltx2.3 dev works fine (but Wan2GP defaults to the Quanto INT8 version), someone in the Wan2GP discord said they got it working by disabling triton? I couldn't get it working with their solution though
@ConnoisseurOfHentai Idk if this is better maybe https://github.com/deepbeepmeep/LTX-Desktop-WanGP
Thank you very much for posting this. Makes you wonder if the future of ltx 2.3 nsfw is a finetune + some good loras.
That will definitely be the future in a few weeks/month when I can update it.
@tenstrip I'm waiting for good finetune of LTX 2.3 soo bad.. Is there any news about it?
Great stuff, excited for the full release
Hello, Please someone provide me the best LTx 2.3 first last frame + custom voice... workflow.. Please please...
Thank you 🙂
I really recommend using wan2gp, let claude help you with installing it
@kenofujimoto3882 Thanxs... But now there is a lot of improvement in LTX 2.3, wan 2.2 lacks the ability to add custom sounds
@animart920 I use wan2gp with ltx only, you can just select the model in their webui and it will install perfectly fine ;)
I dunno if I'm crazy, but it seems like the movie audio always has the same weird song playing in it. Like, every generation.
Probably related to the prompt. If you don't prompt anything for audio, background noises, or dialogue ltx2.3 will just fill in with whatever is related. The audio in this is nearly untouched from the normal 2.3 model besides some slight influences for female moaning and sex noises.
I'm waiting for good finetune of LTX 2.3 soo bad.. Is there any news about it?
If you don't dig in and learn the architecture you're always going to be dependent on the benevolence of strangers. It looks more complicated than WAN at first but IMO it's actually way more intuitive. It took me a week of screwing around to get most of the functions nailed down. That's nothing. Best advice is to ditch the upscale stage. Build a one stage basic wf that suits your hardware and you won't need to wait on anyone. I say this not to be pedantic, I just love this model. I promise you it's worth a little headache at first.
News is that it's training and still early on. Initial results show much better learning especially genitals and movements. For i2v it's probably already usable but I can test anything for 2-3 weeks.
@tenstrip I agree — it gives much better results without the LoRA compared to standard LTX 2.3, but the audio quality is much worse. I work around that by using the VAE audio decoder from the default checkpoint. Unfortunately, in my case it still struggles with generating male anatomy. More specifically, it looks fine up until the final pass, where I assume the spatial upscaler messes it up. I tried reducing its influence as much as possible at the last stage — it helps a bit, but not enough.
It also tends to mess up anatomy when parts of the body aren’t fully visible in the input image. That issue exists in Wan too, but not as strongly. I guess I still need to experiment more with the workflow.
@Honeyphoria Oh that stupid upscaler... ditch it! I've been proselytizing hard for single-stage runs. Trust me. At least try it. Delete the whole stage and the upscaler loader. It ruins audio and it ruins video. Lose the 0.5 scale node and do your run at the resolution you want. I never use it anymore, except as a standalone for fixing up old WAN generations. If you have the appropriate phallus/general nsfw LoRA, wieners should get a detail boost - low strength, and turn down all of the audio strengths as well if they screw up the sound. Try abliterated or heretic text encoder if your dingus details are ignored. This model is really good at nsfw without them though. Obviously. That's kind of the point. But they can help. I do use heretic.
@Ponder_Stibbons Yeah, i'm using heretic v2 already. I will try without this shit without spatial upscaler, thanks
@Honeyphoria Audio quality to me means the tone and overall sound fidelity, but idk if some people are using 'quality' and referencing just prompt adherence and available noise variety. The next training will have full sex noises and sounds along with the JOI and ASMR type stuff and orgasming voices and all that. They were already present but I toned down the audio layers in this version since it's 2.0 trained and it would drag the 2.3 model backwards on most of the layers. For audio fidelity theres two major things people miss, one is that if you pass the first audio latent into the upscale pass like normal-and that usually uses a lower strength distilled lora, the audio will always be completely ruined because it's being resampled too lightly. Inside my workflows there is a group of nodes that take the first pass audio and manually reencode it, providing the exact sound output as an audio-video layer to the upscale pass instead. This is how I'll always do it because it's just way better. From there you can start changing sampling and steps, different samplers produce way different audio outputs. My sampler swap workflow uses SDE sampling with 3 CFG for the first couple steps which provides a way better initial audio track, but I still didn't get it tuned enough sometimes it goes overboard and it's loud as hell.
Two questions:
1 - Will this work for text2video?
2 - Will you update this model? It looks good at this point, but it seems like it could be even better.
T2V: yes, much better when you use 2.3 loras for the better anatomy as well. The new 2.3 training is approaching 10,000 steps with a high batch. The full version of that will be released by it's trainer and operator. I will find a training step and weight mix from the new run that I hold to be the best for i2v use and update this, then further updates will be trained/tuned by me or made from merging other loras in.
@tenstrip I also have a question about Dynamic VRAM. Have you or anyone you know encountered the problem that when using this method to launch an LTX model, it doesn't work at all for text2video mode? Sometimes the progress bar stops halfway, and the interface reports the video is ready; sometimes the progress is 100% complete, but the video still isn't actually finished. I only have the image2video mode working. I use the standard Workflow in ComfyUI. I have 12GB of VRAM and 64 RAM.
@tenstrip I'm also curious where I can find the Swap node in your workflow. I can't install it because there are no custom nodes with this node listed.
@yuduz367 I'm not knowledgable about optimizing and low memory use. I run a 5090 with no real optimizations.
@tenstrip And one more question, an important one. This particular checkpoint seems unable to generate pornographic videos without using LoRa. What exactly is the point of this model? And will the next iteration of the model fix this problem?
@yuduz367 It's an I2V model, it's meant to provide a better base model to animate pornographic images than the default LTX2.0 model, and it has a wide ranging variety of nsfw data inside it that does better than the base model. I still see people using the base model with nsfw loras and wonder why; this version of 2.3 is always going to be better for I2V and provide much more variance. I merged it in at a conservative level so that the audio wouldn't be interrupted as much and so that loras can override and shine through still without being disturbed by overlap. The actual sulphur project (not mine) is the full strength T2V model that will be released. I am basically just doing my own personalized I2V fork of it with permission since I'm involved with it.
i noticed this is FP8. is there a full version available somewhere too?
The script auto converted it and I don't see any gain to leaving it at the full dev size for what it's supposed to do, but the next version I may leave a full size one available.
Thanks for noticing lioncrud. Being on 2080ti this would been a moot download
Great job bro, but if possible, I would like to tell you about some of the shortcomings, especially with the hands
I would like to ask, if this is possible and you are responsible for it, please work on the movements of your hands so that they are more natural, for example, if you write that a man takes a girl by the ass) so that it is really done and correctly, without any external hands, the model as a whole does not understand what movements and when you need to do this, depending on how you describe the context in the video, in general, try yourself such promptings as taking your ass or your chest, especially with a man, it does not work, he does not make movements with his hands., it's even better with a woman, but not always successful results are obtained, are there any improvements in these points that I have described?
Not sure if you're talking about T2V or I2V. Most of the tricks with I2V to improve the output structure is usually with more sampling. In the workflow I posted the attempt I mad significantly improves evolution and structure with the 13 steps on the first pass. Important as well: anatomy distortion and mutation is also super prevalent in the upscale pass if you don't set it up correctly which is why I like previewing the first pass, especially with vertical resolutions, still. The model could also prone to that since it only had smaller resolutions on this first training run. Additionally for I2V; adjusting the constraint mechanisms can allow latent to evolve the image more both the preprocessing and conditioning constraint. Resolution is also super important there aren't many models that have small details, if you can you want to crop out most of the start image straight to the focus and action and cut most of the framing and background out that isn't needed. This first dataset itself also didn't receive enough I2V training on the first run to properly do complex things which should be better in the next version but it might not be enough to patch weaknesses in the LTX2.3 model itself.
This shit is great - I can't imagine what the full fine tune will look like. Is there anyway to donate? I need this to actually happen.
Not to me for this project, I only train my own loras but will be adjusting around what's missing from the next version-which isn't much from what I've seen. The real sulphur model team that is doing the training will appear on here at some point and post the model, it will also be on hugginface and they may take donations to recuperate for possible future versions. It most likely will be imperfect since it's a 100k+ dataset but the I2V will be very good.
This is the real deal, dramatically improves body movement from my testing vs ltx2.3 22b dev. Looking forward to the next release
They have nuked LoRa_Daddy :(
He nuked himself from what I saw.
@tenstrip What did he do?
@MisticRain69 something on reddit set him off, he got "one-guyed" and just said F all of you I'm out.
@tenstrip He's back under a new name, at least on reddit.
I tipped him on buymeacoffee a while back and my tip got returned recently, so I guessed something was up :(
I'm having a hard time generating sexy nonude content (for example clothed POV grinding in lap). It always eventually generates a penis, even when the base image (i2v) contains no genitals or nudity at all. Does anyone know how to do this? I can't find any Lora's for this either.
Turn LTXPreprocess img compression to 15-20 or as low as it will go until it just stops animating and make the i2v conditioning strength 1.0. Then increase the compression until it does just the right movement without introducing new elements. If you use my split sampling workflow then CFG is actually on for the first frames, so prompt all the undesired stuff in the negative as well.
How to generate video without ANY sound? How to mute it completely?
A few ways: Make the audio latent 0 frames, or 1 frame if it doesn't let you and then skip 1 frame when you combine the video. After decoding you can also only save the video and unplug the audio from video combine. You can use a volume control to just make it silent.
@tenstrip thanks!
There all in one workflow its generate 2 copies of videos - with sound and without.
Hi again, I have a few questions I need answered for a post on the AI-themed Telegram channel.
1 - Will the next iteration of Fine Tune allow for NSFW content generation without lora, solely using the model itself?
2 - Do you use CG content to train the model? Anime, 3D animation, and so on.
how do you generate without music playing?
Sound : no music. Help some times
This is just LTX2. You need to prompt some kind of audio, usually very descriptive. No audio prompt: the model will default to either random white noise or music.
Make sure to put the audio you want and specify this is the only sounds you will hear.
just put "music" on the negative prompt, its surprisingly powerful when it comes to negative prompts.
@Ardent23 This and lower audio strength helps a lot
Maybe Im dumb, but what VAE do I use?
Every video I make with this, it comes out weird and blurry.
the one in the checkpoint or the ltx2.3 vae.
@tenstrip First off, thanks for responding.
The only VAE I can find is, LTX23_audio_vae_bf16 and LTX23_video_vae_bf16, but whenever I use these, I just get a blurry video.
@aardbark77431 blurry is a workflow/resolution thing. Fast motion will not be very crisp usually and it's hard to get it right with sampling and interpolation. If it's extremely blurry is a workflow related issue. For i2v the i2v_preprocess node literally blurs the input image so you want it as low as possible to still get an output with motion.
Same here - im using wanGP and doing multiple tests. this model is the best i can find for motion and sound but images go straight to distorted and echoey
@NUGGZ1616 that sounds like some issue with the distilled lora or strength then, or possibly the way that i2v conditioning is set up or too much img_compression.
@tenstrip ive been testing on a distilled checkpoint, are you saying this should work better on the dev while using a distilled lora?
Yea I feel stupid. I try using the workflow, but when I click to load the Checkpoint or the VAE it pops up with my Lora folder. And even the listed VAE is the Checkpoint, and not a VAE.
Basically, where can I download the VAE?
@aardbark77431 The video and audio VAEs are in the checkpoint. At least it does work in comfy like that. KJ has split ones off though here https://huggingface.co/Kijai/LTX2.3_comfy/tree/main/vae
@NUGGZ1616 it should work good as a dev checkpoint that you use distilled lora on top of yeah, that's how I always use it. The new 1.1 distilled doesn't seem better but I use it first pass only at 0.9, and then I use the old ranked down distilled lora on the upscale pass at 0.7 strength.
Hey, I don't know If I come here to late, but I got the same issue earlier in the day. But with this lora at 0.5 strengh It got fixed : https://huggingface.co/Lightricks/LTX-2.3/blob/main/ltx-2.3-22b-distilled-lora-384.safetensors. It's just like the SmoothMix Wan2.2 latest I2V model
@Frgmt80 this worked thx
Hello, dear author. I really liked your model. I'll recommend it, maybe someone will give me a donation. But I have a few questions.
1) I'm confused about the trigger words. Here are yours: 3D, Real Video, Amateur – how do I insert them? At the beginning of the text? If I don't use 3D, should I remove this word from the prompt? Does the comma after the last trigger word matter? After the comma, with a capital letter, the prompt? I just can't figure out if the trigger words are strictly tied to their location in the text, and how do punctuation marks affect them?
2) I've installed a lot of lore. Let's say I have the DR34ML4Y lore – I'm writing a prompt that includes both what your model does and what the lore can do. Do I need to include the trigger words everywhere? For example: 3D, Real Video, Amateur, m15510n4ry, bl0wj0b, d0ubl3_bj, d0gg1e, c0wg1rl,
The keywords can just be used to target types of motion or ignored but better to match the style, just need the word somewhere in the prompt. Punctuation doesn't do anything its ignored by tokens, but just use normal natural language. LTX loras would only need keywords in t2v, i2v doesn't need them just use lora strength.
This model is so close, it's crazy. Closest a full ltx 2.3 model is to doing proper wan-level NSFW. But even with loras i still get lower body gore unfortunately. Thank you for sharing your "non-updated" workflow in the description, it works amazingly well. Though res2s seems a bit excessive for the low res pass, gens take twice as long with that sampler due to its heavy nature.
That's definitely fixed in the new one. I can already add, change anatomy, do pull-outs, and camera and other reveals on it's own at a decent level. A lot of training left I think too.
@tenstrip music to my ears man. LTX has so much potential, i want to see it fully dethrone wan 2.2.
retrain this on 1.1 and better ero loras? heh
2.3 is in training and already I think past the point where it's better. I just had my first hands-on testing. Actual anatomy is pretty learned and changes to poses and advanced prompting is going to be possible and audio is way improved.
really solid checkpoint, looking forward to the next version
@tenstrip Is this base LTX2.3 model that has merged LORA into it (probably using phr00t script?). Am I understanding this correctly?
Yes but a very specific mix with different training steps mixed together first. You can make a similar one by taking the old sulphur lora and using the script but it'll be different. I used the steps that had less t2v and more i2v training. The new mix in testing is a lot better and has all reasoning loras along with the actual 2.3 training run.
@tenstrip In Your testing do You see difference between Model + LORA merge vs Model + LORA load?
I do assume that such merge is better because it does propagate weights to "more" parts of the model while loading LORA is less involved ??? Do You happen to have some tech documents regarding this?
@N0n4m3 pretty much always because of the literal difference between loaded vs merged, but not significant per-seed with this merge. I've been changing the merge script into something completely different that uses Layer-Wise merging and full bf16 with actual critical layer highlights and protection this time and designed for 2.3, it's not quite phr00ts script anymore but mainly it's the actual 2.3 data making it work so much better.
There is a new model out Lightricks/LTX-2.3 at main
it is a 1.1 version. Claims to have "A different aesthetic experience and improved audio compared to v1.0"
It's a newer distilled lora. The distilled 1.1 lora is good for the first pass, but seems to include color changes and counters anatomy on second pass. The reranked ones are going to be better for this and i2v: https://huggingface.co/Kijai/LTX2.3_comfy/tree/main/loras but I find that using the old distilled at weak strength on the second pass is still better.
New version should be out soon if the website would work and allow it to be added. Doesn't need Loras and not much that it can't do besides struggle with keeping unique identity at a distance and the physical properties of cumshots still aren't the best. I'm experimenting with a pretty crazy merge with key splits and some layer-scale merged with normalization on for most parts and no normalization for video which has really injected full anatomy into it finally.
I'm really looking forward to it, please release it soon.
possible to get a direct huggingface link if civit is the bottleneck?
Refreshing on the hour. Can't wait to check it out
Yeah I'll definitely have to add it to HF at the same time and link it. At least the staged model files actually uploaded to civit finally. Right now it's waiting for a companion prompt enhancement model with API that will also release with Sulphur2. Then Sulphur2 will release, then I can release this.
Missing nodes - not found on node search and links are broken to gits for them
GACLove/ComfyUI-VFI
and
danTheMonk/comfyui-int-and-float
not sure what that is, wrong page.
Details
Files
ltx2310eros_beta.safetensors
Mirrors
ltx2310eros_beta.safetensors
ltx2310eros_beta.safetensors
ltx2310eros_beta.safetensors
ltx2310eros_beta.safetensors
ltx_2_3_eros.safetensors
ltx2310eros_beta.safetensors
ltx2310eros_beta.safetensors
ltx2310eros_beta.safetensors
10Eros_LTX2.3_fp8.safetensors
ltx2310eros_beta.safetensors
ltx2310eros_beta.safetensors
ltx2310eros_beta.safetensors
ltx2310eros_beta.safetensors
Itx2310eros_beta.safetensors
ltx2310eros_beta.safetensors
ltx2310eros_beta.safetensors
ltx2310eros_beta.safetensors
ltx2310eros_beta.safetensors