If you want to donate RunPod credits to help with training, feel free to send a code by DM.
A version of the furry nsfw lora for the 14b wan model. Also works with humans.
This is an img2vid lora meant for the wan 14b img2vid models (both 480p and 720p should work). Attempting txt2vid generation can yield unexpected results.
Avoid using teacache. If you do use it, keep the threshold low. Teacache causes more artifacts with this lora and sometimes makes it do strange things.
(Up to) 5x speedup with causvid
I recommend using lightx2v causvid, combined with the mps reward lora for more movement. When combined, complicated actions continue to have lots of movement. Even on 6 steps with euler/euler a + beta it yields great results with lots of motion and great physics.
Previous causvid lora info below (Personally I prefer v1 at 50% over v2)
Using the causvid lora you can get a speedup of around 5 times (assuming you used 20 steps beforehand). It even works with the wan gguf models. Just make sure to follow these steps:
Load the causvid lora at 0.5 strength (as well as this lora at 1 strength)
Sample at 4 (minimum) to 8 steps (6+ seems to be the sweet spot) with a low cfg (<=3, I usually use 2), and use the beta scheduler for best results. If you see ghosting, check that your steps and sampler are set correctly.
On my 3060, this lets me generate a 60 frame video in under 4 minutes. The quality is usually higher than teacache, and it's much faster. Do not combine with teacache.
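The causvid settings above can be collected into a small sanity check. This is only a minimal sketch: the `CausvidSettings` class and its field names are hypothetical (they are not part of ComfyUI or any other tool), and the speedup estimate assumes generation time scales linearly with the number of steps.

```python
from dataclasses import dataclass


@dataclass
class CausvidSettings:
    """Hypothetical container for the sampler settings described above."""
    steps: int = 6                  # 4 is the minimum, 6+ is the sweet spot
    cfg: float = 2.0                # keep at or below 3
    scheduler: str = "beta"
    causvid_strength: float = 0.5   # causvid lora strength
    lora_strength: float = 1.0      # strength of this lora
    use_teacache: bool = False      # do not combine with causvid

    def problems(self) -> list[str]:
        """Return the settings that fall outside the recommendations."""
        issues = []
        if not 4 <= self.steps <= 8:
            issues.append("steps should be between 4 and 8")
        if self.cfg > 3:
            issues.append("cfg should be 3 or lower")
        if self.scheduler != "beta":
            issues.append("beta scheduler is recommended")
        if self.use_teacache:
            issues.append("do not combine causvid with teacache")
        return issues


def estimated_speedup(baseline_steps: int, new_steps: int) -> float:
    """Rough speedup, assuming time scales linearly with step count."""
    return baseline_steps / new_steps
```

Under that linear assumption, `estimated_speedup(20, 4)` gives 5.0, which matches the "around 5 times" figure at the minimum step count. At 6 steps the same estimate is closer to 3.3.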
Purpose
This lora can keep characters consistent and can handle many positions from pov or similar perspectives. It was trained on many positions, so follow the prompting guide. Avoid using it with t2v loras, as your characters might warp and transform.
This lora is capable of generating (without the need for other loras): cowgirl(+reverse), missionary, doggystyle, blowjob(+deepthroat)
It is also capable of handjobs, titfuck. v1 might need assistance, v1.1 seems to be good on that.
It is effectively a lora for NSFW motions in i2v that preserves character consistency and has a better understanding of furry characters.
V1.1:
Continued training with a new dataset, entirely new captions, more perspectives. It usually yields more motion, and it's easier to tag. It can do many positions, perspectives and motions without the need for a second lora.
Prompting v1.1 is like prompting a t2i model, apart from motions: you can prompt for "moving up and down" or similar, although v1.1 will usually have motion anyway.
V1:
Note: if you're not getting enough motion/not the right speed
If you don't prompt for motion, you won't get any.
"deep thrusts, fast thrusts", "medium sucking, slow sucking", etc. will adjust the depth and pace.
"The woman moves up and down as she rides the man", "the woman uses her breasts to stroke the man's cock." Should be self-explanatory. You can even prompt for pulling out, varying degrees of success.
Prompting should be similar to prompting the 1.3b version.
Trained on 400 res, frame buckets of [1, 8, 16, 24, 32, 40, 60, 80], context as "multiple_overlapping".
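To illustrate how a frame bucket list like this is typically used, here is a minimal sketch that assigns a clip to the largest bucket it can fill. The bucket values come from the line above. The selection rule and the `pick_frame_bucket` helper are assumptions for illustration, not the trainer's actual code, and the "multiple_overlapping" context mode is not modeled here.

```python
from bisect import bisect_right

# Frame buckets as listed in the training notes above.
FRAME_BUCKETS = [1, 8, 16, 24, 32, 40, 60, 80]


def pick_frame_bucket(clip_frames: int, buckets=FRAME_BUCKETS) -> int:
    """Return the largest bucket that is <= clip_frames (assumed rule)."""
    if clip_frames < buckets[0]:
        raise ValueError("clip has fewer frames than the smallest bucket")
    # bisect_right counts the buckets that are <= clip_frames.
    return buckets[bisect_right(buckets, clip_frames) - 1]
```

Under this assumed rule, a 50 frame clip would land in the 40 frame bucket, and a single image would land in the 1 frame bucket.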
Compared to 1.3b version
This model performs much better at oral and has fewer stretching artifacts, but it might be a little harder to prompt for the right motion, at least on really short videos. If it doesn't animate enough, make sure you include a speed and depth in your prompts, as this could help.
Trained on img2vid, not recommended for txt2vid
I cannot make any promises about quality when this is used for txt2vid, especially for furry content, since I have not tested it. It might be able to generate some human content and maybe a little furry content, but I would recommend using a different lora for those situations.
Description
Initial release. The 10k step count is an estimate based on the last saved checkpoint.
Comments (13)
Just like with the previous 1.3b, this one excels at quality and perfection. It's simply the best nsfw lora out there!
What do you need to actually be able to use this, though? I doubt it's as simple as downloading it like a lora and then being done.
Do you need a specific checkpoint, lora, model, A1111, Forge, etc.?
What was your batch size and learning rate?
Just a heads up: Civitai made a change to their TOS that says NSFW posts will be hidden from people's feeds if the metadata and prompts are not in the video/image post.
Now that I got a chance to try this. I can say without a doubt that this model captures the motion of sex acts really well.
It doesn't need to be furry images; it works extremely well across anime, realism, etc.
Does this Lora only work with the full Wan model, or can it work with a smaller Q8 or Q6 version? I've been using Q6, and no matter what I try (prompt specifying speed and action, photorealistic image, etc.) I can't get any good motion or good output. I'm trying to animate a side view doggystyle picture. Anyone help?
Please provide prompts whenever and wherever possible folks. I know some like to keep their 'tricks' up their sleeves. But sharing is caring. Share the love. Peace.
I have to say, this is fantastic. With other concepts it takes quite a lot of tries to get something good, but with this lora I get some very nice results within 5 attempts. Nice job.
If only I had 2 5090s in SLI. The lora is great, but it sucks up so much RAM I'm surprised my PC didn't blow up.
A single 4090 is fine. Some of the example vids were genned on a 3060 using a Q3_K_M quant of the model. You can use a gguf quant: https://huggingface.co/city96/Wan2.1-I2V-14B-480P-gguf/. This works with comfyui-gguf (https://github.com/city96/ComfyUI-GGUF).
@mylo1337 How much time would you say it takes per video and length? My 2080 with the GGUF at length 33 steps 20 took 18 minutes based on one time test. It uses about 17.5 GB of RAM (27GB of RAM all together). Using non GGUF takes 40GB of RAM, including OS and other stuff. I'm also curious about the wattage on the 4090 and 3060 when using AI. I would estimate that the 4090 total system power would be 500 watts from the wall. I get around 260 watts on average. I was looking forward towards the 5090, but it just sucks so much power and has issues, let alone the price. My room already blows the breaker when my AC and computer are on at the same time (Happens rarely).
@brand175 idk exact times but I know that even on q3, the 3060 is bottlenecked by memory, I get about 20% usage with small spikes to around 50% usage. Not sure about exact wattage
That's Wan 14B in general. You can use block_swap, but I don't have issues running at full fp16 precision on a single 3090. It's still slow: at least 10 minutes for a higher res generation with any respectable frame length. Try using dpm_2 for the sampler and ddim_uniform for the scheduler; that produces the most consistent and best motion generations of all the sampler combos I've tried. Use around 20 steps for the best quality, but 6 steps is good for a quicker generation. Samplers and schedulers matter a lot when aiming for quality. UniPC is honestly not that great.