Wan 2.2:
This was trained using ai-toolkit on Wan 2.2 I2V 14B using 20 81-frame 16-fps videos.
Wan 2.1:
This model was trained using diffusion-pipe on the Wan I2V 14B 720P checkpoint using ~30 2-second videos from various anal sex positions at 24fps. I used an Nvidia A6000.
I'm aware that it's the general convention to train on the T2V Wan checkpoint, but I get horrendous results (with both T2V and I2V) when I do so with this dataset. My theory is Wan isn't as aware of this type of action as other ones, but I'm not really sure. I just know that I've tried a bunch of times to train this model in T2V to no avail while I've had success training on T2V with other datasets.