Stop! These models are not for txt2img inference!
Don't put them in your stable-diffusion-webui/models directory and expect to make images!
So what are these?
These are new ModelScope-based models for txt2video, optimized to produce 16:9 video compositions. They've been trained on 9,923 video clips and 29,769 tagged frames at 24 fps, 576x320 resolution.
Note that they can look much better - I had to convert the mp4 outputs to gif for Civitai. We can also upscale these videos using the Zeroscope v2 XL txt2vid models, which I'm currently uploading!
Note: this model is the lighter version of the XL model (available here) which requires a lot more VRAM. If you have >15GB of VRAM, you should be using the XL version.
Where do they go?
Drop them in the \stable-diffusion-webui\models\ModelScope\t2v folder
It's imperative that you rename text2video_pytorch_model.pt to the .pth extension after downloading.
The files must be named open_clip_pytorch_model.bin and text2video_pytorch_model.pth.
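If you prefer scripting that rename, here's a minimal Python sketch (the folder path is just an example — point it at your own install):

```python
from pathlib import Path

def rename_checkpoint(model_dir: str) -> Path:
    """Rename the downloaded text2video_pytorch_model.pt to the .pth
    extension the extension expects. model_dir is your
    models/ModelScope/t2v folder (example path; adjust to your setup)."""
    src = Path(model_dir) / "text2video_pytorch_model.pt"
    dst = src.with_suffix(".pth")  # -> text2video_pytorch_model.pth
    src.rename(dst)
    return dst
```

Running it once after download leaves the folder with the correctly named .pth file.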
Who made them? Original Source?
https://huggingface.co/cerspense/zeroscope_v2_576w
What else do I need?
These models are specifically for use with the txt2video Auto1111 WebUI Extension
Comments
Jeebus. If I squint, I can see the end of human civilization from here.
There's only one file here, though.
Expand the Files section on the left - there are two files.
Now we need some smart cookie to marry this concept with LoRA/LyCORIS :-)
With gifski you can convert your videos to gif almost without loss of quality (open source).
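For reference, one way that workflow can be sketched: ffmpeg extracts PNG frames, then gifski encodes them (both tools assumed installed; the filenames and flags here are illustrative). The helper below only builds the shell commands, it doesn't run them:

```python
import shlex

def gifski_commands(mp4_path: str, gif_path: str, fps: int = 24) -> list[str]:
    """Build the two shell commands for an mp4 -> gif conversion via
    ffmpeg + gifski. ffmpeg dumps numbered PNG frames; gifski encodes
    them into a high-quality gif. Flags assume recent tool versions."""
    return [
        # extract frames as frame0001.png, frame0002.png, ...
        f"ffmpeg -i {shlex.quote(mp4_path)} frame%04d.png",
        # encode the extracted frames at the clip's frame rate
        f"gifski --fps {fps} -o {shlex.quote(gif_path)} frame*.png",
    ]
```

Paste the two commands into a shell in the clip's directory; 24 fps matches the model's native output rate.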
Ah, I'll give it a go, thanks!
Thanks a lot for your help - once all the files are installed, it works like a charm. Never thought videos were this easy. Cheers!
So it's a pickle, and I have to disable the safety check with --disable-safe-unpickle in the command line args before SD will read it. Hmm, not sure I want to do that - can you post a safetensors version?
You're not trying to load it like a model for generating images are you? It doesn't work like that, and it doesn't need any command line args.
@theally No, I placed the models in the \stable-diffusion-webui\models\ModelScope\t2v folder as directed, and I'm using sd-webui-text2video in A1111. I just get this in cmd:
The file may be malicious, so the program is not going to read it.
You can skip this check with --disable-safe-unpickle commandline argument.
No problem, I'll figure it out as always - I'm probably missing something in the setup.
I was expecting porn videos, but meh. Still there are none. :D
I don't think LoRAs or embeds are working at all... so potato. But a moving potato nonetheless!!
No, LoRA and TI won't work with txt2video models.
Fully functional txt2video technology and you just shrug and say "potato" cuz it can't do LoRA or TI yet? Man, I must be getting old, cuz back in my day we were super impressed by little moving black-and-white squares on our computer screens (Pong) LOL
Is it possible to use an RTX2060 for this, or should I upgrade to an H100?
It's all about the VRAM - 2060 is 6GB? You might be able to gen a small/short video on 6GB! You're going to want to upgrade before mid-July anyway, as that's when StabilityAI's new SD XL model releases, which requires 8GB minimum! The more the better!
Some prompts for the videos you attached would be appreciated. Ty
Click the little "i" at the bottom right of the image - prompts are included
@theally thanks, but what about cfg scale and steps?
@noneoofurbusiness all defaults, 30 steps, 17 cfg. Good luck! Lots of trial and error involved to get a good gen in this, I've found!
How is it used?
It works for me only with 4 files:
configuration.json
open_clip_pytorch_model.bin
text2video_pytorch_model.pth
VQGAN_autoencoder.pth
Should I just put it together with these? Should I do something else?
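A quick sanity check for that list can be sketched like this (the directory path is an example — point it at your models\ModelScope\t2v folder):

```python
from pathlib import Path

# The four files listed above, all expected in models/ModelScope/t2v
REQUIRED_FILES = {
    "configuration.json",
    "open_clip_pytorch_model.bin",
    "text2video_pytorch_model.pth",
    "VQGAN_autoencoder.pth",
}

def missing_files(model_dir: str) -> set[str]:
    """Return the names of required files not present in model_dir."""
    present = {p.name for p in Path(model_dir).iterdir()}
    return REQUIRED_FILES - present
```

If the returned set is empty, the folder matches the working setup described above.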