Yiffydream image captioner - CivArchive (CivitAI Archive)

This is an extremely experimental image captioning model based on moondream for furry images, including NSFW ones. Please note: this is not a text to image model, it takes existing images and creates text descriptions for them. It's also unreliable and tends to hallucinate NSFW details in images that aren't actually there.

I'm not aware of any publicly available captioning models for furry content so I'm releasing it in its current state, even though it will likely require substantial fine tuning and manual editing of the generated descriptions.

To make use of this you'll need to install moondream and unpack this model into that directory. (I recommend setting up a venv or conda environment before doing this because it installs very specific Pytorch versions):

git clone https://github.com/vikhyat/moondream
cd moondream
git checkout 281074b9e488d142fba86760c7b606a1866acf3f
pip install -r requirements.txt
unzip yiffydreamImage_20240510.zip

Now you can point the included batch captioning script at a directory of images and it will add .caption files for any images that don't have them already:

python3 batchcaption.py myimagedirectory/

The generated captions are likely to vary a lot in quality, with interactions between multiple characters particularly poor. You can output them to a jsonl file instead with the --outfile argument if that's more useful to you. I've also included a quick fine tuning script.

Description

Details

Files

Available On (1 platform)