LlamaCaption2_AngryHamster - CivArchive (CivitAI Archive)

LlamaCaption2_AngryHamster - LlamaCaption2AngryHamster

LlamaCaption2 is a GUI-based image captioning tool that uses local AI models to generate customized captions for images.

#Originally : LlamaScribe by LamEmy |

Revised & Upgraded by AngryHamster

Originally developed by https://civarchive.com/user/LamEmy

revised and upgraded by AngryHamster

Github Page for the LlamaCaption2 :

git clone github.com/robertinosro/LlamaCaption2_AngryHamster

## Features

- Process multiple images in a batch

- Generate AI-powered image descriptions

- Apply custom formatting to captions

- User-friendly interface

- Works with local Ollama models

-Process at once all images in the input folder!

Quick Readme :Setup Requirements

Install Ollama on your system (no Python installation required)
[Ollama](https://ollama.ai/) installed and running locally
(https://ollama.com/library/llama3.2-vision)
- A compatible vision model (like llama3.2-visioon) for image analysis
- A compatible text model (like qwen) for caption refinement
Find the LlamaScribe.exe executable inside the main folder. Start the file. Enjoy captioning!

Getting Started

Launch LlamaScribe.exe
The application will open with the title "LlmamaScribe by LamEmy | Revised & Upgraded by AngryHamster (Civitai)"

Basic Usage

Main Tab:
- Select the folder containing your images
- Choose your preferred Ollama vision model
- Click "Start Processing" to begin captioning your images
Advanced Tab:
- Configure system prompts
- Adjust additional settings for the AI models
Formatting Tab:
- Enable/disable the caption prefix and suffix
- Customize the prefix text: "A photo of a woman, bloobikkx1, curvy blonde with (a well-defined neck:1.3) and (natural proportions:1.2), "
- Customize the suffix text: " (masterpiece, ultra-realistic, high-definition, 8K, cinematic lighting),(professional photography:1.4), (sharp focus:1.2), (studio lighting:1.2), (clear details:1.3), (professional atmosphere:1.3)"

Output

The application will process all images in the selected folder
Each image will receive a caption following the format:
- [Prefix] + [AI-generated description] + [Suffix]
Captions will be saved alongside your images

A more advanced Readme file is included in the main folder!

Description

V 2.0 Upgraded! Adding Custom Prefix/Suffix Text to tailor captions. (trigger words for loras)

Comments (4)

QualityControlMar 5, 2025

CivitAI

what does this do over something like JoyCaption?

AngryHamster

Author

Mar 5, 2025

First of all , uyou don't need to install any requirements. i've been strugling with JoyCaption for a long time,and everytime i ran into a black hole. Install this and that , then this is not compatible or missing..all that. The GUI itself is simple and intuitive, and i must say it works really great ( and fast too) The purpose of this is mostly for generating captions for Lora training.

It runs locally with ollama and any vision model of your choice.

Besides the initial image captioning i added the option for Prefix and Suffix. You can customize both the prefix and suffix directly in the application without modifying the source code.

Also there is an Advanced Tab (optional and for advanced users)where you can enter a Custom Prompt for the refiner model.

The repo contains all the info needed to run this Image captioner.

It does not need any requirements to be installed ( unless you want to re-build it and customize on your own)

5221141Mar 7, 2025· 1 reaction

CivitAI

The changes you made look good, thank you for uploading it. :)

AngryHamster

Author

Mar 7, 2025

Thanks for looking into it!

Other

by AngryHamster