Microsoft Vibe Voice Text To Speech Example Workflow

The example Workflow for the Vibe Voice Text to Speech AI models. Microsoft VibeVoice with ComfyUI is an AI tool that creates natural-sounding speech, supports voice cloning from audio samples, and can generate conversations with up to four distinct speakers, making it great for podcasts or long audio scripts. It offers options for controlling speech style and quality, working best with English and Chinese.

Link to ComfyUI Custom Node - https://github.com/wildminder/ComfyUI-VibeVoice

I have a free written manual installation guide that includes the required python packages and custom node links, available here - https://www.patreon.com/posts/137750868.

If you want a full video walkthrough of the Gradio Demo version of the vibe voice project and installation process, including manual setup, recommended settings, and tips for both local and cloud deployment—check out my YouTube tutorial linked below:

Comments (3)

daemondenpublic402Sep 16, 2025

CivitAI

It is really cool. Only issue I have is trying to add emotion to the dialogue. Can't really figure out the best prompting method for that.

francoispoidevin869Mar 5, 2026

Is there any solutions to add emotions through the prompt? Maybe another way to add emotions?
Thanks

cd0001Oct 23, 2025

CivitAI

Awesome work flow. Thanks for uploading it.

Description

FAQ

Comments (3)

Details

Files

microsoftVibeVoiceText_updated.zip

Mirrors

Description

FAQ

What is Microsoft Vibe Voice Text To Speech Example Workflow?

What files are available and where can I download them?

Comments (3)

Details

Files

microsoftVibeVoiceText_updated.zip

Mirrors