Video tutorial:
How to use:
1. Fill in the dialogue content (in multi-character dialogues, use [spkn] to represent, for example [spk1], [spk2], [spk3])
2. Choose whether to translate the speech into a dialect (turn on the switch if you want to translate into a dialect and select the corresponding dialect you want to translate into).
3. Selection of Chinese dialects:
Value 1 represents Sichuan dialect, value 2 represents Cantonese, value 3 represents Shanghainese, value 4 represents Northeastern Mandarin, value 5 represents Henan dialect, value 6 represents Shaanxi dialect, value 7 represents Shandong dialect, value 8 represents Tianjin dialect, and value 9 represents Minnan dialect.
4. Whether to use text normalization (only use if your conversation contains Arabic numerals)
5. Upload reference audio for your voice (mainly for voice cloning; those using voice design do not need to upload this, but do not delete the existing audio). If you need to use voice cloning, turn on the corresponding character's switch after uploading your reference audio.
6. Role Control Instructions (Control Pitch, Emotion, Rhythm): When generating speech from text, control instructions are filled in according to (core identity, such as gender, age, and role; speech quality and pitch; emotion, rhythm, and scene).
Example: A quiet, hoarse, elderly woman with a deep, noticeably grainy voice, accompanied by a slight tremor in her breathing. She speaks slowly in a very low volume, perfectly suited for historical narratives.
Character control commands are optional; filling them in will activate them, while leaving them blank will disable them.
We recommend trying it out online before downloading; it's completely free!
Online experience address:
https://www.runninghub.ai/post/2043960563025321986/?inviteCode=rh-v1058