Emotional and prosody control TTS
INPUT VOICE/PROSODY
RECOGNIZED CAPTIONS
VOICE IDENTITY
Speaker ref: full clip.
TTS CAPTIONS
GENERATE PANEL
TTS SETTINGS
Extended text control
Performance
CLSP CONTROLNET
No CLSP audio selected.
No checkpoints loaded.
REFERENCE MEAN GUIDANCE
GENERATED OUTPUT
LONG GENERATION STRATEGY
Input shorter than 10 seconds.
No output generated.