Emotional and prosody control TTS

INPUT VOICE/PROSODY

Loading dataset input...

RECOGNIZED CAPTIONS

VOICE IDENTITY

Loading default target speaker...
Speaker ref: full clip.

TTS CAPTIONS

GENERATE PANEL

TTS SETTINGS

Extended text control
Performance

CLSP CONTROLNET

No checkpoints loaded.

REFERENCE MEAN GUIDANCE

GENERATED OUTPUT

Play
No output generated.