Soul-AILab/SoulX-Podcast-1.7B-dialect
Text-to-Speech
β’
2B
β’
Updated
β’
272
β’
24
Generate images preserving face identity
Replace objects in images using prompts or reference images
Generate customized speech from text using a reference audio
Generate music from text descriptions and optional melodies
Transcribe speech from audio or YouTube videos into text
Convert spoken words into text