Resemble AI Releases DramaBox, an Open-Source Voice Model with Emotional Depth

Resemble AI has open-sourced its advanced voice generation model, DramaBox, on Hugging Face, marking a significant leap in AI voice technology. DramaBox is the first voice engine designed for director-level control, allowing users to input stage directions such as sighs or whispers alongside dialogue. This transforms AI-generated voices from robotic outputs to emotionally rich performances, eliminating the need for human voice actors or extensive post-production. DramaBox features zero-shot voice cloning, requiring only 10 seconds of reference audio to mimic a target voice. It also allows users to set a character's age, accent, and emotion through natural language prompts, producing studio-quality 48kHz stereo audio. To prevent misuse, all audio includes an invisible watermark resistant to compression and editing. The model is built on Lightricks’ LTX-2.3 audio foundation and integrates advanced technologies like Diffusion Transformer and Gemma 3 12B for text processing.

Source: Show Original

Disclaimer: The content provided on Phemex News is for informational purposes only. We do not guarantee the quality, accuracy, or completeness of the information sourced from third-party articles. The content on this page does not constitute financial or investment advice. We strongly encourage you to conduct you own research and consult with a qualified financial advisor before making any investment decisions.

You may also like