
Dia2 2B
Streaming dialogue TTS with voice cloning, non-verbal cues, and multi-speaker support
Text to Audio
Dia2 2B Overview
Dia2 2B is a 2-billion-parameter streaming text-to-speech model from Nari Labs, designed for real-time conversational AI. It begins generating audio from partial text input rather than waiting for the full script. It supports multi-speaker dialogue via speaker tags, voice cloning from a few seconds of reference audio, and non-verbal cues such as laughter, sighs, and coughs. The model is released under the Apache 2.0 license, which permits commercial use.