Dia2 2B

Streaming dialogue TTS with voice cloning, non-verbal cues, and multi-speaker support

Text to Audio

Dia2 2B Overview

Dia2 2B is a 2 billion parameter streaming text-to-speech model from Nari Labs designed for real-time conversational AI. It begins generating audio immediately from partial text input, supports multi-speaker dialogue via speaker tags, voice cloning from a few seconds of reference audio, and non-verbal cues like laughter, sighs, and coughs. Released under Apache 2.0 for commercial use.