Titan AI LogoTitan AI

dia

17,269
1,418
Python

项目描述

Dia is a 1.6B parameter text-to-speech model that generates highly realistic dialogue in one pass, with capabilities for emotion and tone control through audio conditioning and nonverbal communication.

Project Information

Created on 4/19/2025
Updated on 7/2/2025

Categories

speech-technology
ai-content-generation
text-processing

Tags

ready-to-use
model-deployment
data-processing
open-source-community
research-frontier

Topics

open-weight
ai
text-to-speech