Titan AI LogoTitan AI

VITA-Audio

607
49
Python

项目描述

VITA-Audio is an open-source, end-to-end large speech model designed for efficient and fast audio-text token generation with low latency and strong performance on ASR, TTS, and SQA benchmarks.

项目信息

创建于 4/23/2025
更新于 7/2/2025

分类

speech-technology
ai-content-generation
machine-learning-framework

标签

open-source-community
algorithm-model
model-deployment
real-time-processing
research-frontier