Titan AI LogoTitan AI

VITA-Audio

607
49
Python

项目描述

VITA-Audio is an open-source, end-to-end large speech model designed for efficient and fast audio-text token generation with low latency and strong performance on ASR, TTS, and SQA benchmarks.

Project Information

Created on 4/23/2025
Updated on 7/2/2025

Categories

speech-technology
ai-content-generation
machine-learning-framework

Tags

open-source-community
algorithm-model
model-deployment
real-time-processing
research-frontier