Titan AI LogoTitan AI

MegaTTS3

5,589
418
Python

项目描述

MegaTTS3 is a PyTorch implementation for text-to-speech (TTS) using a lightweight and efficient diffusion transformer model. It offers high-quality voice cloning, bilingual support, and controllable accent and pronunciation features.

项目信息

创建于 3/20/2025
更新于 7/2/2025

分类

speech-technology
ai-content-generation
machine-learning-framework

标签

algorithm-model
open-source-community
chinese-support
research-frontier

主题

research