Step-Audio

4,390

356

Python

Step-Audio is an open-source framework for intelligent speech interaction, supporting multilingual conversations, emotional tones, regional dialects, adjustable speech rates, and prosodic styles. It features a 130B-parameter multimodal model for speech recognition, semantic understanding, dialogue, voice cloning, and speech synthesis.

项目信息

创建于 2/11/2025

更新于 7/2/2025

分类

speech-technology

conversational-assistant

ai-content-generation

Step-Audio

项目描述

项目信息

分类

标签