Titan AI LogoTitan AI

VITA

2,344
172
Python

项目描述

VITA-1.5 is an open-source multimodal large language model designed for real-time vision and speech interaction, supporting both English and Chinese.

Project Information

Created on 8/10/2024
Updated on 7/2/2025

Categories

speech-technology
conversational-assistant
image-processing

Tags

open-source-community
multimodal
real-time-processing
chinese-support
model-deployment

Topics

multimodal-large-language-models
large-multimodal-models