Titan AI

LLaVA

22,916
2,532
Python

Project Description

LLaVA (Large Language and Vision Assistant) is a large multimodal model trained with visual instruction tuning. It combines a vision encoder with a language model and aims for GPT-4-level capabilities and beyond in multimodal, visual-language tasks.

Project Information

Created on 4/17/2023
Updated on 7/2/2025

Categories

conversational-assistant
image-processing
ai-content-generation

Tags

open-source-community
multimodal
model-deployment
data-processing
research-frontier

Topics

chatbot
chatgpt
foundation-models
gpt-4
instruction-tuning
llama
llama-2
llama2
llava
multi-modality
multimodal
vision-language-model
visual-language-learning