Titan AI LogoTitan AI

VideoMind

222
23
Python

项目描述

VideoMind is a multi-modal agent framework designed for long video reasoning, emulating human-like processes to break down tasks, localize and verify moments, and synthesize answers, addressing temporal-grounded reasoning challenges.

项目信息

创建于 3/17/2025
更新于 7/1/2025

分类

image-processing
machine-learning-framework
search-and-retrieval

标签

algorithm-model
data-processing
model-deployment
open-source-community
research-frontier