VideoMind

Name: VideoMind
Rating: 1.111 (222 reviews)
Author: Open Source Community

222

Python

VideoMind is a multi-modal agent framework designed for long video reasoning, emulating human-like processes to break down tasks, localize and verify moments, and synthesize answers, addressing temporal-grounded reasoning challenges.

项目信息

创建于 3/17/2025

更新于 7/1/2025

分类

image-processing

machine-learning-framework

search-and-retrieval

VideoMind

项目描述

项目信息

分类

标签