Titan AI

ml-fastvlm

4,285
232
Python

Project Description

FastVLM is a Python project for efficient vision encoding in vision language models. Its novel hybrid vision encoder reduces encoding time and outputs fewer tokens for high-resolution images.

Project Information

Created: 5/1/2025
Updated: 7/1/2025

Categories

image-processing
machine-learning-framework
model-compression

Tags

algorithm-model
data-processing
open-source-community
research-frontier
model-deployment