Titan AI LogoTitan AI

ml-fastvlm

4,285
232
Python

项目描述

FastVLM is a Python-based project that offers an efficient vision encoding solution for vision language models, featuring a novel hybrid vision encoder that reduces encoding time and outputs fewer tokens for high-resolution images.

Project Information

Created on 5/1/2025
Updated on 7/1/2025

Categories

image-processing
machine-learning-framework
model-compression

Tags

algorithm-model
data-processing
open-source-community
research-frontier
model-deployment