Titan AI

OmniParser

22,555
1,896
Jupyter Notebook

Project Description

OmniParser is a screen parsing tool that converts user interface screenshots into structured elements, enhancing the ability of vision-based GUI agents to generate accurate actions grounded in interface regions.
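To make "structured elements" concrete, here is a minimal sketch of how an agent might consume a parsed screenshot. The `UIElement` schema, the `find_element` and `click_point` helpers, and the sample data are all hypothetical illustrations; they are not OmniParser's actual output format or API.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass(frozen=True)
class UIElement:
    """One parsed screen region (hypothetical schema, not OmniParser's real output)."""
    label: str                                # short description of the region
    bbox: Tuple[float, float, float, float]   # (x_min, y_min, x_max, y_max) in pixels
    interactable: bool                        # whether an agent could click or type here


def click_point(element: UIElement) -> Tuple[float, float]:
    """Return the center of the element's bounding box as a click target."""
    x_min, y_min, x_max, y_max = element.bbox
    return ((x_min + x_max) / 2, (y_min + y_max) / 2)


def find_element(elements: List[UIElement], query: str) -> Optional[UIElement]:
    """Return the first interactable element whose label contains the query."""
    q = query.lower()
    for element in elements:
        if element.interactable and q in element.label.lower():
            return element
    return None


# Made-up elements standing in for the result of parsing one screenshot.
elements = [
    UIElement("Search field", (40, 20, 400, 60), True),
    UIElement("Page title", (40, 80, 400, 120), False),
    UIElement("Submit button", (420, 20, 520, 60), True),
]

target = find_element(elements, "submit")
```

With a representation along these lines, an action such as "click Submit" can be tied to a specific region, for example via `click_point(target)`, instead of being guessed from raw pixels.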

Project Information

Created on 9/20/2024
Updated on 7/2/2025

Categories

image-processing
machine-learning-framework
ai-development-platform

Tags

algorithm-model
data-processing
model-deployment
open-source-community
research-frontier