Titan AI

OmniParser

Stars: 22,555
Forks: 1,896
Language: Jupyter Notebook

Project Description

OmniParser is a screen parsing tool that converts user interface screenshots into structured elements, enhancing the ability of vision-based GUI agents to generate accurate actions grounded in interface regions.

Project Information

Created on 9/20/2024
Updated on 7/2/2025

Categories

ai-development-platform
image-processing
machine-learning-framework

Tags

algorithm-model
data-processing
model-deployment
open-source-community
research-frontier