Titan AI LogoTitan AI

olmocr

13,099
942
Python

项目描述

olmocr is a Python toolkit that converts PDFs and image-based documents into clean, readable plain text, supporting complex formatting, equations, tables, and handwriting.

Project Information

Created on 9/17/2024
Updated on 7/2/2025

Categories

image-processing
machine-learning-framework
text-processing

Tags

data-processing
algorithm-model
open-source-community
model-deployment
development-tools