olmocr is a Python toolkit that converts PDFs and image-based documents into clean, readable plain text. It handles complex formatting, equations, tables, and handwriting.