A minimal Python framework for building custom AI inference servers with full control over logic, batching, and scaling.