Titan AI LogoTitan AI

OnnxStream

2,014
89
C++

Project Description

Lightweight inference library for ONNX files, written in C++. It can run Stable Diffusion XL 1.0 on a RPI Zero 2 (or in 298MB of RAM) but also Mistral 7B on desktops and servers. ARM, x86, WASM, RISC-V supported. Accelerated by XNNPACK. Python, C# and JS(WASM) bindings available.

Project Information

Created on 7/14/2023
Updated on 12/29/2025