Titan AI

GPTQModel

799
113
Python

Project Description

LLM quantization (compression) toolkit with hardware acceleration support for Nvidia CUDA, AMD ROCm, Intel XPU, and Intel/AMD/Apple CPUs via Hugging Face (HF), vLLM, and SGLang.

Project Information

Created on 6/17/2024
Updated on 9/26/2025