Titan AI LogoTitan AI

RedPajama-Data

4,814
364
Python

Project Description

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Project Information

Created on 4/14/2023
Updated on 9/21/2025