Titan AI

MOSS-RLHF

1,401
104
Python

Project Description

Secrets of RLHF in Large Language Models Part I: PPO

Project Information

Created on 7/5/2023
Updated on 10/31/2025