Titan AI

PaLM-rlhf-pytorch

Stars: 7,865
Forks: 680
Language: Python

Project Description

Implementation of RLHF (Reinforcement Learning from Human Feedback) on top of the PaLM architecture. Basically ChatGPT, but with PaLM.

Project Information

Created on 12/9/2022
Updated on 9/6/2025