Titan AI

safe-rlhf

1,556
127
Python

Project Description

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Project Information

Created on 5/15/2023
Updated on 11/18/2025