Titan AI LogoTitan AI

SFTvsRL

292
16
Python

Project Description

Official implementation of paper: SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Project Information

Created on 1/27/2025
Updated on 10/1/2025