Titan AI LogoTitan AI

lmm-r1

828
54
Python

Project Description

Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.

Project Information

Created on 2/13/2025
Updated on 11/4/2025