RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.