Titan AI

helm

Name: helm
Rating: 2.277 (2554 reviews)
Author: Open Source Community

2,554

344

Language: Python

Holistic Evaluation of Language Models (HELM) is an open-source Python framework created by the Center for Research on Foundation Models (CRFM) at Stanford for holistic, reproducible, and transparent evaluation of foundation models, including large language models (LLMs) and multimodal models.

Project Information

Created on 11/29/2021

Updated on 11/28/2025
