Titan AI

preparedness

790
78
Python

Project Description

The Preparedness project provides code for evaluating AI models using nanoeval and alcatraz. It includes evals such as PaperBench, with SWELancer, MLE-bench, and SWE-bench listed as upcoming.

Project Information

Created on 3/28/2025
Updated on 7/1/2025

Categories

data-science
ai-development-platform
machine-learning-framework

Tags

development-tools
data-processing
open-source-community
research-frontier
algorithm-model