Titan AI

ai-agent-benchmark-compendium

Project Description

A compendium of more than 50 benchmarks for evaluating AI agents, grouped into four categories: Function Calling & Tool Use, General Assistant & Reasoning, Coding & Software Engineering, and Computer Interaction.

Project Information

Created on 10/15/2025
Updated on 10/16/2025