ai-agent-benchmark-compendium

Name: ai-agent-benchmark-compendium
Rating: 1.029 (58 reviews)
Author: Open Source Community

A compendium of more than 50 benchmarks for evaluating AI agents, organized into four categories: Function Calling & Tool Use, General Assistant & Reasoning, Coding & Software Engineering, and Computer Interaction.