GAIA release

gaia-benchmark's Collections

Updated Nov 23, 2023

Gathers the items of the GAIA release.

GAIA: a benchmark for General AI Assistants

Paper • 2311.12983 • Published Nov 21, 2023

Note: The arXiv paper (arxiv.org/abs/2311.12983) describing the benchmark and the dataset creation methodology.

GAIA Leaderboard

Note: The leaderboard itself, with the scored models and information on how to submit a new model.

gaia-benchmark/GAIA

Note: The dataset containing the questions for the GAIA benchmark.

gaia-benchmark/results_public

Note: An open dataset of submission results.