meta-agents-research-environments/gaia2

Reinforcement Learning | EN | Benchmark

meta-agents-research-environments/gaia2 is an English-language reinforcement learning benchmark dataset, published by meta-agents-research-environments and distributed in Parquet format.

This dataset is used as an LLM benchmark.

About meta-agents-research-environments/gaia2

Dataset Summary

Gaia2 is a benchmark dataset for evaluating AI agent capabilities in simulated environments. The dataset contains 800 scenarios that test agent performance in environments where time ...

Details

Task: Reinforcement Learning
Language: EN
Format: Parquet
Rows / instances: N/A
Creator: meta-agents-research-environments
Year: 2025