arxiv:2411.13543
Maciej Wolczyk
rahid
AI & ML interests
None yet
Recent Activity
authored
a paper
2 days ago
BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games
Organizations
Papers
1
models
None public yet
datasets
None public yet