← all datasets

BabyAI

Emerging
3papers using it
2025first seen

'BabyAI' is a dataset and benchmark that contains a series of tasks designed to evaluate the performance of language model agents in interactive environments, focusing on their ability to understand and execute instructions.

Papers using BabyAI (3)

BabyAI β€” datasets β€” ai-agents