BabyAI
Emerging3papers using it
2025first seen
'BabyAI' is a dataset and benchmark that contains a series of tasks designed to evaluate the performance of language model agents in interactive environments, focusing on their ability to understand and execute instructions.