BabyAI-Text
Emerging, 2 papers using it
First seen 2025
BabyAI-Text is a benchmark featuring low-level action spaces, used to evaluate the decision-making capabilities of large language models in complex scenarios.