Multi-Mission Tool Bench

Emerging

2 papers using it

2025 first seen

The Multi-Mission Tool Bench is a benchmark that contains multiple interrelated missions designed to evaluate the robustness of large language model-based agents in dynamically adapting to evolving demands and mission-switching patterns.

Papers using Multi-Mission Tool Bench (2)

Multi-mission Tool Bench: Assessing The Robustness Of LLM Based Agents Through Related And Dynamic Missions · 2025 · 1 cites

Multi-Mission Tool Bench: Assessing the Robustness of LLM based Agents through Related and Dynamic Missions · 2025 · 1 cites