← all datasets

MTBench

Emerging
3papers using it
2025first seen

MTBench is a large-scale benchmark that contains paired time series and textual data from financial and weather domains, used to evaluate large language models on their ability to understand and reason across both structured numerical trends and unstructured textual narratives.

Papers using MTBench (3)

MTBench β€” datasets β€” multimodal