
2WikiMultihopQA

Emerging
6 papers using it
First seen: 2026

2WikiMultihopQA is a benchmark of multi-hop question-answer pairs derived from Wikipedia. It is used to evaluate whether models can reason across multiple documents to answer complex questions.

Papers using 2WikiMultihopQA (6)
