2WikiMultihopQA
Emerging6papers using it
2026first seen
The '2WikiMultihopQA' dataset is a benchmark that contains multi-hop question-answer pairs derived from Wikipedia, used to evaluate the ability of models to perform reasoning across multiple documents to answer complex questions.
Papers using 2WikiMultihopQA (6)
- Cascading Hallucination in Agentic RAG: The CHARM Framework for Detection and MitigationRetrieval as Reasoning: Self-Evolving Agent-Native Retrieval via LLM-WikiCritic-R: Improving Agentic Search using Instruction-tuned Retrievers with Natural Language Introspective FeedbackGRASP: Graph Agentic Search over Propositions for Multi-hop Question AnsweringMiniPIC: Flexible Position-Independent Caching in <100LOCA2RAG: Adaptive Agentic Graph Retrieval for Cost-Aware and Reliable Reasoning