MuSiQue

Emerging

6papers using it

4,792HF downloads

6HF likes

2025first seen

The 'MuSiQue' dataset/benchmark is used to evaluate the factual correctness of answers generated by large language model systems in the context of knowledge graph retrieval and generation.

🤗 Hugging Face

Papers using MuSiQue (6)

Retrieval as Reasoning: Self-Evolving Agent-Native Retrieval via LLM-Wiki2026

MeMo: Memory as a Model2026

PersonalAI 2.0: Enhancing knowledge graph traversal/retrieval with planning mechanism for Personalized LLM Agents2026

Wikontic: Constructing Wikidata-Aligned, Ontology-Aware Knowledge Graphs with Large Language Models2025

MacRAG: Compress, Slice, and Scale-up for Multi-Scale Adaptive Context RAG2025

Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision2025