MuSiQue
Emerging6papers using it
4,792HF downloads
6HF likes
2025first seen
The 'MuSiQue' dataset/benchmark is used to evaluate the factual correctness of answers generated by large language model systems in the context of knowledge graph retrieval and generation.
Papers using MuSiQue (6)
- Retrieval as Reasoning: Self-Evolving Agent-Native Retrieval via LLM-WikiMeMo: Memory as a ModelPersonalAI 2.0: Enhancing knowledge graph traversal/retrieval with planning mechanism for Personalized LLM AgentsWikontic: Constructing Wikidata-Aligned, Ontology-Aware Knowledge Graphs with Large Language ModelsMacRAG: Compress, Slice, and Scale-up for Multi-Scale Adaptive Context RAGChain-of-Thought Matters: Improving Long-Context Language Models with
Reasoning Path Supervision