HotpotQA
Emerging, 31 papers using it
First seen: 2023
Papers using HotpotQA (31)
- Learning Query-Aware Budget-Tier Routing for Runtime Agent Memory
- Evoagentx: An Automated Framework For Evolving Agentic Workflows
- Autoplan: Automatic Planning Of Interactive Decision-making Tasks With Large Language Models
- KnowAgent: Knowledge-Augmented Planning for LLM-Based Agents
- CodeAgents: A Token-Efficient Framework for Codified Multi-Agent Reasoning in LLMs
- Retrieval-augmented Hierarchical In-context Reinforcement Learning And Hindsight Modular Reflections For Task Planning With LLMs
- MemPro: Agentic Memory Systems as Evolvable Programs
- Cascading Hallucination in Agentic RAG: The CHARM Framework for Detection and Mitigation
- AdaMEM: Test-Time Adaptive Memory for Language Agents
- ZEBRA: Zero-shot Budgeted Resource Allocation for LLM Orchestration
- Parallel Context Compaction for Long-Horizon LLM Agent Serving
- Proper Scoring Rules for Agentic Uncertainty Quantification
- Retrieval as Reasoning: Self-Evolving Agent-Native Retrieval via LLM-Wiki
- Tool-Schema Compression Enables Agentic RAG Under Constrained Context Budgets
- StepOPSD: Step-Aware Online Preference Distillation for Agent Reinforcement Learning
- Prompt Codebooks: Discrete Compositional Optimization for Language Model Instruction Refinement
- Critic-R: Improving Agentic Search using Instruction-tuned Retrievers with Natural Language Introspective Feedback
- When Do LLM Agents Treat Surface Noise Differently from Semantic Noise? A 68-Cell Measurement Study with a Held-Out Trace-Level Validation
- Answer Only as Precisely as Justified: Calibrated Claim-Level Specificity Control for Agentic Systems
- GRASP: Graph Agentic Search over Propositions for Multi-hop Question Answering
- Reasoning Topology Matters: Network-of-thought For Complex Reasoning Tasks
- Scaling Multi-agent Systems: A Smart Middleware for Improving Agent Interactions
- MemSkill: Learning and Evolving Memory Skills for Self-Evolving Agents
- PseudoAct: Leveraging Pseudocode Synthesis for Flexible Planning and Action Control in Large Language Model Agents
- A2RAG: Adaptive Agentic Graph Retrieval for Cost-Aware and Reliable Reasoning
- MAR: Multi-Agent Reflexion Improves Reasoning Abilities in LLMs
- Mission Impossible: Feedback-Guided Dynamic Interactive Planning for Improving Reasoning on LLMs
- DebFlow: Automating Agent Creation via Agent Debate
- Multi-granular Training Strategies for Robust Multi-hop Reasoning Over Noisy and Heterogeneous Knowledge Sources
- Smurfs: Multi-agent System Using Context-efficient DFSDT For Tool Planning
- AutoPlan: Automatic Planning of Interactive Decision-Making Tasks With Large Language Models