R-2R
Emerging · 11 papers using it
First seen: 2025
The R-2R (R2R) benchmark is a collection of navigation tasks for evaluating Vision-Language Navigation (VLN) systems. Agents must follow natural language instructions to navigate through real-world environments.
Papers using R-2R (11)
- Does Peer Observation Help? Vision-sharing Collaboration For Vision-language Navigation
- Fine-Grained Instruction-Guided Graph Reasoning for Vision-and-Language Navigation
- Think Hierarchically, Act Dynamically: Hierarchical Multi-modal Fusion and Reasoning for Vision-and-Language Navigation
- CoNav: Collaborative Cross-Modal Reasoning for Embodied Navigation
- Landmark-Guided Knowledge for Vision-and-Language Navigation
- Global Commander and Local Operative: A Dual-Agent Framework for Scene Navigation
- Implicit Geometry Representations for Vision-and-Language Navigation from Web Videos
- Trajectory-Diversity-Driven Robust Vision-and-Language Navigation
- Does Peer Observation Help? Vision-Sharing Collaboration for Vision-Language Navigation
- Structured Observation Language for Efficient and Generalizable Vision-Language Navigation
- Vision-and-Language Navigation with Analogical Textual Descriptions in LLMs