VSI-Bench
Emerging
20 papers using it
First seen: 2025
Papers using VSI-Bench (20)
- Think3D: Thinking with Space for Spatial Reasoning
- Euclid's Gift: Enhancing Spatial Perception and Reasoning in Vision-Language Models via Geometric Surrogate Tasks
- Omni-View: Unlocking How Generation Facilitates Understanding in Unified 3D Model based on Multiview images
- EgoMind: Activating Spatial Cognition through Linguistic Reasoning in MLLMs
- Thinking with Spatial Code for Physical-World Video Reasoning
- Boosting MLLM Spatial Reasoning with Geometrically Referenced 3D Scene Representations
- Vision-aligned Latent Reasoning for Multi-modal Large Language Model
- Thinking with Geometry: Active Geometry Integration for Spatial Reasoning
- Spa3R: Predictive Spatial Field Modeling for 3D Visual Reasoning
- Video Evidence to Reasoning Efficient Video Understanding via Explicit Evidence Grounding
- Video Spatial Reasoning with Object-Centric 3D Rollout
- SpaceMind: Camera-Guided Modality Fusion for Spatial Reasoning in Vision-Language Models
- Beyond Pixels: Introducing Geometric-Semantic World Priors for Video-based Embodied Models via Spatio-temporal Alignment
- VideoAnchor: Reinforcing Subspace-Structured Visual Cues for Coherent Visual-Spatial Reasoning
- Scaling Spatial Intelligence With Multimodal Foundation Models
- Visuospatial Cognitive Assistant
- See&trek: Training-free Spatial Prompting For Multimodal Large Language Model
- Vs-bench: Evaluating Vlms For Strategic Abilities In Multi-agent Environments
- InternSpatial: A Comprehensive Dataset for Spatial Reasoning in Vision-Language Models
- Towards Visuospatial Cognition via Hierarchical Fusion of Visual Experts