ScreenSpot-Pro
Emerging9papers using it
2025first seen
Papers using ScreenSpot-Pro (9)
- Mitigating Coordinate Prediction Bias From Positional Encoding FailuresMEGA-GUI: Multi-stage Enhanced Grounding Agents For GUI ElementsAdaptive Vision-Language Model Routing for Computer Use AgentsMolmoPoint: Better Pointing for VLMs with Grounding TokensGUI-AIMA: Aligning Intrinsic Multimodal Attention with a Context Anchor for GUI GroundingGui-shift: Enhancing Vlm-based GUI Agents Through Self-supervised Reinforcement Learning\textsc{gui-spotlight}: Adaptive Iterative Focus Refinement For Enhanced GUI Visual GroundingChain-of-ground: Improving GUI Grounding Via Iterative Reasoning And Reference FeedbackPhi-ground Tech Report: Advancing Perception In GUI Grounding