Yuyin Zhou
12 papers · 0 citations
Most-cited papers
- Mamba-R: Vision Mamba ALSO Needs Registers · 2024 · 20 citations
- Sculpting Holistic 3D Representation In Contrastive Language-Image-3D Pre-training · 2023 · 9 citations
- Where On Earth? A Vision-language Benchmark For Probing Model Geolocation Skills Across Scales · 2025
- Harnessing EHRs For Diffusion-based Anomaly Detection On Chest X-rays · 2025
- MedVLThinker: Simple Baselines For Multimodal Medical Reasoning · 2025
- More Thinking, Less Seeing? Assessing Amplified Hallucination In Multimodal Reasoning Models · 2025
- OpenVision 2: A Family Of Generative Pretrained Visual Encoders For Multimodal Learning · 2025
- Controllable Layered Image Generation For Real-world Editing · 2026
Topics