Serena Yeung-Levy
15 papers Β· 4 citations
Most-cited papers
- Why Are Visually-grounded Language Models Bad At Image Classification?2024 Β· 3 citations
- Deforhmr: Vision Transformer With Deformable Cross-attention For 3D Human Mesh Recovery2024 Β· 1 citations
- Visualoverload: Probing Visual Understanding Of Vlms In Really Dense Scenes2025
- TTRV: Test-time Reinforcement Learning For Vision Language Models2025
Topics