Awesome Speech Audio


Rao Muhammad Anwer

12 papers · 0 citations
Most-cited papers
  • TerraFM: A Scalable Foundation Model For Unified Multisensor Earth Observation
    2025
  • RAGNet: Large-scale Reasoning-based Affordance Segmentation Benchmark Towards General Grasping
    2025
  • Agent-X: Evaluating Deep Multimodal Reasoning In Vision-centric Agentic Tasks
    2025
  • All In One: Visual-description-guided Unified Point Cloud Segmentation
    2025
  • Think Before You Segment: An Object-aware Reasoning Agent For Referring Audio-visual Segmentation
    2025
Topics
  • Embodied & Agents
  • Vision-Language Models
  • Benchmarks
  • Visual QA & Reasoning
  • Uncategorized
  • Audio-Visual
  • Instruction Tuning


© 2026 Awesome Papers.