Awesome Speech Audio
πŸ“„Papers🧭TopicsπŸ”₯TrendingπŸ—ΊοΈMapπŸ†LeaderboardsπŸ€–Ask AI
β‹―More
πŸ‘₯AuthorsπŸ“šReading PacksπŸ› οΈToolsπŸ“Blogsβœ‰οΈNewsletterπŸ”–Saved
+ Add Paper

← authors Β· overview

Xiu Li

18 papers Β· 12 citations
Most-cited papers
  • Scalablevit: Rethinking The Context-oriented Generalization Of Vision Transformer
    2022 Β· 45 citations
  • A Survey Of Camouflaged Object Detection And Beyond
    2024 Β· 33 citations
  • Unihead: Unifying Multi-perception For Detection Heads
    2023 Β· 22 citations
  • A Two-stage Reinforcement Learning-based Approach For Multi-entity Task Allocation
    2024 Β· 20 citations
  • GRA: Detecting Oriented Objects Through Group-wise Rotating And Attention
    2024 Β· 16 citations
  • Segment Concealed Objects With Incomplete Supervision
    2025 Β· 12 citations
  • Segment Concealed Objects With Incomplete Supervision
    2025 Β· 12 citations
  • Controllable Video Generation: A Survey
    2025
  • Mindomni: Unleashing Reasoning Generation In Vision Language Models With RGPO
    2025
  • Haploomni: Unified Single Transformer For Multimodal Video Understanding And Generation
    2025
  • Linear Differential Vision Transformer: Learning Visual Contrasts Via Pairwise Differentials
    2025
  • Bias-reduced Multi-step Hindsight Experience Replay For Efficient Multi-goal Reinforcement Learning
    2021
  • Rethinking Goal-conditioned Supervised Learning And Its Connection To Offline RL
    2022
  • Decentralized Transformers With Centralized Aggregation Are Sample-efficient Multi-agent World Models
    2024
Topics
Object DetectionMulti-AgentUncategorizedVideo-Language3D VisionSegmentationVision-Language ModelsVideo UnderstandingVisual QA & ReasoningInstruction Tuning

Stay Updated

E-Mail Digest

Submit a paper Β· Privacy Β· Terms

Β© 2026 Awesome Papers.