Xiu Li

18 papers · 12 citations

Most-cited papers

ScalableViT: Rethinking The Context-oriented Generalization Of Vision Transformer
2022 · 45 citations
A Survey Of Camouflaged Object Detection And Beyond
2024 · 33 citations
UniHead: Unifying Multi-perception For Detection Heads
2023 · 22 citations
A Two-stage Reinforcement Learning-based Approach For Multi-entity Task Allocation
2024 · 20 citations
GRA: Detecting Oriented Objects Through Group-wise Rotating And Attention
2024 · 16 citations
Segment Concealed Objects With Incomplete Supervision
2025 · 12 citations
Controllable Video Generation: A Survey
2025
MindOmni: Unleashing Reasoning Generation In Vision Language Models With RGPO
2025
HaploOmni: Unified Single Transformer For Multimodal Video Understanding And Generation
2025
Linear Differential Vision Transformer: Learning Visual Contrasts Via Pairwise Differentials
2025
Bias-reduced Multi-step Hindsight Experience Replay For Efficient Multi-goal Reinforcement Learning
2021
Rethinking Goal-conditioned Supervised Learning And Its Connection To Offline RL
2022
Decentralized Transformers With Centralized Aggregation Are Sample-efficient Multi-agent World Models
2024

Topics

Object Detection · Multi-Agent · Uncategorized · Video-Language · 3D Vision · Segmentation · Vision-Language Models · Video Understanding · Visual QA & Reasoning · Instruction Tuning