Xin Chen
19 papers · 31 citations
Most-cited papers
- Appagent: Multimodal Agents As Smartphone Users2023 · 406 citations
- Internlm2 Technical Report2024 · 378 citations
- LL3DA: Visual Interactive Instruction Tuning For Omni-3d Understanding, Reasoning, And Planning2023 · 216 citations
- Qa-lora: Quantization-aware Low-rank Adaptation Of Large Language Models2023 · 176 citations
- Chartllama: A Multimodal LLM For Chart Understanding And Generation2023 · 149 citations
- Exploring Lightweight Hierarchical Vision Transformers For Efficient Visual Tracking2023 · 108 citations
- End-to-end 3D Dense Captioning With Vote2cap-detr2023 · 49 citations
- Sutrack: Towards Simple And Unified Single Object Tracking2024 · 32 citations
- Text-visual Prompting For Efficient 2D Temporal Video Grounding2023 · 29 citations
- NTIRE 2025 XGC Quality Assessment Challenge: Methods And Results2025 · 26 citations
- Vote2cap-detr++: Decoupling Localization And Describing For End-to-end 3D Dense Captioning2023 · 25 citations
- Extreme Cardiac MRI Analysis Under Respiratory Motion: Results Of The Cmrxmotion Challenge2025 · 3 citations
- E-bayessam: Efficient Bayesian Adaptation Of SAM With Self-optimizing Kan-based Interpretation For Uncertainty-aware Ultrasonic Segmentation2025 · 1 citations
- MARS2 2025 Challenge On Multimodal Reasoning: Datasets, Methods, Results, Discussion, And Outlook2025 · 1 citations
- GUI-GENESIS: Automated Synthesis Of Efficient Environments With Verifiable Rewards For GUI Agent Post-training2026
Topics