Qing Li
27 papers · 7 citations
Most-cited papers
- A Survey On RAG Meeting LLMs: Towards Retrieval-augmented Large Language Models · 2024 · 808 citations
- An Embodied Generalist Agent In 3D World · 2023 · 357 citations
- 3D-VisTA: Pre-trained Transformer For 3D Vision And Text Alignment · 2023 · 245 citations
- VideoAgent: A Memory-augmented Multimodal Agent For Video Understanding · 2024 · 207 citations
- 3D-VisTA: Pre-trained Transformer For 3D Vision And Text Alignment · 2023 · 112 citations
- Enhancing Remote Sensing Image Retrieval With Triplet Deep Metric Learning Network · 2019 · 87 citations
- SceneVerse: Scaling 3D Vision-language Learning For Grounded Scene Understanding · 2024 · 53 citations
- CLOVA: A Closed-loop Visual Assistant With Tool Usage And Update · 2023 · 51 citations
- Unifying 3D Vision-language Understanding Via Promptable Queries · 2024 · 26 citations
- Learning Shared Semantic Space With Correlation Alignment For Cross-modal Event Retrieval · 2019 · 22 citations
- Prior Knowledge Integration Via LLM Encoding And Pseudo Event Regulation For Video Moment Retrieval · 2024 · 18 citations
- Product Image Recognition With Guidance Learning And Noisy Supervision · 2019 · 18 citations
- OV-NeRF: Open-vocabulary Neural Radiance Fields With Vision And Language Foundation Models For 3D Semantic Understanding · 2024 · 17 citations
- TokenRec: Learning To Tokenize ID For LLM-based Generative Recommendation · 2024 · 9 citations
- Collaborative Multi-LoRA Experts With Achievement-based Multi-tasks Loss For Unified Multimodal Information Extraction · 2025 · 3 citations
Topics