Yan Wang
36 papers · 3 citations
Most-cited papers
- An Embodied Generalist Agent In 3D World2023 · 357 citations
- Enhancing Recommender Systems With Large Language Model Reasoning Graphs2023 · 76 citations
- Camixersr: Only Details Need More "attention"2024 · 64 citations
- Sceneverse: Scaling 3D Vision-language Learning For Grounded Scene Understanding2024 · 53 citations
- Ml-bench: Evaluating Large Language Models And Agents For Machine Learning Tasks On Repository-level Code2023 · 49 citations
- Asynchronous Large Language Model Enhanced Planner For Autonomous Driving2024 · 45 citations
- SHREC'22 Track: Sketch-based 3D Shape Retrieval In The Wild2022 · 13 citations
- PICD: Versatile Perceptual Image Compression With Diffusion Rendering2025 · 2 citations
- Efficient Multi-camera Tokenization With Triplanes For End-to-end Driving2025 · 1 citations
- Sparseoccvla: Bridging Occupancy And Vision-language Models Via Sparse Queries For Unified 4D Scene Understanding And Planning2026
- Multimodal Hypothetical Summary For Retrieval-based Multi-image Question Answering2024
- Compressing Then Matching: An Efficient Pre-training Paradigm For Multimodal Embedding2025
- CREM: Compression-driven Representation Enhancement For Multimodal Retrieval And Comprehension2026
- Viba: Implicit Bundle Adjustment With Geometric And Temporal Consistency For Robust Visual Matching2026
- Synmotion: Semantic-visual Adaptation For Motion Customized Video Generation2026
Topics