Yichi Zhang
13 papers · 5 citations
Most-cited papers
- SPHINX: The Joint Mixing Of Weights, Tasks, And Visual Embeddings For Multi-modal Large Language Models · 2023 · 288 citations
- PyramidKV: Dynamic KV Cache Compression Based On Pyramidal Information Funneling · 2024 · 236 citations
- Making Large Language Models Perform Better In Knowledge Graph Completion · 2023 · 112 citations
- Towards End-to-end Embodied Decision Making Via Multi-modal Large Language Model: Explorations With GPT4-Vision And Beyond · 2023 · 58 citations
- STAIR: Improving Safety Alignment With Introspective Reasoning · 2025 · 56 citations
- Hierarchical Task Learning From Language Instructions With Unified Transformers And Self-monitoring · 2021 · 30 citations
- Aneumo: A Large-scale Multimodal Aneurysm Dataset With Computational Fluid Dynamics Simulations And Deep Learning Benchmarks · 2025 · 4 citations
- Proactive Assistant Dialogue Generation From Streaming Egocentric Videos · 2025 · 1 citation
- Abstractive Visual Understanding Of Multi-modal Structured Knowledge: A New Perspective For MLLM Evaluation · 2025
- Gemini 2.5: Pushing The Frontier With Advanced Reasoning, Multimodality, Long Context, And Next Generation Agentic Capabilities · 2025
- Unsupervised Defect Detection For Surgical Instruments · 2025
- UI-TARS-2 Technical Report: Advancing GUI Agent With Multi-turn Reinforcement Learning · 2025
Topics