Kaipeng Zhang
17 papers · 0 citations
Most-cited papers
- Omniquant: Omnidirectionally Calibrated Quantization For Large Language Models2023 · 385 citations
- Onellm: One Framework To Align All Modalities With Language2023 · 231 citations
- Imagebind-llm: Multi-modality Instruction Tuning2023 · 174 citations
- SPHINX-X: Scaling Data And Parameters For A Family Of Multi-modal Large Language Models2024 · 149 citations
- Lumina-next: Making Lumina-t2x Stronger And Faster With Next-dit2024 · 123 citations
- Onellm: One Framework To Align All Modalities With Language2023 · 79 citations
- Diffagent: Fast And Accurate Text-to-image API Selection With Large Language Model2024 · 5 citations
- Yume: An Interactive World Generation Model2025
- Sridbench: Benchmark Of Scientific Research Illustration Drawing Of Image Generation Model2025
- Symbolic Graphics Programming With Large Language Models2025
- Internspatial: A Comprehensive Dataset For Spatial Reasoning In Vision-language Models2025
- Tir-bench: A Comprehensive Benchmark For Agentic Thinking-with-images Reasoning2025
- Samrefiner: Taming Segment Anything Model For Universal Mask Refinement2025
- Focal Guidance: Unlocking Controllability From Semantic-weak Layers In Video Diffusion Models2026
Topics