Bin Cui
19 papers · 545 citations
Most-cited papers
- Spotserve: Serving Generative Large Language Models On Preemptible Instances2023 · 121 citations
- Buffer Of Thoughts: Thought-augmented Reasoning With Large Language Models2024 · 95 citations
- Pqcache: Product Quantization-based Kvcache For Long Context LLM Inference2024 · 88 citations
- Videotetris: Towards Compositional Text-to-video Generation2024 · 51 citations
- Cfbench: A Comprehensive Constraints-following Benchmark For Llms2024 · 51 citations
- CAFE: Towards Compact, Adaptive, And Fast Embedding For Large-scale Recommendation Models2023 · 11 citations
- Experimental Analysis Of Large-scale Learnable Vector Storage Compression2023 · 9 citations
- Pqcache: Product Quantization-based Kvcache For Long Context LLM Inference2024 · 9 citations
- Heterogeneous Adaptive Policy Optimization: Tailoring Optimization To Every Token's Nature2026
- Infinipipe: Elastic Pipeline Parallelism For Efficient Variable-length Long-context LLM Training2026
- Facilitating Multi-turn Function Calling For Llms Via Compositional Instruction Tuning2024
- Pilotrl: Training Language Model Agents Via Global Planning-guided Progressive Reinforcement Learning2025
- Siriushelper: An LLM Agent-based Operations Assistant For Big Data Platforms2026
Topics