Hailin Zhang
4 papers · 31 citations
Most-cited papers
- CAFE: Towards Compact, Adaptive, And Fast Embedding For Large-scale Recommendation Models2023 · 11 citations
- Experimental Analysis Of Large-scale Learnable Vector Storage Compression2023 · 9 citations
- Pqcache: Product Quantization-based Kvcache For Long Context LLM Inference2024 · 9 citations
Top co-authors
Topics