Zhongwei Wan
11 papers Β· 669 citations
Most-cited papers
- Efficient Large Language Models: A Survey2023 Β· 223 citations
- SVD-LLM: Truncation-aware Singular Value Decomposition For Large Language Model Compression2024 Β· 208 citations
- Scaling Laws With Vocabulary: Larger Models Deserve Larger Vocabularies2024 Β· 112 citations
- LOOK-M: Look-once Optimization In KV Cache For Efficient Multimodal Long-context Inference2024 Β· 84 citations
- D2O: Dynamic Discriminative Operations For Efficient Long-context Inference Of Large Language Models2024 Β· 15 citations
Topics