Beidi Chen
20 papers · 3184 citations
Most-cited papers
- Efficient Streaming Language Models With Attention Sinks · 2023 · 1654 citations
- GaLore: Memory-Efficient LLM Training By Gradient Low-Rank Projection · 2024 · 426 citations
- Deja Vu: Contextual Sparsity For Efficient LLMs At Inference Time · 2023 · 317 citations
- LayerSkip: Enabling Early Exit Inference And Self-Speculative Decoding · 2024 · 234 citations
- TriForce: Lossless Acceleration Of Long Sequence Generation With Hierarchical Speculative Decoding · 2024 · 97 citations
Topics