Yuandong Tian
16 papers · 3596 citations
Most-cited papers
- Efficient Streaming Language Models With Attention Sinks2023 · 1654 citations
- Deja Vu: Contextual Sparsity For Efficient Llms At Inference Time2023 · 317 citations
- Mobilellm: Optimizing Sub-billion Parameter Language Models For On-device Use Cases2024 · 219 citations
Topics