Awesome Similarity Search

Zhaozhuo Xu

14 papers · 227 citations
Most-cited papers
  • KV Cache Is 1 Bit Per Channel: Efficient Large Language Model Inference With Coupled Quantization
    2024 · 75 citations
  • KV Cache Compression, But What Must We Give In Return? A Comprehensive Benchmark Of Long Context Capable Approaches
    2024 · 44 citations
  • Zeroth-order Fine-tuning Of LLMs With Extreme Sparsity
    2024 · 34 citations
  • NoMAD-Attention: Efficient LLM Inference On CPUs Through Multiply-add-free Attention
    2024 · 21 citations
Topics
Efficiency · Model Architecture · Code · Evaluation · Fine-Tuning · Training Techniques

© 2026 Awesome Papers.