Awesome Papers
LLMsQuantumSimSearchAI4CodeAgentsCVRoboticsCyberAI4SciSpeechRLMMGenAIGraphTSRecSysFL

← authors · overview

Hai Zhao

27 papers · 983 citations
Most-cited papers
  • Pyramidinfer: Pyramid KV Cache Compression For High-throughput LLM Inference
    2024 · 130 citations
  • Keep The Cost Down: A Review On Methods To Optimize LLM' S Kv-cache Consumption
    2024 · 117 citations
Topics
EfficiencyModel ArchitectureSurvey Paper

Privacy · Terms

© 2026 Awesome Papers.