Awesome Papers
LLMs
Quantum
SimSearch
AI4Code
Agents
CV
Robotics
Cyber
AI4Sci
Speech
RL
MM
GenAI
Graph
TS
RecSys
FL
☾
☀
← authors
·
overview
Gholamreza Haffari
28
papers ·
1207
citations
Most-cited papers
Minicache: KV Cache Compression In Depth Dimension For Large Language Models
2024 · 96 citations
Topics
Efficiency
Model Architecture
🤖
Ask AI