Needle-in-a-Haystack
Emerging · 6 papers using it
First seen: 2025
Papers using Needle-in-a-Haystack (6)
- CONF-KV: Confidence-Aware KV Cache Eviction with Mixed-Precision Storage for Long-Horizon LLM
- MTraining: Distributed Dynamic Sparse Attention for Efficient Ultra-Long Context Training
- ChunkKV: Semantic-Preserving KV Cache Compression for Efficient Long-Context LLM Inference
- Can Compressed LLMs Truly Act? An Empirical Evaluation of Agentic Capabilities in LLM Compression
- PromptDistill: Query-based Selective Token Retention in Intermediate Layers for Efficient Large Language Model Inference
- Pause-Tuning for Long-Context Comprehension: A Lightweight Approach to LLM Attention Recalibration