← all datasets

LongBench

Emerging

31papers using it

69,173HF downloads

181HF likes

2024first seen

LongBench is a comprehensive benchmark for multilingual and multi-task purposes, with the goal to fully measure and evaluate the ability of pre-trained language models to understand long text. This dataset consists of twenty different tasks, covering key long-text application scenarios such as multi-document QA, single

🤗 Hugging Face

Papers using LongBench (29)

IndexMem: Learned KV-Cache Eviction with Latent Memory for Long-Context LLM Inference2026

AttentionRAG: Attention-Guided Context Pruning in Retrieval-Augmented Generation2025 · 1 cites

Activation-aware Probe-Query: Effective Key-Value Retrieval for Long-Context LLMs Inference2025 · 1 cites

ProxyKV: Cross-Model Proxy Pruning for Efficient Long-Context LLM Inference2026

ART: Attention Run-time Termination for Efficient Large Language Model Decoding2026

IceCache: Memory-efficient KV-cache Management for Long-Sequence LLMs2026

LycheeDecode: Accelerating Long-Context LLM Inference via Hybrid-Head Sparse Decoding2026

Reinforced Fast Weights with Next-Sequence Prediction2026

Federation of Experts: Communication Efficient Distributed Inference for Large Language Models2026

EndPrompt: Efficient Long-Context Extension via Terminal Anchoring2026

M-RAG: Making RAG Faster, Stronger, and More Efficient2026

Developing Adaptive Context Compression Techniques for Large Language Models (LLMs) in Long-Running Interactions2026

AllMem: A Memory-centric Recipe for Efficient Long-context Modeling2026

Towards robust long-context understanding of large language model via active recap learning2026

PagedEviction: Structured Block-wise KV Cache Pruning for Efficient Large Language Model Inference2025

DSPC: Dual-Stage Progressive Compression Framework for Efficient Long-Context Reasoning2025

ChunkKV: Semantic-Preserving KV Cache Compression for Efficient Long-Context LLM Inference2025

MDocAgent: A Multi-Modal Multi-Agent Framework for Document Understanding2025

Overflow Prevention Enhances Long-Context Recurrent LLMs2025

Beyond Homogeneous Attention: Memory-Efficient LLMs via Fourier-Approximated KV Cache2025

Loong: Synthesize Long Chain-of-Thoughts at Scale through Verifiers2025

MacRAG: Compress, Slice, and Scale-up for Multi-Scale Adaptive Context RAG2025

MiniLongBench: The Low-cost Long Context Understanding Benchmark for Large Language Models2025

An Empirical Study on Prompt Compression for Large Language Models2025

PromptDistill: Query-based Selective Token Retention in Intermediate Layers for Efficient Large Language Model Inference2025

CriticalKV: Optimizing KV Cache Eviction from an Output Perturbation Perspective2025

Does RAG Really Perform Bad For Long-Context Processing?2025

Task-agnostic Prompt Compression with Context-aware Sentence Embedding and Reward-guided Task Descriptor2025

Extending Context Window of Large Language Models from a Distributional Perspective2024

LongBench — datasets — llm-papers