RULER
Emerging18papers using it
1,488HF downloads
8HF likes
2024first seen
This is a synthetic dataset generated using π RULER: Whatβs the Real Context Size of Your Long-Context Language Models?. It can be used to evaluate long-context language models with configurable sequence length and task complexity. Currently, It includes 4 tasks from RULER: QA2 (hotpotqa after adding distracting infor
Papers using RULER (18)
- IndexMem: Learned KV-Cache Eviction with Latent Memory for Long-Context LLM InferenceRecaLLM: Addressing the Lost-in-Thought Phenomenon with Explicit In-Context RetrievalProxyKV: Cross-Model Proxy Pruning for Efficient Long-Context LLM InferenceLycheeDecode: Accelerating Long-Context LLM Inference via Hybrid-Head Sparse DecodingLongAct: Harnessing Intrinsic Activation Patterns for Long-Context Reinforcement LearningFocuSFT: Bilevel Optimization for Dilution-Aware Long-Context Fine-TuningBenchmarking EngGPT2-16B-A3B against Comparable Italian and International Open-source LLMsTowards Long-Horizon Interpretability: Efficient and Faithful Multi-Token Attribution for Reasoning LLMsTowards robust long-context understanding of large language model via active recap learningMTraining: Distributed Dynamic Sparse Attention for Efficient Ultra-Long Context TrainingLongMagpie: A Self-synthesis Method for Generating Large-scale Long-context InstructionsScaling Instruction-Tuned LLMs to Million-Token Contexts via
Hierarchical Synthetic Data GenerationEffective Length Extrapolation via Dimension-Wise Positional Embeddings
ManipulationCriticalKV: Optimizing KV Cache Eviction from an Output Perturbation PerspectiveDoes RAG Really Perform Bad For Long-Context Processing?NExtLong: Toward Effective Long-Context Training without Long DocumentsWhy Does the Effective Context Length of LLMs Fall Short?Breaking the Stage Barrier: A Novel Single-Stage Approach to Long
Context Extension for Large Language Models