DeepSeek-R-1
Emerging7papers using it
2025first seen
The 'Deepseek R1' dataset/benchmark is used to evaluate the efficiency and effectiveness of various attention mechanisms, including Multi-head Latent Attention (MLA) and Group Query Attention (GQA), in large language models.
Papers using DeepSeek-R-1 (7)
- TransMLA: Multi-head Latent Attention Is All You NeedSWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open
Software EvolutionPRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday
Home ClustersLearning a Continue-Thinking Token for Enhanced Test-Time ScalingFrom Harm to Help: Turning Reasoning In-Context Demos into Assets for
Reasoning LMsRRTL: Red Teaming Reasoning Large Language Models in Tool LearningAdaptive Rectification Sampling for Test-Time Compute Scaling