← all datasets

Qwen-2.5-math-7B

Emerging
4papers using it
2025first seen

The 'Qwen2.5-Math-7B' dataset/benchmark contains prompts designed for evaluating reinforcement learning with verifiable rewards (RLVR) in deterministic outcome reasoning tasks.

Papers using Qwen-2.5-math-7B (4)

Qwen-2.5-math-7B β€” datasets β€” reinforcement-learning