Qwen-3-1.7B
Emerging (3 papers using it)
First seen: 2025
Qwen-3-1.7B is a 1.7-billion-parameter language model from the Qwen3 family, listed here because papers use it as an evaluation setting for reasoning tasks. In those papers it serves to assess how well the ADPO method improves response quality and robustness.