IFBench
Emerging3papers using it
2025first seen
'IFBench' is a dataset used to evaluate the performance of language models through a set of binary questions that assess various aspects of their outputs.
'IFBench' is a dataset used to evaluate the performance of language models through a set of binary questions that assess various aspects of their outputs.