← all datasets

OffTopicEval

Emerging
2papers using it
272HF downloads
6HF likes
2025first seen

OffTopicEval: When Large Language Models Enter the Wrong Chat, Almost Always! Paper: https://huggingface.co/papers/2509.26495 Code: https://github.com/declare-lab/OffTopicEval Note: We release OffTopicEval, a multilingual evaluation suite for measuring operational safety of large language models (LLMs). The benchmark i

Papers using OffTopicEval (2)