HellaSwag

Canonical

10 papers using it

246,174 HF downloads

180 HF likes

2024 first seen

Dataset Card for "hellaswag"

Dataset Summary: "HellaSwag: Can a Machine Really Finish Your Sentence?" is a new dataset for commonsense NLI. A paper was published at ACL 2019.

Supported Tasks and Leaderboards: More Information Needed

Languages: More Information Needed

Dataset Structure, Data Instances, default: Size of downloade

🤗 Hugging Face

Papers using HellaSwag (10)

QUIET: A Multi-Blank Cascaded Story Cloze Benchmark for LLM Creative Generation Capability · 2026

Making Bias Non-Predictive: Training Robust LLM Reasoning via Reinforcement Learning · 2026

Data-Free Pruning of Self-Attention Layers in LLMs · 2025

On Robustness and Reliability of Benchmark-Based Evaluation of LLMs · 2025

Turning the Spell Around: Lightweight Alignment Amplification via Rank-One Safety Injection · 2025

Uncovering Cross-Linguistic Disparities in LLMs using Sparse Autoencoders · 2025

Turning the Spell Around: Lightweight Alignment Amplification via Rank-One Safety Injection · 2025

Self-Reasoning Language Models: Unfold Hidden Reasoning Chains with Few Reasoning Catalyst · 2025

More is Less: The Pitfalls of Multi-Model Synthetic Preference Data in DPO Safety Alignment · 2025

Teuken-7B-Base & Teuken-7B-Instruct: Towards European LLMs · 2024 · 2 cites