HellaSwag
Canonical10papers using it
246,174HF downloads
180HF likes
2024first seen
Dataset Card for "hellaswag" Dataset Summary HellaSwag: Can a Machine Really Finish Your Sentence? is a new dataset for commonsense NLI. A paper was published at ACL2019. Supported Tasks and Leaderboards More Information Needed Languages More Information Needed Dataset Structure Data Instances default Size of downloade
Papers using HellaSwag (10)
- QUIET: A Multi-Blank Cascaded Story Cloze Benchmark for LLM Creative Generation CapabilityMaking Bias Non-Predictive: Training Robust LLM Reasoning via Reinforcement LearningData-Free Pruning of Self-Attention Layers in LLMsOn Robustness and Reliability of Benchmark-Based Evaluation of LLMsTurning the Spell Around: Lightweight Alignment Amplification via Rank-One Safety InjectionUncovering Cross-Linguistic Disparities in LLMs using Sparse AutoencodersTurning the Spell Around: Lightweight Alignment Amplification via
Rank-One Safety InjectionSelf-Reasoning Language Models: Unfold Hidden Reasoning Chains with Few Reasoning CatalystMore is Less: The Pitfalls of Multi-Model Synthetic Preference Data in DPO Safety AlignmentTeuken-7B-Base & Teuken-7B-Instruct: Towards European LLMs