← all datasets

HH-RLHF

Emerging
2papers using it
32,091HF downloads
1,794HF likes
2024first seen

Dataset Card for HH-RLHF Dataset Summary This repository provides access to two different kinds of data: Human preference data about helpfulness and harmlessness from Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback. These data are meant to train preference (or reward) models fo

Papers using HH-RLHF (2)

HH-RLHF β€” datasets β€” reinforcement-learning