HH-RLHF
Emerging2papers using it
32,091HF downloads
1,794HF likes
2024first seen
Dataset Card for HH-RLHF Dataset Summary This repository provides access to two different kinds of data: Human preference data about helpfulness and harmlessness from Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback. These data are meant to train preference (or reward) models fo
π€ Hugging Faceβ mit