HelpSteer2
An open-source dataset designed for training high-performance reward models.
CommonProductOpenSourceOpen-source datasetReward model
HelpSteer2, released by NVIDIA, is an open-source dataset aimed at supporting the training of models that align towards being more helpful, factually accurate, coherent, and controllable in terms of response complexity and redundancy. Collaborated on with Scale AI, it achieved a remarkable 88.8% performance on RewardBench when used with the Llama 3 70B base model, making it one of the top-performing reward models as of June 12, 2024.
HelpSteer2 Visit Over Time
Monthly Visits
19075321
Bounce Rate
45.07%
Page per Visit
5.5
Visit Duration
00:05:32