HelpSteer2

An open-source dataset designed for training high-performance reward models.

CommonProductOpenSourceOpen-source datasetReward model
HelpSteer2, released by NVIDIA, is an open-source dataset aimed at supporting the training of models that align towards being more helpful, factually accurate, coherent, and controllable in terms of response complexity and redundancy. Collaborated on with Scale AI, it achieved a remarkable 88.8% performance on RewardBench when used with the Llama 3 70B base model, making it one of the top-performing reward models as of June 12, 2024.
Visit

HelpSteer2 Visit Over Time

Monthly Visits

20899836

Bounce Rate

46.04%

Page per Visit

5.2

Visit Duration

00:04:57

HelpSteer2 Visit Trend

HelpSteer2 Visit Geography

HelpSteer2 Traffic Sources

HelpSteer2 Alternatives