Skywork-Reward-Gemma-2-27B

An advanced reward model based on the Gemma-2-27B architecture

CommonProductProgrammingReward ModelPreference Handling
Skywork-Reward-Gemma-2-27B is an advanced reward model based on the Gemma-2-27B architecture, specifically designed for preference handling in complex scenarios. It has been trained on 80K high-quality preference pair data from multiple fields, including mathematics, programming, and safety. The model ranked first on the RewardBench leaderboard in September 2024, showcasing its strong capabilities in handling preferences.
Visit

Skywork-Reward-Gemma-2-27B Visit Over Time

Monthly Visits

18200568

Bounce Rate

44.11%

Page per Visit

5.8

Visit Duration

00:05:46

Skywork-Reward-Gemma-2-27B Visit Trend

Skywork-Reward-Gemma-2-27B Visit Geography

Skywork-Reward-Gemma-2-27B Traffic Sources