Skywork-Reward-Gemma-2-27B
An advanced reward model based on the Gemma-2-27B architecture
CommonProductProgrammingReward ModelPreference Handling
Skywork-Reward-Gemma-2-27B is an advanced reward model based on the Gemma-2-27B architecture, specifically designed for preference handling in complex scenarios. It has been trained on 80K high-quality preference pair data from multiple fields, including mathematics, programming, and safety. The model ranked first on the RewardBench leaderboard in September 2024, showcasing its strong capabilities in handling preferences.
Skywork-Reward-Gemma-2-27B Visit Over Time
Monthly Visits
19075321
Bounce Rate
45.07%
Page per Visit
5.5
Visit Duration
00:05:32