Steiner-32b-preview

Steiner is a reasoning model trained on synthetic data, designed to explore multiple reasoning paths and verify them autonomously.

CommonProductProductivityReasoning ModelReinforcement Learning
Steiner is a series of reasoning models developed by Yichao 'Peak' Ji, focusing on training on synthetic data through reinforcement learning, capable of exploring multiple paths and autonomously verifying or retracing during reasoning. The model aims to replicate the reasoning capabilities of OpenAI o1 and verify the scaling curve during reasoning. Steiner-preview is an ongoing project, and its open-source nature aims to share knowledge and obtain feedback from more real users. Although the model performs well in some benchmark tests, it has not yet fully achieved the reasoning scaling capabilities of OpenAI o1 and is therefore still under development.
Visit

Steiner-32b-preview Visit Over Time

Monthly Visits

29742941

Bounce Rate

44.20%

Page per Visit

5.9

Visit Duration

00:04:44

Steiner-32b-preview Visit Trend

Steiner-32b-preview Visit Geography

Steiner-32b-preview Traffic Sources

Steiner-32b-preview Alternatives