s1-32B

s1 is an inference model fine-tuned based on Qwen2.5-32B-Instruct, trained with only 1,000 samples.

CommonProductProductivityText GenerationInference Model
s1 is an inference model that focuses on achieving efficient text generation capabilities with a limited set of samples. It scales during testing using budget enforcement techniques, capable of matching the performance of o1-preview. Developed by Niklas Muennighoff et al., the related research is published on arXiv. The model employs Safetensors technology, boasts 32.8 billion parameters, and supports text generation tasks. Its main advantage lies in achieving high-quality reasoning through a limited number of samples, making it suitable for scenarios requiring efficient text generation.
Visit

s1-32B Visit Over Time

Monthly Visits

29742941

Bounce Rate

44.20%

Page per Visit

5.9

Visit Duration

00:04:44

s1-32B Visit Trend

s1-32B Visit Geography

s1-32B Traffic Sources

s1-32B Alternatives