DeepScaleR-1.5B-Preview

A large language model optimized by reinforcement learning, focusing on enhancing mathematical problem-solving skills.

DeepScaleR-1.5B-Preview is a large language model fine-tuned with reinforcement learning to improve mathematical problem solving. Trained with distributed reinforcement learning algorithms, it achieves significant accuracy gains in long-context reasoning scenarios. Its key advantages are an efficient training strategy, notable performance gains, and open-source availability. The model was developed by the Sky Computing Lab and the Berkeley AI Research team at the University of California, Berkeley, and aims to advance the use of artificial intelligence in education, particularly mathematics education and competitive mathematics. It is released under the MIT open-source license and is free for researchers and developers to use.

DeepScaleR-1.5B-Preview Alternatives

HunYuan T1 — The industry's first ultra-large-scale hybrid Mamba reasoning model, with strong reasoning capabilities.

HunYuan T1 — An industry-leading deep reasoning large model, optimized for human preferences.

Light-R1 — Light-R1 is an open-source project focusing on long-chain reasoning (Long COT), providing a training method from scratch through curriculum-style SFT, DPO, and RL.

NotaGen — NotaGen is a model for symbolic music generation, employing a large language model training paradigm and focusing on generating high-quality classical music scores.

NovaSky — NovaSky is an AI technology platform focused on code generation and inference model optimization.

Tülu 3 405B — Tülu 3 405B is a large-scale open-source language model enhanced through reinforcement learning.

PaSa — PaSa is an advanced academic paper search agent driven by large language models, capable of autonomous decision-making and obtaining accurate results.

DeepSeek-R1 — DeepSeek-R1 is a high-performance inference model supporting various languages and tasks, suitable for both research and commercial applications.

RLLoggingBoard — A tool for visualizing the reinforcement learning from human feedback (RLHF) training process, helping with deep understanding and debugging.

self-adaptive-llms — A real-time adaptive framework for unseen tasks using large language models.

Meta Motivo — The first virtual humanoid agent control tool based on behavior-based models.

DeepMind — A leading artificial intelligence research company under Google.

DIAMOND — A reinforcement learning agent trained in a diffusion world model

OpenAI Universe — A software platform for measuring and training AI general intelligence

ReFT — ReFT enhances the reasoning ability of LLMs.

Motif — Obtain intrinsic motivation from AI feedback

d1 — Improving the reasoning capabilities of diffusion large language models using reinforcement learning.

ChatTS-14B — A model that enhances time-series understanding and reasoning through synthetic data.

InstantCharacter — InstantCharacter is a character personalization framework based on diffusion transformers.

Wan2.1-FLF2V-14B — Open-source video generation model supporting multiple generation tasks.

Mailgo — AI-powered cold email marketing tool with high deliverability rates.

OpenAI Codex CLI — A lightweight coding agent that runs in the terminal.

Liquid — A multimodal generative model integrating visual understanding and generation.

HiDream — A user-friendly, fully Chinese AIGC creation platform that boosts creativity.

GLM-4-32B — A powerful language model supporting various natural language processing tasks.

GenPRM — Scales the test-time compute of process reward models through generative reasoning.

Amazon Nova Sonic — Amazon's new foundational model understands tone, intonation, and rhythm, enhancing the naturalness of human-computer dialogue.

DeepCoder — An open-source 14B parameter programming model with efficient code reasoning capabilities.

OpenAI Academy — Empowering educators with the knowledge and skills to effectively utilize artificial intelligence.