ChainForge-R1-SuperCoT

Public

A multi-stage pipeline that enhances Qwen2.5 language models with DeepSeek Reasoner's chain-of-thought capabilities. Implements the DeepSeek-R1 methodology through cold-start SFT, reasoning-oriented RL, rejection sampling, and optional model distillation.

ai cold-start-sft deepseek deepseek-r1 qwen r1 reasoning training

Creat：2025-01-25T03:13:53

Update：2025-02-24T17:02:19

Stars

Stars Increase

Related projects

AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

174532

2个月前

+36today

Stable Diffusion Webui

Stable Diffusion web UI

151331

9个月前

+41today

Ollama

Hot

deepseek

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, and other large language models.

137582

3周前

+125today

Dify

Hot

agent

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.

92356

3周前

+183today

Open Webui

Hot

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

89841

2个月前

+158today

Supabase

The open source Firebase alternative. Supabase gives you a dedicated Postgres database to build your web, mobile, and AI applications.

81035

3周前

+49today

Generative Ai For Beginners

21 Lessons, Get Started Building with Generative AI ? https://microsoft.github.io/generative-ai-for-beginners/

78455

2个月前

+43today

Lobe Chat

? Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / DeepSeek / Qwen), Knowledge Base (file upload / knowledge management / RAG ), Multi-Modals (Plugins/Artifacts) and Thinking. One-click FREE deployment of your private ChatGPT/ Claude / DeepSeek application.

58926

1个月前

+29today