DeepEnlighten

Public

Pure RL without SFT to post-train base models for social reasoning capabilities. Lightweight replication of DeepSeek-R1-Zero with Social IQa dataset.

deepseek deepseek-r1 fine-tuning gpt-o1 llm post-training reasoning-language-models reasoning-models reinforcement-learning

Creat：2025-03-12T21:18:28

Update：2025-03-27T03:36:34

Stars

Stars Increase

Related projects

Ollama

Hot

deepseek

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, and other large language models.

138555

1个月前

+150today

Lobe Chat

? Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / DeepSeek / Qwen), Knowledge Base (file upload / knowledge management / RAG ), Multi-Modals (Plugins/Artifacts) and Thinking. One-click FREE deployment of your private ChatGPT/ Claude / DeepSeek application.

59328

2个月前

+3today

LLaMA Factory

agent

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

47621

1个月前

+4today

JeecgBoot

activiti

?「AI 低代码平台」前后端分离 SpringBoot 2.x/3.x，SpringCloud，Ant Design&Vue3，Mybatis，Shiro！强大的代码生成器让前后端代码一键生成，无需写任何代码! 引领AI低代码开发模式 AI生成->OnlineCoding->代码生成->手工MERGE，帮助Java项目解决80%重复工作，让开发更关注业务，提高开发效率、节省成本，同时又不失灵活性

42465

1个月前

+2today

Llama_index

agents

LlamaIndex is the leading framework for building LLM-powered agents over your data.

41206

1个月前

+1today

Unsloth

Hot

deepseek

Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! ?

37521

1个月前

+70today

Pake

chatgpt

?? Turn any webpage into a desktop app with Rust. ?? 利用 Rust 轻松构建轻量级多端桌面应用

37284

1个月前

+2today

Chatgpt On Wechat

Hot

基于大模型搭建的聊天机器人，同时支持微信公众号、企业微信应用、飞书、钉钉等接入，可选择GPT3.5/GPT-4o/GPT-o1/ DeepSeek/Claude/文心一言/讯飞星火/通义千问/ Gemini/GLM-4/Claude/Kimi/LinkAI，能处理文本、语音和图片，访问操作系统和互联网，支持基于自有知识库进行定制企业智能客服。

36488

1个月前

+36488today