AIbase
Product LibraryTool Navigation

DeepEnlighten

Public

Pure RL without SFT to post-train base models for social reasoning capabilities. Lightweight replication of DeepSeek-R1-Zero with Social IQa dataset.

Creat2025-03-12T21:18:28
Update2025-03-27T03:36:34
36
Stars
0
Stars Increase

Related projects