DeepSeek-R1, launched by the DeepSeek team, is the first generation inference model that exhibits exceptional inference capabilities through extensive reinforcement learning training, eliminating the need for supervised fine-tuning. The model excels in mathematical, coding, and reasoning tasks, comparable to the OpenAI-o1 model. Additionally, DeepSeek-R1 offers various distilled models catering to different scalability and performance requirements. Its open-source nature provides robust tools for the research community, supporting commercial use and further development.