Recently, OpenAI released a study on its latest reasoning model, o3, demonstrating how large language models (LLMs) can evolve from novice competitive programmers to top-tier global competitors. o3 achieved a score of 2724 on the renowned programming platform CodeForces, placing it in the top 99.8 percentile, showcasing outstanding performance, and it secured a gold medal level achievement in the 2024 International Olympiad in Informatics (IOI).

OpenAI

Image Source Note: The image was generated by AI, with licensing provided by Midjourney.

The study indicates that the o3 model outperformed the o1-ioi model, which was specifically fine-tuned for the IOI competition. This result shows that achievements gained through reinforcement learning surpass those obtained through manually designed solutions. In the IOI 2024 event, o3 competed under standard conditions and successfully crossed the gold medal threshold. Additionally, it ranks among the top 200 programmers globally on CodeForces, capable of competing with elite human programmers.

Associate Professor Ethan Mollick from the Wharton School stated, "The general reasoning capabilities developed through reinforcement learning have now surpassed those of meticulously designed domain-specific solutions. Instead of building specialized systems for specific tasks, it's better to leverage stronger reasoning abilities to achieve better results with large general models."

This research is part of OpenAI's evaluation of its model's performance in competitive programming and the broader software engineering field. Additionally, another company, Anthropic, released a report on Monday regarding the impact of AI on the workplace. The report indicated that about 36% of jobs utilize AI in at least 25% of their tasks, with 57% of AI applications enhancing human capabilities and 43% focusing on automation. Nonetheless, only 4% of jobs use AI in at least 75% of their tasks.

The study also revealed that software development and technical writing are the primary fields for AI applications, while AI's role is relatively limited in tasks that involve physical interaction with the environment.

Key Points:  

💻 The o3 model achieved a score of 2724 on CodeForces, ranking in the top 99.8 percentile, and won a gold medal at the International Olympiad in Informatics.  

📊 The effects of reinforcement learning surpass traditional manually designed solutions, showcasing the advantages of general reasoning capabilities.  

📈 AI is widely applied in the workplace, with software development and technical writing being its main fields, but its application in physical interaction tasks is limited.