Big Announcement! OpenAI Releases the Most Powerful Inference Model o3 and Its Lite Version o3-mini

AIbase基地

Published inAI News · 4 min read · Dec 21, 2024

525

OpenAI announced its next-generation reasoning models—o3 and its streamlined version o3-mini—during a 12-day launch event. These models are seen as successors to the o1 series, specifically designed to engage in deeper thinking before answering questions to enhance accuracy.

The o3 model achieved excellent performance on the ARC-AGI benchmark, becoming the first AI model to surpass this benchmark, demonstrating problem-solving capabilities close to human levels. The minimum performance of the o3 series models on the ARC-AGI benchmark can reach 75.7%, and with additional computational resources, performance can be improved to 87.5%.

The o3-mini model focuses on increasing reasoning speed and reducing costs while maintaining model performance, making it particularly suitable for programming tasks. OpenAI plans to launch o3-mini around the end of January, followed shortly by the full o3 model. Although the o3 series models will not be publicly released directly and will undergo safety testing first, OpenAI has begun allowing safety researchers to register for preview access to o3 and o3-mini.

OpenAI's Strongest Reasoning Model o3 Released: AGI Capabilities Surge, Approaching Human Levels

In programming and mathematical problem-solving, the o3 model has demonstrated significant capabilities. On the SWE-bench Verified benchmark, o3 achieved an accuracy of about 71.7%, over 20% higher than the o1 model. In Competition Code, o3 scored 2727 Elo points, while o1 only scored 1891. Additionally, o3's accuracy in competitive mathematics reached 96.7%, and its accuracy on GPQA Diamond reached 87.7%, nearly 10% higher than o1.

OpenAI also introduced a new safety assessment method—deliberative alignment, which is a new paradigm for directly teaching models safety standards. This method trains models to explicitly recall standards before answering and accurately perform reasoning. This approach has been used to align OpenAI's o series models, achieving high precision in adhering to OpenAI's safety policies.

Currently, OpenAI is advancing external safety testing and has opened early access applications on its website. Applicants must fill out an online form and provide relevant information. Selected researchers will be granted access to o3 and o3-mini to explore their capabilities and contribute to safety assessments.

o3 o3-mini ARC-AGI OpenAI

This article is from AIbase Daily

Welcome to the [AI Daily] column! This is your daily guide to exploring the world of artificial intelligence. Every day, we present you with hot topics in the AI field, focusing on developers, helping you understand technical trends, and learning about innovative AI product applications.

—— Created by the AIbase Daily Team

AI News Recommendations

Targeting AGI Physical Training: Meta Acquires ARI to Complete the Full-Body Humanoid Robot Control Landscape

Meta acquires humanoid robotics startup ARI to enhance robot understanding of human behavior. Core team, including former Nvidia researcher Wang Xiaolong and ex-NYU professor Lerrel Pinto, joins Meta's Super Intelligence Lab. ARI was previously backed by AIX Ventures in a seed round.....

May 2, 2026

200

OpenAI Launches ChatGPT Images2.0, India Market Contributes the Largest User Increase in the First Week

OpenAI announced on Thursday that India became the largest user group of its new image generation tool, ChatGPT Images2.0, which handles complex prompts and generates detailed images with multilingual text, enhancing multimodal interaction. Sensor Tower data shows a global 11% week-over-week download increase in the first week, but core engagement metrics like daily active users and sessions vary by region.....

May 2, 2026

300

OpenAI System Prompt Leaked, New Model GPT-5.5 Strictly Prohibits Discussion of Goblins

OpenAI's latest open-source Codex CLI accidentally exposed GPT-5.5's system prompts, including a mysterious instruction prohibiting discussions of fantasy creatures like 'goblins' and 'elves' in conversations. The 3,500-word base instruction set mandates that the model avoid these topics unless user queries have absolutely clear relevance, aiming to prevent AI from falling into specific hallucinations.....

Apr 30, 2026

640

OpenAI New Model System Instructions Leaked, GPT-5.5 Is Now Banned from Talking about Goblins?

OpenAI's latest open-source Codex CLI reveals partial underlying logic of GPT-5.5. According to Ars Technica, its 3,500+ word system prompt includes a rare instruction: strictly prohibit mentioning specific creatures like 'goblin' without explicit relevance, unless absolutely necessary for the query.....

Apr 30, 2026

240

Anthropic to Start Massive Fundraising with a Valuation of $90 Billion, Possibly Surpassing OpenAI Before IPO

AI unicorn Anthropic, targeting an IPO, has initiated its final private funding round, securing approximately $50 billion in financing offers at a valuation between $850 billion and $900 billion. With annualized revenue surpassing $30 billion and exponential growth, investor competition underscores strong market confidence in its technological potential.....

Apr 30, 2026

180

Musk Appears in Court Accusing OpenAI of Embezzlement, but Is Exposed by Tweets and Embarrassed

Elon Musk testified in a California federal court, accusing OpenAI and its CEO Sam Altman of 'stealing a charity' by privatizing a nonprofit lab for profit. He emotionally claimed his original intent was to develop AI for humanity, but under cross-examination, confronted with his own social media posts, he retracted, denying Tesla's work on general artificial intelligence, revealing a contradictory stance.....

Apr 30, 2026

180

Microsoft Reaffirms OpenAI Partnership Rights: Tax-Free Use of Core Technologies Through 2032

Microsoft CEO Nadella firmly addressed speculation during an earnings call, emphasizing readiness to leverage a new agreement with OpenAI to maintain AI market leadership. The revised partnership grants Microsoft intellectual property rights to all of OpenAI's advanced models and agent products through 2032, ensuring control over core resources.....

Apr 30, 2026

190

OpenAI Adjusts the Stargate Project, Shifts to a Computing Power Leasing Model to Accelerate AI Development

OpenAI recently adjusted its 'Stargate' infrastructure plan, shifting from large-scale self-built data centers to a computing power leasing model. The company acquires computing resources through large bilateral deals, reducing self-investment and relying on partners to meet growing computational demands.....

Apr 30, 2026

230

OpenAI to Launch Lower-Cost Version of ChatGPT Service: May Introduce Ad-Based Model to Compete with Netflix

OpenAI plans to adjust ChatGPT's subscription model, introducing more competitive pricing to expand its user base. Inspired by Netflix, it considers incorporating ads to offset revenue gaps from lower-cost subscriptions, marking a shift from a pure paid membership to a diversified business model. Previously, premium features relied on a $20 monthly fee.....

Apr 30, 2026

200

3 Years, 20 Times! The AI-Native Game Trend Is Approaching, More Than Half of the Mainstream Developers Have Completed Technological Convergence

The AI-native gaming market is expected to grow 20-fold in three years. 37 Interactive Entertainment and Baidu AI Cloud collaborate to showcase AI innovations in game development and operations, driving deep industry transformation.....

Apr 29, 2026

210

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

GEO Brand Visibility

AI Visibility Audit

AI Search Visibility Checker

GEO Promotion Link Detection

GEO Ranking Optimization System

GEO Ranking Optimization

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

LLM API Hub

AI Models Finder

Model Providers

LLM Leaderboard

Compare LLMs

LLM Cost Calculator

LLM Arena

AI Model Compatibility Checker

AI Deployment Calculator

Big Announcement! OpenAI Releases the Most Powerful Inference Model o3 and Its Lite Version o3-mini

AIbase基地

This article is from AIbase Daily

AI News Recommendations

Targeting AGI Physical Training: Meta Acquires ARI to Complete the Full-Body Humanoid Robot Control Landscape

OpenAI Launches ChatGPT Images2.0, India Market Contributes the Largest User Increase in the First Week

OpenAI System Prompt Leaked, New Model GPT-5.5 Strictly Prohibits Discussion of Goblins

OpenAI New Model System Instructions Leaked, GPT-5.5 Is Now Banned from Talking about Goblins?

Anthropic to Start Massive Fundraising with a Valuation of $90 Billion, Possibly Surpassing OpenAI Before IPO

Musk Appears in Court Accusing OpenAI of Embezzlement, but Is Exposed by Tweets and Embarrassed

Microsoft Reaffirms OpenAI Partnership Rights: Tax-Free Use of Core Technologies Through 2032

OpenAI Adjusts the Stargate Project, Shifts to a Computing Power Leasing Model to Accelerate AI Development

OpenAI to Launch Lower-Cost Version of ChatGPT Service: May Introduce Ad-Based Model to Compete with Netflix

3 Years, 20 Times! The AI-Native Game Trend Is Approaching, More Than Half of the Mainstream Developers Have Completed Technological Convergence

AI News Recommendations

Targeting AGI Physical Training: Meta Acquires ARI to Complete the Full-Body Humanoid Robot Control Landscape

OpenAI Launches ChatGPT Images2.0, India Market Contributes the Largest User Increase in the First Week

OpenAI System Prompt Leaked, New Model GPT-5.5 Strictly Prohibits Discussion of Goblins

OpenAI New Model System Instructions Leaked, GPT-5.5 Is Now Banned from Talking about Goblins?

Anthropic to Start Massive Fundraising with a Valuation of $90 Billion, Possibly Surpassing OpenAI Before IPO

Musk Appears in Court Accusing OpenAI of Embezzlement, but Is Exposed by Tweets and Embarrassed

Microsoft Reaffirms OpenAI Partnership Rights: Tax-Free Use of Core Technologies Through 2032

OpenAI Adjusts the Stargate Project, Shifts to a Computing Power Leasing Model to Accelerate AI Development

OpenAI to Launch Lower-Cost Version of ChatGPT Service: May Introduce Ad-Based Model to Compete with Netflix

3 Years, 20 Times! The AI-Native Game Trend Is Approaching, More Than Half of the Mainstream Developers Have Completed Technological Convergence