GPT-4.5's Brief Reign: Grok-3 from xAI Takes the Crown in Six Hours

AIbase基地

Published inAI News · 3 min read · Mar 4, 2025

Within six hours of its release, OpenAI's GPT-4.5 model soared to the top of the AI leaderboard, claiming the number one spot in all-task classification. However, this reign was short-lived, as Elon Musk's xAI Grok-3 model quickly overtook it, snatching the top position.

Voting data revealed that both GPT-4.5 and Grok-3 received over 3000 votes each, resulting in a final score of 1412 to 1411 – a difference of just one point. While GPT-4.5 excelled in most areas, Grok-3 showed a slight advantage in specific tasks like "style-controlled prompts" and "difficult prompts," leading to its victory.

ChatGPT

Image Source Note: Image generated by AI, licensed through Midjourney.

The rapid turnaround in just six hours sparked skepticism among users, questioning the legitimacy of such a swift change. Industry insiders explained that the leaderboard has a voting threshold; only models reaching 3000 votes within a specific timeframe qualify. The simultaneous achievement of this threshold by both newly released models was, therefore, a coincidence.

Interestingly, despite initial negative feedback, GPT-4.5 saw a significant rise in user approval for its high emotional intelligence. OpenAI CEO Sam Altman even shared a conversation with GPT-4.5, mentioning it was the first time a user had requested he promise not to take the model offline.

Furthermore, GPT-4.5 demonstrated exceptional performance in a unique competition resembling a "large model werewolf" game. In this game, AI models engaged in debate, strategy, and voting, with the winner decided by a jury of eliminated members. GPT-4.5 showcased superior performance in cooperation, deception, and strategic planning, surpassing human capabilities.

All this highlights the intensifying competition in the AI arena, with models constantly innovating and improving within their respective domains. The question of who will ultimately win this battle of intelligence remains to be seen, and warrants continued observation.

GPT-4.5 Grok-3 AI Model Artificial Intelligence Competition

This article is from AIbase Daily

Welcome to the [AI Daily] column! This is your daily guide to exploring the world of artificial intelligence. Every day, we present you with hot topics in the AI field, focusing on developers, helping you understand technical trends, and learning about innovative AI product applications.

—— Created by the AIbase Daily Team

AI News Recommendations

OpenAI's New GPT-4.1 Model Faces Challenges in Alignment

OpenAI recently released its latest AI model, GPT-4.1, claiming superior instruction following. However, independent tests suggest a decline in alignment, i.e., reliability, compared to its predecessor, GPT-4. OpenAI typically releases detailed technical reports including safety evaluations with new models, but hasn't done so this time, explaining that GPT-4.1 is not considered a 'cutting-edge' model.

Apr 24, 2025

Claude-3 surpasses human average IQ, Anthropic leads AI intelligence into a new era

Anthropic's Claude-3 model has achieved a breakthrough in IQ testing, surpassing the human average of 100 for the first time. This marks a milestone in AI development. According to AIbase, Claude-3 outperformed its predecessor in the Norwegian Mensa IQ test, signifying a remarkable leap in AI cognitive abilities. Community analysis suggests this achievement reflects not only Anthropic's technological prowess but also sparks widespread discussion about the future of AI. Related data and predictions are...

Apr 22, 2025

240

OpenAI's New o3 AI Model Shows Increased Hallucination, Raising Accuracy Concerns

OpenAI recently released its latest o3 and o4-mini AI models, which achieve state-of-the-art performance in many areas. However, these new models have not improved upon the issue of 'hallucinations,' exhibiting even more severe instances than previous OpenAI models. 'Hallucinations,' the generation of factually incorrect information by AI models, remain one of the most challenging problems in AI today. Previous generations of models showed improvements in reducing hallucinations; however, o3 and o4-mini have not.

Apr 22, 2025

240

Swiss Researchers Claim AI Can Identify Hidden Locations of Potentially Habitable Planets

The search for another Earth-like planet in the vast universe has been akin to searching for a needle in a haystack. However, a research team from Switzerland has injected powerful new momentum into this epic exploration. They have developed an AI model that acts like a sharp-eyed interstellar detective, able to penetrate the dust and identify unknown corners that may harbor habitable worlds. This is not merely a technological breakthrough, but also a roadmap to the future. In a recent study published in Astronomy & Astrophysics, the scientists detail...

Apr 21, 2025

260

iFlytek's StarFire X1 Receives Major Upgrade: Aims to Rival OpenAI in AI

On April 21st, iFlytek officially announced a significant upgrade to its AI model, StarFire X1, aiming to compete with OpenAI's models in intelligent reasoning and multi-tasking capabilities. This domestically-trained large language model excels in various general tasks, including mathematics, programming, logical reasoning, text generation, language understanding, and knowledge question answering. This upgrade incorporates data from more complex scenarios, significantly improving the model's performance.

Apr 21, 2025

370

xAI Releases Grok3Mini: A Cost-Effective AI Model for Developers

xAI recently unveiled its new language model, Grok3Mini, further advancing efficient AI technology. Designed for speed and affordability, Grok3Mini, despite its smaller size, outperforms many more expensive AI models across various domains, particularly excelling in math, coding, and scientific benchmarks. Grok3Mini: The perfect balance of high performance and low cost. Grok3Mini is part of the Grok3 series, which includes six variants, including the standard Grok3.

Apr 21, 2025

290

Intel Open-Sources AI Playground: Arc GPU-Powered Local AI Model Execution

Intel recently announced the open-sourcing of its AI Playground software, designed for local generative AI. AI Playground provides a powerful platform for running AI models on Intel Arc GPUs. It supports various image and video generation models, as well as Large Language Models (LLMs), significantly lowering the hardware barrier for AI applications by optimizing local computing resources. The project is available on GitHub and has attracted developers and AI enthusiasts worldwide.

Apr 21, 2025

200

OpenAI's o3 Model Test Scores Questioned; Actual Performance Falls Far Short of Claims

OpenAI's recently released o3 AI model has sparked controversy over its benchmark test performance. While OpenAI confidently claimed in December that the model could correctly answer over a quarter of the highly challenging FrontierMath math problems, this assertion starkly contrasts with recent independent test results. The Epoch Institute's independent testing revealed the model achieved only a 10% success rate, significantly lower than advertised.

Apr 21, 2025

210

Intel Open Sources AI Playground for Intel Arc GPUs and Various AI Models

Intel has announced the open-sourcing of its generative AI software, AI Playground, generating significant interest within the AI community. Optimized for Intel Arc GPUs and integrated graphics, AI Playground is described as an 'AI hub' that supports local running of chat-based Large Language Models (LLMs), as well as image and video generation capabilities. This open-sourcing signifies Intel's commitment to advancing the accessibility of generative AI technology.

Apr 21, 2025

160

X-ORIGIN-AI Secures Nearly 100 Million Yuan in Pre-A Round Funding to Advance Affective AI Hardware

X-ORIGIN-AI (Xuan Yuan Technology), a consumer-grade AI robot company, recently announced it has completed a Pre-A round of financing totaling nearly 100 million yuan. The round was led by Oriental Fortune Capital, with participation from Kingtop Capital and Lenovo Star, and Renchen Capital served as the financial advisor. This funding accelerates X-ORIGIN-AI's development in AI hardware and affective interaction, showcasing its breakthroughs in the development of emotionally intelligent AI products. X-ORIGIN-AI is dedicated to breaking the limitations of traditional AI tools and pushing human-computer interaction from a 'tool-based' to an 'emotionally intelligent' approach.

Apr 18, 2025

350

AI News

AI Daily

AI Timeline

Al Hardware

Latest Cases

Image Collection

Video Collection

Audio Collection

Content Collection

Latest Tutorials

AI Product Ranking

AI Traffic Growth Ranking

AI Traffic Decline Ranking

AI Weekly Ranking

United States

China

India

Brazil

Image Generation

Personal Assistant

Character Generation

Video Generation

AI Project Ranking

AI Project Growth Ranking

AI Developer Ranking

AI Organization Ranking

Deepseek

TTS

LLM

ChatGPT

Overview

GPT-4.5's Brief Reign: Grok-3 from xAI Takes the Crown in Six Hours

AIbase基地

This article is from AIbase Daily

AI News Recommendations

OpenAI's New GPT-4.1 Model Faces Challenges in Alignment

Claude-3 surpasses human average IQ, Anthropic leads AI intelligence into a new era

OpenAI's New o3 AI Model Shows Increased Hallucination, Raising Accuracy Concerns

Swiss Researchers Claim AI Can Identify Hidden Locations of Potentially Habitable Planets

iFlytek's StarFire X1 Receives Major Upgrade: Aims to Rival OpenAI in AI

xAI Releases Grok3Mini: A Cost-Effective AI Model for Developers

Intel Open-Sources AI Playground: Arc GPU-Powered Local AI Model Execution

OpenAI's o3 Model Test Scores Questioned; Actual Performance Falls Far Short of Claims

Intel Open Sources AI Playground for Intel Arc GPUs and Various AI Models

X-ORIGIN-AI Secures Nearly 100 Million Yuan in Pre-A Round Funding to Advance Affective AI Hardware