OpenAI has recently added a "Predicted Outputs" feature to the GPT-4o model. The technique substantially speeds up the model's responses, by up to five times in certain scenarios, offering developers a meaningful gain in efficiency.
The feature, developed in collaboration with FactoryAI, works by skipping the regeneration of content that is already known. It performs especially well in practical tasks such as updating blog posts, iterating on existing responses, or rewriting code. According to data from FactoryAI, response times on programming tasks dropped by a factor of 2 to 4, compressing tasks that previously took around 70 seconds into roughly 20 seconds.
Currently, the feature is available to developers only through the API and supports the GPT-4o and GPT-4o mini models. Early feedback has been positive, with several developers already testing it and sharing their experiences. Eric Ciarla, founder of Firecrawl, noted while converting SEO content: "The speed improvement is significant, and the usage is straightforward."
Technically, Predicted Outputs works by identifying and reusing the parts of the output that are predictable in advance. OpenAI's official documentation gives a code-refactoring example: when renaming a "Username" property to "Email" in a C# class, passing the entire existing class file as the prediction lets the model regenerate only the lines that change, which greatly improves generation speed.
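A minimal sketch of that pattern with the official Python SDK is shown below. The prediction parameter and its {"type": "content"} shape follow OpenAI's documentation; the User class and the prompt wording are illustrative assumptions, not taken from the docs verbatim.

```python
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

# The existing C# class; only the Username property needs to change.
# (Hypothetical example class for illustration.)
code = """public class User
{
    public int Id { get; set; }
    public string Username { get; set; }
}"""

completion = client.chat.completions.create(
    model="gpt-4o",
    messages=[
        {
            "role": "user",
            "content": "Rename the Username property to Email. "
                       "Respond only with the updated code.",
        },
        {"role": "user", "content": code},
    ],
    # Most of the file is unchanged, so we pass it as the prediction;
    # the model then only has to generate the tokens that differ.
    prediction={"type": "content", "content": code},
)

print(completion.choices[0].message.content)
```

The closer the prediction matches the final output, the larger the speedup, which is why this works best for edits that leave most of a file untouched.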
However, the feature comes with some limitations and caveats. Beyond the model restrictions, certain API parameters cannot be used together with Predicted Outputs, including n values greater than 1, logprobs, and presence_penalty or frequency_penalty values greater than 0.
It is also worth noting that the faster responses come at a slightly higher cost. In one user's test, processing time for the same task fell from 5.2 seconds to 3.3 seconds with Predicted Outputs, while the cost rose from 0.1555 cents to 0.2675 cents. The reason is that prediction tokens which do not appear in the final output are still billed at the same rate as completion tokens.
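To see where the extra cost comes from, you can inspect the token accounting the API returns. The sketch below assumes the completion object from the earlier example; the accepted_prediction_tokens and rejected_prediction_tokens fields are part of the usage details the API reports when a prediction is supplied.

```python
# Inspect the token accounting for the previous request.
usage = completion.usage
details = usage.completion_tokens_details

# Prediction tokens that appear in the final output are billed as
# ordinary completion tokens; rejected prediction tokens are billed
# at the same rate on top of that, which is what raises the cost.
print(f"completion_tokens:          {usage.completion_tokens}")
print(f"accepted_prediction_tokens: {details.accepted_prediction_tokens}")
print(f"rejected_prediction_tokens: {details.rejected_prediction_tokens}")
```

A high rejected-token count means the prediction diverged from the actual output, eroding both the speedup and the cost-effectiveness.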
Despite the modest increase in cost, the efficiency gains make the feature well worth considering for many applications. Developers can find more detailed technical explanations and usage guides in OpenAI's official documentation.
OpenAI Official Documentation:
https://platform.openai.com/docs/guides/latency-optimization#use-predicted-outputs