Recently, OpenAI released the system card for its latest model, GPT-4o: a research document detailing the safety measures and risk assessments the company undertook before launching the model.
The GPT-4o model officially went live in May of this year. Before its release, OpenAI enlisted an external group of security experts to probe for risks, a common practice known as "red teaming." The testers focused primarily on the model's key risks, such as generating unauthorized voice clones, producing obscene or violent content, and reproducing segments of copyrighted audio.
Under OpenAI's own framework, researchers rated the overall risk of GPT-4o as "medium." That level is determined by the highest rating across four main categories: cybersecurity, biological threats, persuasion, and model autonomy. All of these were deemed low risk except persuasion: researchers found that some text generated by GPT-4o was more persuasive at swaying readers' opinions than human-written text, although the model's output was not more persuasive overall.
OpenAI spokesperson Lindsay McCallum Rémy said the system card includes preparedness evaluations created by an internal team together with external testers listed on OpenAI's website, including Model Evaluation and Threat Research (METR) and Apollo Research, both of which build evaluations for AI systems. This is not the first time OpenAI has released a system card; earlier models such as GPT-4, GPT-4 with vision, and DALL-E 3 underwent similar testing, with the results published as research.
However, the system card arrives at a critical juncture, with OpenAI facing ongoing criticism of its safety standards from its own employees and from lawmakers. Just minutes before the GPT-4o system card was published, Massachusetts Senator Elizabeth Warren and Representative Lori Trahan co-signed an open letter demanding answers from OpenAI about how it handles whistleblowers and safety reviews. The letter cited several safety concerns, including CEO Sam Altman's brief ouster in 2023 prompted by the board's worries, and the departure of a safety executive who claimed that "safety culture and processes have taken a backseat to shiny products."
Moreover, releasing a powerful multimodal model shortly before the U.S. presidential election carries clear risks of misinformation and exploitation by malicious actors. Although OpenAI says it tests real-world scenarios to prevent misuse, public demands for transparency are growing. In California in particular, State Senator Scott Wiener is pushing a bill to regulate large language models, including provisions that would hold companies legally liable when their AI is used in harmful ways. If the bill passes, OpenAI's frontier models would have to undergo state-mandated risk assessments before being released to the public.
Key Points:
🌟 OpenAI's GPT-4o model is rated "medium" risk, primarily because of its persuasive capabilities.
🔍 The system card arrives at a critical moment, as OpenAI faces external scrutiny of its safety standards and growing calls for transparency.
🗳️ The timing of the release is sensitive, occurring just before the U.S. presidential election, posing risks of misinformation and potential exploitation by malicious actors.