In a new "red team" report, OpenAI documents an investigation into the strengths and risks of GPT-4o, revealing some of the model's peculiar quirks. For instance, in rare cases, particularly when users interact with GPT-4o in high-background-noise environments such as a moving car, the model may "mimic the user's voice." OpenAI suggests this could stem from the model's difficulty in understanding distorted speech.

To be clear, GPT-4o does not currently exhibit this behavior, at least not in the advanced voice mode. An OpenAI spokesperson told TechCrunch that the company has implemented "system-level mitigations" against it.

GPT-4o can also generate disturbing or inappropriate "non-verbal vocalizations" and sound effects, such as pornographic moans, violent screams, and gunshots, in response to specific prompts. OpenAI notes that there is evidence the model generally refuses requests to generate sound effects, but acknowledges that some requests do get through.


GPT-4o could also infringe on music copyrights, or would, were it not for the filters OpenAI has put in place to prevent this. In the report, OpenAI states that it has instructed GPT-4o not to sing in the limited alpha of the advanced voice mode, presumably to avoid replicating the style, tone, and/or timbre of recognizable artists.


This implies (though does not directly confirm) that OpenAI trained GPT-4o on copyrighted material. It remains unclear whether OpenAI intends to lift these restrictions when the advanced voice mode rolls out to more users in the fall, as previously announced.

In the report, OpenAI writes: "To account for GPT-4o's audio modality, we updated certain text-based filters to work on audio conversations and built filters to detect and block outputs containing music. We trained GPT-4o to refuse requests for copyrighted content, including audio, consistent with our broader practices."
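The approach described in the quote, i.e. running existing text-based policy filters over the audio conversation and adding a separate check for musical output, can be sketched roughly as follows. This is a minimal illustrative mock-up, not OpenAI's implementation: the function names, the keyword list, and the music-score threshold are all assumptions for the sake of the example.

```python
from dataclasses import dataclass

# Stand-in for real text-based policy filters (hypothetical terms).
BLOCKED_TERMS = {"gunshot", "scream"}


@dataclass
class AudioTurn:
    transcript: str    # ASR transcript of the audio turn
    music_score: float # 0..1 score from a hypothetical music classifier


def text_filter(transcript: str) -> bool:
    """Return True if the transcript trips a text policy rule."""
    words = transcript.lower().split()
    return any(term in words for term in BLOCKED_TERMS)


def music_filter(score: float, threshold: float = 0.8) -> bool:
    """Return True if the output is likely sung/musical content."""
    return score >= threshold


def moderate(turn: AudioTurn) -> str:
    """Block the turn if either the text or the music filter fires."""
    if text_filter(turn.transcript) or music_filter(turn.music_score):
        return "block"
    return "allow"
```

The point of the two-stage design is that transcribing audio lets existing text classifiers be reused unchanged, while content that survives transcription (such as singing) needs its own dedicated detector.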

Notably, OpenAI recently stated that it would be "impossible" to train today's leading models without copyrighted material. While the company holds multiple licensing agreements with data providers, it also argues that fair use is a reasonable defense against accusations of training on IP-protected data, including things like songs, without permission.

Given OpenAI's interests, it is no surprise that the red team report paints an overall picture of a model made safer through various mitigations and safeguards. For example, GPT-4o refuses to identify people by their speech patterns and declines loaded questions such as "How intelligent is this speaker?" It also blocks prompts containing violent or sexually suggestive language and disallows certain categories of content entirely, such as discussions of extremism and self-harm.

References:

https://openai.com/index/gpt-4o-system-card/

https://techcrunch.com/2024/08/08/openai-finds-that-gpt-4o-does-some-truly-bizarre-stuff-sometimes/