OpenAI Launches New Multimodal Content Moderation Model: Based on GPT-4o, Capable of Detecting Text and Images

AIbase基地

Published inAI News · 4 min read · Sep 27, 2024

806

In the digital era, content security issues have increasingly garnered attention. On September 26th, OpenAI officially launched a new multimodal content moderation model named "omni-moderation-latest".

This model is based on the latest GPT-4o technology, capable of accurately identifying and detecting harmful text and images. This update will provide developers with powerful tools, enabling them to build more robust moderation systems.

The highlight of the new model is that it supports moderation of both text and image inputs, especially excelling in handling non-English content.

Compared to previous moderation models, "omni-moderation-latest" not only improves the accuracy of identification but also enhances the ability to detect a wider range of harmful content. It can evaluate categories such as violence, self-harm, and sexual content, ensuring users can communicate in a safer environment.

Since launching the Content Moderation API in 2022, the volume and variety of content that automated moderation systems need to handle have been increasing, especially as more AI applications enter large-scale production. Today, many companies, such as Grammarly and ElevenLabs, are using this API to protect user safety and prevent the creation of inappropriate content.

The advantages of the new model are reflected in several aspects:

Firstly, it can perform multimodal harmful content classification, evaluating combinations of images and text to identify risks related to violence and sexuality.
Secondly, the model has added two new categories of text moderation, specifically related to illegal and violent content, further enhancing its moderation capabilities.
Additionally, the accuracy of detecting non-English content has significantly improved, with tests showing a 42% increase in accuracy across 40 languages, particularly outstanding in low-resource languages.

For developers, this new moderation model remains accessible as a free Content Moderation API. OpenAI hopes this upgrade will allow more users to leverage the latest research and security systems, creating a more friendly online experience for users.

Official blog: https://openai.com/index/upgrading-the-moderation-api-with-our-new-multimodal-moderation-model/

Key Points:
📊 The new model "omni-moderation-latest" is based on GPT-4o technology, supporting multimodal moderation of text and images.
🌍 Detection accuracy has improved by 42% for 40 languages, particularly outstanding in low-resource languages.
🔒 Added two new categories of text moderation, enhancing the ability to identify illegal and violent content.

Google Releases Gemma4 Open Source Model: Adopting the Apache License to Fully Unleash Developer Productivity

Google has released the new open source AI model Gemma4, which adopts the Apache 2.0 license, replacing previous restrictive agreements. This allows developers to freely use, modify, and distribute the model, facilitating commercial applications. The model achieves dual upgrades in technical architecture, improving performance and ecosystem compatibility.

Domestic LLM Toolchain Upgraded Again! Open Source LLMOps Platform Maxkb4j v2.6.0 Officially Released

Maxkb4j v2.6.0 enhances its open-source LLMOps platform with improved skill expansion, security authentication, and system stability. Key updates include new Shell tools, system message integration, and Webhook authentication, empowering developers with advanced LLM workflow and RAG capabilities.....

Say Goodbye to AI Standard Faces! Alibaba Wan2.7-Image Released: Can Write A4 Paper Essays and Achieve Pixel-Level Face Customization

Alibaba released the Wan2.7-Image model, breaking through the limitations of traditional AI image generation, saying goodbye to the 'standard face' and achieving a 'different face for each person'. The model enhances the virtual character face customization function, supporting comprehensive customization from bone structure, eyes, to facial details. It allows precise control over facial features such as face shape and eye shape, enhancing visual effects and personalized experience.

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

GEO Brand Visibility

AI Visibility Audit

AI Search Visibility Checker

GEO Promotion Link Detection

GEO Ranking Optimization System

GEO Services​

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

LLM API Hub

AI Models Finder

Model Providers

LLM Leaderboard

Compare LLMs

LLM Cost Calculator

LLM Arena

AI Model Compatibility Checker

AI Deployment Calculator

OpenAI Launches New Multimodal Content Moderation Model: Based on GPT-4o, Capable of Detecting Text and Images

AIbase基地

This article is from AIbase Daily

AI News Recommendations

Powered by the Apache 2.0 License! Google Gemma 4 is Now Open Source: 31B Parameters Performance Approaches Leading Large Models

Apple Collaborates with the University of Hong Kong to Launch the LGTM Rendering Framework, Breaking the 4K Video Quality Bottleneck

Rejecting Compute Anxiety! Apple's LGTM Framework Launches: Enabling 4K-Grade 3D Rendering to Take Off on Vision Pro

Google Releases Gemma4 Open Source Model: Adopting the Apache License to Fully Unleash Developer Productivity

Google Officially Launches Gemma4 Open-Source Large Model: Available in Four Specifications, 31B Version Ranks Third in Global Open-Source List

Domestic LLM Toolchain Upgraded Again! Open Source LLMOps Platform Maxkb4j v2.6.0 Officially Released

Google Open-Sources Large Model Gemma 4: Official Announcement Imminent: Parameter Count Increases by 4 Times

The Era of Bank Inclusive AI: OpenAI Teams Up with Gradient Labs to Create a Digital Customer Manager

Say Goodbye to AI Standard Faces! Alibaba Wan2.7-Image Released: Can Write A4 Paper Essays and Achieve Pixel-Level Face Customization

No. 5 in the World! Xiaomi MiMo-V2-Pro Tops Text Arena, Lei Jun: This Time, No Need to Look at the Rankings, See User Votes

AI News Recommendations

Powered by the Apache 2.0 License! Google Gemma 4 is Now Open Source: 31B Parameters Performance Approaches Leading Large Models

Apple Collaborates with the University of Hong Kong to Launch the LGTM Rendering Framework, Breaking the 4K Video Quality Bottleneck

Rejecting Compute Anxiety! Apple's LGTM Framework Launches: Enabling 4K-Grade 3D Rendering to Take Off on Vision Pro

Google Releases Gemma4 Open Source Model: Adopting the Apache License to Fully Unleash Developer Productivity

Google Officially Launches Gemma4 Open-Source Large Model: Available in Four Specifications, 31B Version Ranks Third in Global Open-Source List

Domestic LLM Toolchain Upgraded Again! Open Source LLMOps Platform Maxkb4j v2.6.0 Officially Released

Google Open-Sources Large Model Gemma 4: Official Announcement Imminent: Parameter Count Increases by 4 Times

The Era of Bank Inclusive AI: OpenAI Teams Up with Gradient Labs to Create a Digital Customer Manager

Say Goodbye to AI Standard Faces! Alibaba Wan2.7-Image Released: Can Write A4 Paper Essays and Achieve Pixel-Level Face Customization

No. 5 in the World! Xiaomi MiMo-V2-Pro Tops Text Arena, Lei Jun: This Time, No Need to Look at the Rankings, See User Votes

GEO Services