Meta recently released a new risk policy framework for assessing and mitigating the risks posed by frontier AI models and, if necessary, halting development or restricting the release of such systems. The framework, called the "Frontier AI Framework," describes how Meta will sort AI models into high-risk and critical-risk categories and take corresponding measures to reduce risks to "tolerable levels."
Under the framework, a critical risk means the model could uniquely enable the execution of a specific threat scenario. A high risk means the model could significantly increase the likelihood of a threat scenario being realized, but does not by itself enable its execution. Threat scenarios include the proliferation of biological weapons whose capabilities match those of known biological agents, as well as widespread economic harm to individuals or companies from large-scale, long-term fraud and scams.
For models that reach the critical-risk threshold, Meta will halt development and restrict access to a small group of experts, while implementing safeguards against hacking and data exfiltration to the extent technically and commercially feasible. For high-risk models, Meta will limit access and apply mitigations intended to bring the risk down to a moderate level, at which the model no longer significantly improves an attacker's ability to carry out a threat scenario.
Meta stated that its risk assessment process will involve multidisciplinary experts and internal company leadership to ensure that all relevant perspectives are considered. The framework applies only to the company's most advanced models and systems, those whose capabilities match or exceed the current state of the art.
Meta hopes that sharing its approach to developing advanced AI systems will increase transparency and encourage outside discussion and research on AI evaluation and the science of risk quantification. The company also emphasized that its evaluation and decision-making processes will evolve as the technology matures, including work to ensure that results from its test environments accurately reflect how models perform in real-world deployment.
Key Points:
🌟 Meta has introduced a new risk policy framework to assess and mitigate the risks of frontier AI models.
🔒 For critical-risk models, Meta will halt development and restrict access to a small group of experts; for high-risk models, it will limit access and apply mitigation measures.
🧑‍🏫 The risk assessment process will involve multidisciplinary experts to improve transparency and scientific rigor.