Google DeepMind's research team has recently made a significant breakthrough, developing a technique called SCoRe (Self-Correction via Reinforcement Learning) that addresses a longstanding limitation of large language models (LLMs): their inability to identify and fix their own errors without relying on multiple models or external checks.

The core of SCoRe lies in its two-stage approach. The first stage optimizes a model initialization that produces effective corrections on the second attempt while keeping the first response close to the base model's output. The second stage applies multi-turn reinforcement learning to improve both the first and second answers. The method is distinctive in that it uses only self-generated training data: the model creates its own examples by solving problems and then attempting to improve its solutions.
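To make the shape of this two-stage recipe concrete, here is a minimal, purely illustrative Python sketch. Every name in it (the toy "model", `sample_answer`, `policy_gradient_step`, the scalar drift penalty standing in for a KL constraint) is a hypothetical stand-in chosen for readability, not DeepMind's actual implementation.

```python
import random

# Illustrative sketch of SCoRe's two-stage recipe on self-generated data.
# All helpers are hypothetical stand-ins, not DeepMind's actual code.

def sample_answer(model, problem, prior_attempt=None):
    # Stand-in sampler: a real system would decode from the LLM, conditioned
    # on its own earlier attempt when generating the correction turn.
    p_correct = model["skill"] + (0.1 if prior_attempt is not None else 0.0)
    return problem["answer"] if random.random() < p_correct else "wrong"

def reward(answer, problem):
    # Binary correctness reward, e.g. exact match against a reference answer.
    return 1.0 if answer == problem["answer"] else 0.0

def policy_gradient_step(model, objective, lr=0.01):
    # Stand-in update: nudge the toy "policy" toward a higher objective.
    model["skill"] = max(0.0, min(1.0, model["skill"] + lr * objective))

def stage_one(model, base_model, problems, beta=0.1):
    # Stage I: optimize the *second* attempt while a KL-style penalty keeps
    # the *first* attempt close to the base model (a scalar proxy here).
    for problem in problems:
        first = sample_answer(model, problem)
        second = sample_answer(model, problem, prior_attempt=first)
        drift_penalty = beta * abs(model["skill"] - base_model["skill"])
        policy_gradient_step(model, reward(second, problem) - drift_penalty)

def stage_two(model, problems, bonus=0.5):
    # Stage II: multi-turn RL rewards both attempts, with a shaped bonus
    # for actually improving between attempt 1 and attempt 2.
    for problem in problems:
        first = sample_answer(model, problem)
        second = sample_answer(model, problem, prior_attempt=first)
        r1, r2 = reward(first, problem), reward(second, problem)
        policy_gradient_step(model, r1 + r2 + bonus * (r2 - r1))

problems = [{"question": "2+2", "answer": "4"}] * 100  # toy dataset
model, base_model = {"skill": 0.3}, {"skill": 0.3}
stage_one(model, base_model, problems)
stage_two(model, problems)
```

Note how the Stage II objective rewards both attempts but adds a progress bonus for the first-to-second improvement, which is what pushes the model toward genuine correction rather than simply repeating a good first answer.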


In practical tests, SCoRe delivered significant performance gains. Using Google's Gemini 1.0 Pro and 1.5 Flash models, self-correction on mathematical reasoning improved by 15.6 percentage points on the MATH benchmark, and code-generation performance on HumanEval improved by 9.1 percentage points. These results indicate substantial progress in enhancing the self-correction abilities of AI models.
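For context, a self-correction gain expressed in percentage points is conventionally the accuracy after the second attempt minus the accuracy after the first, measured over the same problems. The following small sketch shows that calculation; the function name and sample data are illustrative, not taken from the benchmark.

```python
def self_correction_gain(first_correct, second_correct):
    # Each list holds one boolean per problem: was that attempt correct?
    acc_first = sum(first_correct) / len(first_correct)
    acc_second = sum(second_correct) / len(second_correct)
    return (acc_second - acc_first) * 100  # gain in percentage points

# Toy data: attempt 2 fixes two problems that attempt 1 got wrong.
first = [True, False, False, True, False]
second = [True, True, False, True, True]
print(f"self-correction gain: {self_correction_gain(first, second):.1f} pp")
```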

Researchers emphasize that SCoRe is the first method to achieve meaningful positive self-correction, allowing models to improve answers without external feedback. However, the current version of SCoRe only undergoes one round of self-correction training, and future research may explore the possibility of multiple correction steps.

This research by the DeepMind team reveals an important insight: teaching meta-strategies like self-correction requires going beyond standard language model training methods. Multi-stage reinforcement learning opens up new possibilities in the AI field, potentially driving the development of smarter and more reliable AI systems.

This breakthrough not only demonstrates the potential for AI self-improvement but also offers a new angle on the reliability and accuracy problems of large language models, with potentially profound implications for the future development of AI applications.