A recent study suggests that, with targeted training, language models can partly internalize multi-step reasoning and carry it out more efficiently. This deliberate mode of processing resembles the "System 2" thinking described by psychologist Daniel Kahneman: a slow, conscious way of processing information, in contrast to fast, intuitive "System 1" responses.
Researchers at Meta have developed a method that "distills" the computationally expensive multi-step reasoning process into the parameters of a language model. Their results show that, in some cases, models trained this way match the performance of the original multi-step process at a far lower computational cost.
The distillation method works in three steps: first, apply a multi-step reasoning method to a large set of example inputs; next, filter the outputs and keep only those that are highly self-consistent; finally, use the surviving input-answer pairs to fine-tune the language model. In essence, the method generates synthetic training data that teaches the model to produce the final answer directly, without the intermediate reasoning steps.
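To make the pipeline concrete, here is a minimal sketch of what the filtering and data-generation stage might look like. It is illustrative only: the `sample_system2_answers` callable, the majority-vote consistency check, and the `min_agreement` threshold are assumptions made for this sketch, not details taken from the study.

```python
from collections import Counter
from typing import Callable, Iterable

def build_distillation_data(
    prompts: Iterable[str],
    sample_system2_answers: Callable[[str, int], list[str]],
    n_samples: int = 8,
    min_agreement: float = 0.75,
) -> list[dict]:
    """Turn multi-step ("System 2") outputs into direct-answer training data.

    `sample_system2_answers(prompt, n)` is assumed to run some multi-step
    reasoning method n times and return only the final answers; the
    intermediate reasoning traces are deliberately discarded.
    """
    dataset = []
    for prompt in prompts:
        answers = sample_system2_answers(prompt, n_samples)
        top_answer, count = Counter(answers).most_common(1)[0]
        # Self-consistency filter: keep an example only when most of the
        # sampled reasoning runs converge on the same final answer.
        if count / len(answers) >= min_agreement:
            dataset.append({"input": prompt, "output": top_answer})
    return dataset
```

The resulting input-answer pairs would then feed an ordinary supervised fine-tuning run, so the model learns to emit the final answer in a single step.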
The researchers applied this approach to four different multi-step reasoning techniques across five types of tasks. In many cases it improved model performance effectively, but it did not work in every scenario.
For tasks such as reducing bias or improving response quality, for example, the distilled models performed as well as the multi-step methods while consuming far fewer computational resources. On complex mathematical reasoning tasks, however, the approach failed; the researchers speculate that some tasks are simply too complex to be collapsed into single-step reasoning.
Nevertheless, the researchers see the method as a promising direction for building more capable language processing systems. In the future, it could be combined with other techniques, freeing expensive multi-step reasoning for the problems that genuinely require it.
This study opens a new path toward improving the reasoning capabilities of language models and could lead to advances across a range of applications.