Microsoft Unveils Auto Evol-Instruct AI Framework: Evolving Guidance Datasets with Large Language Models Without Human Intervention

AIbase基地

Published inAI News · 4 min read · Jul 18, 2024

432

Recently, researchers at Microsoft have introduced a novel AI framework called Auto Evol-Instruct, which can automatically evolve instructional datasets without any human intervention.

In the field of artificial intelligence, the development of large language models (LLMs) is crucial, especially in enhancing their ability to follow detailed instructions. Researchers have been exploring ways to improve the datasets used to train LLMs to enhance their performance and adaptability.

Traditional evolutionary methods like Evol-Instruct rely on evolution rules specified by human experts, which are not only costly and time-consuming but also require redesigning methods when adapting to new tasks. In contrast, Auto Evol-Instruct achieves an automated evolution process by initially using LLMs to analyze input instructions and autonomously design initial evolution rules. Subsequently, through iterative optimization by optimizer LLMs, it identifies and resolves issues in the evolution process to ensure the final evolution instructions' complexity and stability.

Auto Evol-Instruct enhances the complexity and diversity of datasets by automatically analyzing input instructions and formulating evolution rules, utilizing LLMs to design evolution methods.

In terms of performance evaluation, Auto Evol-Instruct has performed exceptionally well in multiple benchmark tests. For example, by fine-tuning Mixtral-8x7B with only 10K evolved ShareGPT data, the framework achieved 8.09 points on MT-Bench and 91.4 points on AlpacaEval, surpassing GPT-3.5-Turbo and WizardLM-70B, and matching Claude2.0.

Additionally, by using only 7K evolved GSM8K training data, the framework achieved 82.49 points on GSM8K. In code generation, by fine-tuning DeepSeek-Coder-Base-33B with 20K evolved Code Alpaca, the framework achieved 77.4 points on HumanEval, outperforming other competitive models.

This new framework has demonstrated outstanding performance in multiple benchmark tests, including MT-Bench, AlpacaEval, GSM8K, and HumanEval, showcasing its potential in improving instruction following, mathematical reasoning, and code generation capabilities.

Paper link: https://arxiv.org/abs/2406.00770

Key Points:
🔍 Auto Evol-Instruct is a fully automated AI framework capable of automatically analyzing and evolving instructional datasets without human intervention.
🚀 The framework effectively enhances the complexity and diversity of datasets by optimizing evolution methods, thereby improving the performance and adaptability of LLMs across various tasks.
💡 The research results of Auto Evol-Instruct indicate the effectiveness of automating the evolution of instructional datasets.

AI Headlines

This article is from AIbase Daily

Welcome to the [AI Daily] column! This is your daily guide to exploring the world of artificial intelligence. Every day, we present you with hot topics in the AI field, focusing on developers, helping you understand technical trends, and learning about innovative AI product applications.

—— Created by the AIbase Daily Team

AI News Recommendations

Institution: Downgrade the year-over-year growth rate of AI server shipments in 2025

North American large CSPs remain the main driving force behind the demand for AI servers, supported by tier-2 data centers as well as sovereign cloud projects in the Middle East and Europe. Overall demand remains stable. Driven by the demand from North American CSPs and OEMs, it is expected that AI server shipments will continue to grow at double-digit rates in 2025. However, due to changes in the international situation, the year-over-year growth rate of global AI server shipments in 2025 has been revised downward to 24.3%.

Jul 2, 2025

180

WeChat AI Search Accused of Forced 'Opening the Box' to Name, Turning into Hyperlink Instantly - Tencent Responds: Only Integrates Public Information

The newly launched AI search function in WeChat has attracted widespread attention due to allegations of leaking personal privacy. Recently, many users reported on social platforms that this function can generate a personal resume with a name hyperlink in one click, causing concerns about privacy security among users. According to user feedback, the controversy surrounding WeChat AI Search mainly focuses on its automatic identification mechanism. When users encounter names in WeChat official account articles, the system automatically converts the name into a blue hyperlink. Clicking this link will force the AI system to generate a detailed information page containing personal resume, as well as display all

Jul 2, 2025

210

JD.com's Embodied Intelligence Strategy Accelerates Rapidly, JoyInside Collaboration Map Exposed

According to NetEase Technology, JD.com's layout in the field of embodied intelligence is accelerating rapidly. The embodied intelligence brand JoyInside under JD.com has reached cooperation with more than ten leading robot companies, becoming the core engine for JD.com to seize the smart robot market. According to insiders, JoyInside is supported by JD's large model technology, focusing on providing smart interaction capabilities between robots and consumers. Its product strategy focuses on scenario-based applications such as one person, one dog, and one toy. Since its launch, the brand has successfully attracted leading enterprises from multiple niche fields to join.

Jul 2, 2025

240

Foxconn Launches Its First AI Inference Large Model FoxBrain, Trademark Application Submitted

Recently, Hon Hai Precision Industrial Co., Ltd. (commonly known as Foxconn) submitted a trademark registration application for "FoxBrain" to the Trademark Office of the National Intellectual Property Administration. This AI inference large model is not only Foxconn's first attempt but also the first AI model of this type in Taiwan. According to public information, the international classification of this trademark is scientific instruments, and it is currently in the "waiting for substantive examination" status. "FoxBrain" is an AI inference large model launched by the Hon Hai Research Institute, covering data analysis

Jul 2, 2025

270

Zhipu AI Launches GLM-4.1V-Thinking Open Source! A New Leader in Multimodal Reasoning, Challenging Top Models Worldwide

Jul 2, 2025

270

Zhipu AI Open Sources GLM-4.1V-Thinking: A Breakthrough in Multimodal Reasoning

Zhipu AI officially open-sources its latest general vision model, GLM-4.1V-Thinking, based on the GLM-4V architecture, which introduces a chain-of-thought reasoning mechanism, significantly enhancing its capabilities for complex cognitive tasks. The model supports multimodal inputs such as images, videos, and documents, and excels in diverse scenarios including long video understanding, image question answering, subject problem-solving, text recognition, document interpretation, grounding, GUI Agent, and code generation, covering a wide range of industry application needs. GLM-4.1V-9B-Thinking

Jul 2, 2025

310

AI Daily: Baidu Launches Drawn-Imagine Platform and MuseSteamer; Alibaba's Audio-Driven Full-Body Digital Human Model OmniAvatar

Welcome to the [AI Daily] section! Here is your guide to exploring the world of artificial intelligence every day. Every day, we present you with the latest content in the AI field, focusing on developers, helping you understand technical trends and learn about innovative AI product applications. Click to learn more about new AI products: https://top.aibase.com/1、Open Source End-to-End Speech Large Model Step-Audio-AQAA: Understand audio and directly generate natural speech. Step-Audio-AQAA is an open source end-to-end speech large model,

Jul 2, 2025

250

Ant Group's Medical AI Platform Wins SAIL Award at 2025 World Artificial Intelligence Conference

Jul 2, 2025

220

Foxconn's Parent Company Registers a Trademark for an AI Inference Large Model

Jul 2, 2025

Baidu Launches the HuiXiang Platform and MuseSteamer: AI-Generated Video with a Single Image to Create Professional-Level Movies!

At today's Baidu AI DAY technology open day, Baidu's commercial R&D team officially launched its self-developed video generation model MuseSteamer and the accompanying video product platform **HuiXiang**. This innovation aims to create a comprehensive video generation solution by combining generative AI and multimodal technology, to meet the strong demand for native content production in scenarios such as search, advertising, and recommendations. The MuseSteamer video generation model series is rich, currently including Turbo, Lite, Pro, and

Jul 2, 2025

510

AI News

AI Daily

AI Timeline

Al Hardware

Latest Cases

Image Collection

Video Collection

Audio Collection

Content Collection

Latest Tutorials

AI Product Ranking

AI Traffic Growth Ranking

AI Traffic Decline Ranking

AI Weekly Ranking

United States

China

India

Brazil

Image Generation

Personal Assistant

Character Generation

Video Generation

AI Project Ranking

AI Project Growth Ranking

AI Developer Ranking

AI Organization Ranking

Deepseek

TTS

LLM

ChatGPT

Overview

Microsoft Unveils Auto Evol-Instruct AI Framework: Evolving Guidance Datasets with Large Language Models Without Human Intervention

AIbase基地

This article is from AIbase Daily

AI News Recommendations

Institution: Downgrade the year-over-year growth rate of AI server shipments in 2025

WeChat AI Search Accused of Forced 'Opening the Box' to Name, Turning into Hyperlink Instantly - Tencent Responds: Only Integrates Public Information

JD.com's Embodied Intelligence Strategy Accelerates Rapidly, JoyInside Collaboration Map Exposed

Foxconn Launches Its First AI Inference Large Model FoxBrain, Trademark Application Submitted

Zhipu AI Launches GLM-4.1V-Thinking Open Source! A New Leader in Multimodal Reasoning, Challenging Top Models Worldwide

Zhipu AI Open Sources GLM-4.1V-Thinking: A Breakthrough in Multimodal Reasoning

AI Daily: Baidu Launches Drawn-Imagine Platform and MuseSteamer; Alibaba's Audio-Driven Full-Body Digital Human Model OmniAvatar

Ant Group's Medical AI Platform Wins SAIL Award at 2025 World Artificial Intelligence Conference

Foxconn's Parent Company Registers a Trademark for an AI Inference Large Model

Baidu Launches the HuiXiang Platform and MuseSteamer: AI-Generated Video with a Single Image to Create Professional-Level Movies!