Making Large Models Understand You Better: Tencent and Shanghai Jiao Tong University Join Forces to Decode the Secrets of Instruction Tuning

AIbase基地

Published in AI News · 4 min read · Aug 16, 2024

As large models grow more capable, instruction tuning is key to making them understand what users actually need. Researchers from Tencent Youtu Lab and Shanghai Jiao Tong University have jointly published an in-depth review on how to evaluate and select instruction tuning datasets, a topic central to improving large model performance.

Large models aim to handle natural language tasks well, and instruction tuning is a crucial step in getting there. The review analyzes how to evaluate and select datasets so that tuned models perform well across a variety of tasks.

The review is lengthy and covers more than 400 related publications, organizing its guidance along three dimensions: data quality, diversity, and importance.

Data quality directly affects how well instruction tuning works. The review covers several families of evaluation methods: manually designed metrics, model-based metrics, automated scoring with GPT, and human evaluation, which remains indispensable.
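To make the idea of manually designed metrics concrete, here is a minimal sketch of a rule-based quality filter for instruction-response pairs. The features and thresholds (`min_response_words`, `min_unique_ratio`) are illustrative assumptions, not metrics taken from the review.

```python
def quality_features(instruction, response):
    """Compute a few simple hand-designed features for one training pair."""
    words = response.split()
    n_words = len(words)
    # Share of distinct words; a low value suggests a repetitive response.
    unique_ratio = len({w.lower() for w in words}) / n_words if n_words else 0.0
    return {
        "instruction_words": len(instruction.split()),
        "response_words": n_words,
        "unique_word_ratio": unique_ratio,
    }


def passes_filter(instruction, response, min_response_words=3, min_unique_ratio=0.5):
    """Keep a pair only if it clears every threshold (thresholds are assumptions)."""
    f = quality_features(instruction, response)
    return (
        f["instruction_words"] > 0
        and f["response_words"] >= min_response_words
        and f["unique_word_ratio"] >= min_unique_ratio
    )
```

A filter like this is cheap to run over a whole dataset, which is why rule-based checks are often applied before the more expensive model-based or human evaluation.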

Diversity assessment looks at how rich a dataset is in its vocabulary, its semantics, and its overall data distribution. A diverse dataset helps the model generalize across different scenarios.
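Vocabulary-level diversity can be approximated with a distinct-n ratio: the number of unique n-grams divided by the total number of n-grams in the dataset. The sketch below assumes simple whitespace tokenization and is one common illustration, not necessarily the measure the review recommends.

```python
def distinct_n(texts, n=1):
    """Return the fraction of unique n-grams among all n-grams in the texts."""
    total = 0
    unique = set()
    for text in texts:
        tokens = text.lower().split()  # naive whitespace tokenization (assumption)
        ngrams = [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]
        total += len(ngrams)
        unique.update(ngrams)
    # An empty dataset has no n-grams; report zero diversity instead of dividing by zero.
    return len(unique) / total if total else 0.0
```

Values close to 1 mean that few n-grams repeat, while values close to 0 point to heavy repetition. Semantic and distribution-level diversity need other tools, such as embedding-based measures.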

Importance assessment identifies the samples that matter most for model training. Selecting them improves training efficiency and helps the model stay stable and accurate on complex tasks.
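Once each sample has an importance score, selection can be as simple as keeping the highest-scoring samples within a budget. The sketch below assumes the scores are already computed; how to compute them is the hard part and is not shown. The function name `select_top_k` and the sample data are hypothetical.

```python
import heapq


def select_top_k(scored_samples, k):
    """Return the k samples with the highest importance scores, best first."""
    top = heapq.nlargest(k, scored_samples, key=lambda pair: pair[1])
    return [sample for sample, _score in top]


# Hypothetical (sample, score) pairs used only for illustration.
scored = [("sample_a", 0.12), ("sample_b", 0.87), ("sample_c", 0.45)]
selected = select_top_k(scored, k=2)
```

Pure top-k selection ignores redundancy among the chosen samples, so in practice it would likely be combined with a diversity criterion.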

Although existing research has made progress, the authors point out open challenges, such as the weak correlation between data selection and model performance and the lack of a unified standard for evaluating instruction quality.

Looking ahead, the authors call for dedicated benchmarks for evaluating instruction-tuned models and for more interpretable selection pipelines that can adapt to different downstream tasks.

This research by Tencent Youtu Lab and Shanghai Jiao Tong University not only provides us with a valuable resource but also points the way forward for the development of large models. With continuous technological advancements, we have reason to believe that large models will become more intelligent and better serve humanity.

This article is from AIbase Daily
