AI News

Don't miss any moment of global AI innovation

AI Daily

Daily three-minute AI industry trends

AI Timeline

AI industry milestones

Al Hardware

Lists all AI hardware products.

AI Monetization Guide

Latest Cases

AI monetization case sharing

Image Collection

AI image creation monetization cases

Video Collection

AI video creation monetization cases

Audio Collection

AI audio creation monetization cases

Content Collection

AI content writing monetization cases

AI Tutorials

Latest Tutorials

Free sharing of the latest AI tutorials

AI Product Rankings

AI Product Ranking

Shows total visits ranking of AI websites

AI Traffic Growth Ranking

Track fastest growing AI websites by traffic

AI Traffic Decline Ranking

Focus on AI websites with significant traffic drops

AI Weekly Ranking

Shows weekly visits ranking of AI websites

Popular Country Rankings

United States

AI websites most popular with US users

China

AI websites most popular with Chinese users

India

AI websites most popular with Indian users

Brazil

AI websites most popular with Brazilian users

Popular Category Rankings

Image Generation

Total visits ranking of AI image generation websites

Personal Assistant

Total visits ranking of AI personal assistant websites

Character Generation

Total visits ranking of AI character generation websites

Video Generation

Total visits ranking of AI video generation websites

Popular Open Source Data Rankings

AI Project Ranking

GitHub popular AI projects by total stars

AI Project Growth Ranking

GitHub popular AI projects by growth rate

AI Developer Ranking

GitHub popular AI developer ranking

AI Organization Ranking

GitHub popular AI organization ranking

Popular Open Source Categories

Deepseek

GitHub popular deepseek open source projects

TTS

GitHub popular TTS open source projects

LLM

GitHub popular LLM open source projects

ChatGPT

GitHub popular ChatGPT open source projects

AI Open Source Project Library

Overview

Overview of GitHub popular AI open source projects

Product Library Tool Navigation

Academic Fraud Busting! Research from Tsinghua and SJTU Upends Understanding: Reinforcement Learning May Hinder Large Model Reasoning

AIbase基地

Published inAI News · 3 min read · Apr 23, 2025

【Research Upends Conventional Wisdom】

A recent joint paper from Tsinghua University and Shanghai Jiao Tong University challenges the widely held industry belief that "pure reinforcement learning (RL) can enhance the reasoning capabilities of large models." The research found that models incorporating reinforcement learning performed worse than their original counterparts in certain tasks.

【Experimental Verification】

The research team conducted systematic experiments in three major areas: mathematics, coding, and visual reasoning:

Mathematical Tasks: In benchmark tests like GSM8K and MATH500, RL models showed improved accuracy at low sampling rates (k-values), but a significant decrease in problem coverage at high k-values.
Coding Tasks: The RLVR-trained model showed improved single-sample pass@1 scores in tests like HumanEval+, but coverage decreased at high sampling rates (k=128).
Visual Reasoning: The Qwen-2.5-VL-7B model showed consistent performance in multimodal tasks, with RL not altering its fundamental problem-solving strategies.

【Academic Controversy】

The research results have sparked heated debate in academia:

Supporters argue that RL improves sampling efficiency but limits the development of reasoning capabilities.
Opponents suggest that the problem may lie in flawed reward structures rather than RL itself.
Neutral viewpoints suggest combining other methods, such as distillation, to enhance reasoning.

【Essential Considerations】

The research team proposes a key distinction:

Capability: The model's potential to solve problems and its logical chains.
Efficiency: The speed and stability of obtaining answers within a given capability.

Reinforcement learning acts more like a "capability regulator" than a "capability creator." It allows models to excel at known tasks but struggles to develop new reasoning pathways.

【Industry Implications】

This research serves as a wake-up call for the overheated RL training trend in large models, suggesting that the industry should:

Pay more attention to the representational capacity and knowledge organization of base models.
Clearly distinguish between capability enhancement and efficiency optimization goals.
Establish a more scientific evaluation system for reasoning capabilities.

ReinforcementLearning(RL)LargeModel TsinghuaUniversity ShanghaiJiaoTongUniversity

This article is from AIbase Daily

Welcome to the [AI Daily] column! This is your daily guide to exploring the world of artificial intelligence. Every day, we present you with hot topics in the AI field, focusing on developers, helping you understand technical trends, and learning about innovative AI product applications.

—— Created by the AIbase Daily Team

AI News Recommendations

Dongfeng Motor Launches Tianyuan Intelligent Technology Brand to Power a Smart Future

Apr 23, 2025

120

Primeiro Modelo de IA de Casamento de Patentes de Universidades em Hangzhou Lançado, Resolvendo o Problema de Patentes Inativas

Apr 23, 2025

110

Penguin Reading Companion, an AI Reading Assistant powered by Tencent's HunYuan Large Model, Officially Launches

Tencent officially launched an AI reading assistant called "Penguin Reading Companion" on World Book Day. This innovative product is powered by Tencent's HunYuan large model and Tencent's Yuanqi platform. Led by Tencent SSV Digital Education Lab, it aims to provide a technologically advanced and engaging reading experience for primary and secondary school students.

Apr 23, 2025

130

Microsoft's New Open-Source Model MAI-DS-R1: Improved Sensitive Topic Response and Reduced Safety Risks

Apr 18, 2025

490

Shanghai AI Laboratory Unveils Upgraded Multimodal Large Model, 'Shusheng · Wanxiang 3.0'

Apr 17, 2025

380

Yiwu Mall Group Integrates Alibaba's Tongyi Large Model to Create AI-Powered Business Assistant

Yiwu Mall Group announced its official integration with Alibaba's Tongyi large language model. Leveraging Alibaba's strengths in cloud computing, big data, and e-commerce, this collaboration will empower 2.1 million small and medium-sized merchants to leverage AI for precise business operations and rapid expansion into overseas markets. This partnership marks a significant step forward in Yiwu Mall Group's digital transformation and globalization strategy, and also highlights Alibaba's crucial role in driving the digital transformation of SMEs.

Apr 17, 2025

180

AI Daily: ChatGPT Launches Major Image Library Feature; Free Access! Veo2 Lands on Google AI Studio; Ant's Treasure Box Launches MCP Zone

Apr 16, 2025

440

National Supercomputing Platform Releases New Generation Multimodal Large Model to Advance AI Agent Development

Apr 16, 2025

180

Xiaopeng Announces In-House Turing AI Chip for Q2 Launch, Supporting 30B-Parameter Large Models

Xiaopeng Motors chairman He Xiaopeng recently announced that the company's fully self-developed Turing AI chip will be mass-produced and launched in the second quarter of this year. This progress comes as the automotive industry accelerates the application of end-to-end intelligent driving technology and the scale of AI large models continues to expand. Xiaopeng Motors is building its strongest AI brain by simultaneously developing a world base model with 35 times the parameters of mainstream VLA models, and a self-developed chip with computing power equivalent to three Nvidia Orin Xs, which is about to be mass-produced.

Apr 15, 2025

200

Tencent Cloud's Large Model Knowledge Engine Upgrades MCP Protocol, Ushering in a New Era for AI Applications

Apr 15, 2025

300