Renowned AI researcher Andrej Karpathy recently stirred up a contentious debate by suggesting that Reinforcement Learning from Human Feedback (RLHF), currently held in high regard, may not be the path to truly human-level problem-solving ability. The statement landed like a bombshell in the AI research community.

RLHF was once considered the key to the success of large language models like ChatGPT, hailed as the "secret weapon" that gave AI the ability to understand instructions, follow them, and interact naturally. In the standard training pipeline, RLHF is typically the final stage, applied after pre-training and supervised fine-tuning (SFT). Karpathy, however, called RLHF a "bottleneck" and a "stopgap measure," arguing that it is far from the ultimate solution for AI's evolution.

To make his point, Karpathy contrasted RLHF with DeepMind's AlphaGo. AlphaGo used what he calls "true RL" (reinforcement learning): by playing against itself and maximizing its win rate, it optimized its neural networks directly from game outcomes and eventually surpassed top human players without human intervention.
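
To see what "optimizing directly from game outcomes" means in miniature, here is a small self-contained sketch (not Karpathy's or DeepMind's code): tabular Q-learning through self-play on the toy game Nim, where the only training signal is winning or losing. The game, hyperparameters, and tabular setup are stand-ins chosen for brevity; AlphaGo itself used deep policy and value networks plus tree search.

```python
import random

# Tabular Q-learning through self-play on Nim (take 1-3 stones; the player
# who takes the last stone wins). The only reward is the game outcome, with
# no human preference data anywhere in the loop.
N_STONES = 10
ACTIONS = (1, 2, 3)
Q = {}  # Q[(stones_remaining, action)] = value for the player about to move

def legal_actions(stones):
    return [a for a in ACTIONS if a <= stones]

def q(stones, action):
    return Q.get((stones, action), 0.0)

def best_value(stones):
    return max(q(stones, a) for a in legal_actions(stones))

def choose(stones, eps):
    acts = legal_actions(stones)
    if random.random() < eps:
        return random.choice(acts)                 # explore
    return max(acts, key=lambda a: q(stones, a))   # exploit

ALPHA, EPS = 0.5, 0.2
for _ in range(20000):                             # self-play episodes
    stones = N_STONES
    while stones > 0:
        a = choose(stones, EPS)
        remaining = stones - a
        if remaining == 0:
            target = 1.0                           # taking the last stone wins
        else:
            target = -best_value(remaining)        # zero-sum: opponent moves next
        Q[(stones, a)] = q(stones, a) + ALPHA * (target - q(stones, a))
        stones = remaining                         # switch roles, same Q-table

# The learned policy converges to the known optimal strategy for this game:
# always leave the opponent a multiple of 4 stones.
policy = {s: max(legal_actions(s), key=lambda a: q(s, a)) for s in range(1, N_STONES + 1)}
print(policy)
```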

In contrast, Karpathy argues that RLHF is more about mimicking human preferences than genuinely solving problems. He imagined what training AlphaGo with RLHF would look like: human evaluators would compare large numbers of board positions and pick the ones they prefer, a process that might take on the order of 100,000 comparisons to train a "reward model" imitating the human "vibe check." In a game as rigorous as Go, judgments based on vibes could easily be misleading.
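
For contrast, the sketch below shows the shape of the reward-modeling step Karpathy describes, roughly the pairwise-preference objective from the Christiano et al. paper linked at the end. The synthetic data, the linear "reward model," and every hyperparameter here are illustrative assumptions, not anyone's actual training setup.

```python
import numpy as np

# Fit a reward model from pairwise human preferences (Bradley-Terry style):
# given two candidate outputs, predict which one raters prefer.
rng = np.random.default_rng(0)
DIM, N_PAIRS = 8, 100_000              # roughly the number of comparisons cited above

true_w = rng.normal(size=DIM)          # hidden "human preference" direction
xa = rng.normal(size=(N_PAIRS, DIM))   # features of candidate A in each pair
xb = rng.normal(size=(N_PAIRS, DIM))   # features of candidate B

# Simulated noisy human labels: A is preferred with prob sigmoid(true score gap)
gap = (xa - xb) @ true_w
labels = (rng.random(N_PAIRS) < 1 / (1 + np.exp(-gap))).astype(float)

w = np.zeros(DIM)                      # reward-model parameters
lr = 0.1
for _ in range(200):                   # batch gradient descent on the pairwise loss
    pred_gap = (xa - xb) @ w
    p = 1 / (1 + np.exp(-pred_gap))    # model's probability that A is preferred
    grad = ((p - labels)[:, None] * (xa - xb)).mean(axis=0)
    w -= lr * grad

# The learned reward direction recovers the raters' (noisy) preference axis,
# i.e. a statistical proxy for what humans tend to like.
print("cosine(true_w, w) =", true_w @ w / (np.linalg.norm(true_w) * np.linalg.norm(w)))
```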

The reward models used for today's LLMs work the same way: they rank highly the answers that human evaluators statistically tend to prefer. That makes them a proxy for surface-level human preferences rather than a true measure of problem-solving ability. More worrying still, the model being trained can quickly learn to exploit the reward function instead of genuinely improving its capabilities.
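
Here is a toy numerical illustration of that exploitation concern, with invented numbers and a deliberately over-simplified "reward model": the proxy score keeps rising as the answer gets longer, while the true quality that raters actually care about peaks early and then collapses.

```python
# Invented toy example of reward over-optimization ("reward hacking").
# The learned reward model spuriously over-credits answer length, so a policy
# optimized against it keeps padding its answers.

def true_quality(length):
    return length - 0.02 * length ** 2   # some detail helps, padding hurts

def proxy_reward(length):
    return 1.5 * length                  # spurious "longer looks better" cue

for length in range(0, 101, 20):
    print(f"length={length:3d}  proxy={proxy_reward(length):6.1f}  true={true_quality(length):6.1f}")
```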

Karpathy noted that while reinforcement learning works well in closed environments like Go, it remains hard to apply to open-ended language tasks, mainly because it is difficult to define clear objectives and reward functions for them. "How do you give an objective reward for summarizing an article, answering a vague question about pip installation, telling a joke, or rewriting Java code into Python?" he asked, adding, "Moving in this direction is not impossible in principle, but it is certainly not easy; it requires some creative thinking."

Nevertheless, Karpathy believes that if this challenge can be overcome, language models could truly match or even surpass human problem-solving ability. His view echoes a recent Google DeepMind paper arguing that open-endedness is foundational to artificial general intelligence (AGI).

One of several senior AI researchers to leave OpenAI this year, Karpathy is now focused on his AI education startup. His remarks add a fresh line of thinking to the field and offer valuable perspective on where AI development should head next.

Karpathy's comments have sparked wide discussion in the industry. Supporters say he has put his finger on a critical issue in current AI research: how to make AI genuinely capable of solving complex problems rather than merely imitating human behavior. Critics worry that abandoning RLHF prematurely could send AI development off course.

Paper link (Christiano et al., "Deep Reinforcement Learning from Human Preferences"): https://arxiv.org/pdf/1706.03741