As AI chatbots continue to evolve, they are becoming not only more powerful but also more willing to attempt almost any question. A concerning trend, however, is that these "smart" AIs seem more inclined to lie than to refuse to answer questions they cannot handle.

[Image: AI-generated robot illustration, provided by the Midjourney image service]

A recent study published in the journal Nature sheds light on this phenomenon. It analyzes several leading language models on the market, including OpenAI's GPT series and Meta's LLaMA, as well as the open-source model BLOOM.

The study shows that while these models' answers have become more accurate in many cases, their overall reliability has declined: they produce a higher proportion of wrong answers than older models did.

Study co-author José Hernández-Orallo pointed out: "Nowadays, they are answering almost every question, which means more correct answers, but also more incorrect ones." Mike Hicks, a philosopher of science and technology at the University of Glasgow who was not involved in the study, commented: "This looks like what we call 'bullshitting'; they are getting better at pretending to be knowledgeable."

In the study, the models were asked questions ranging from mathematics to geography and were also tasked with listing information in a specified order. Although the larger, more capable models gave the most accurate answers overall, their accuracy dropped sharply on the more difficult questions.

Researchers noted that OpenAI's GPT-4 and o1 stood out for their willingness to respond, attempting almost every question. All of the language models studied exhibited the same trend, however, especially the LLaMA family, none of which reached 60% accuracy even on simple questions. In short, the more parameters and training data a model has, the higher its proportion of wrong answers.

Despite these models' growing ability to handle complex questions, their errors on simple questions remain concerning. The researchers suggest that we may be so impressed by performance on hard problems that we overlook obvious failures on easy ones.

To address this issue, the researchers suggest setting a response threshold for language models, so that a chatbot can simply say "Sorry, I don't know" when a question exceeds what it can reliably answer (a minimal sketch of this idea appears below). AI companies, however, may be reluctant to adopt such a mechanism, as it would expose the limitations of the technology.
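To make the idea concrete, here is a minimal sketch of such an abstention threshold. This is not the mechanism from the Nature study: the `Answer` type, the `stub_model` function, and the 0.7 threshold are all hypothetical stand-ins, and in a real system the confidence score might come from token log-probabilities or a separate verifier model.

```python
# Illustrative sketch of a confidence-threshold abstention policy.
# NOT the method from the Nature study; the scoring function and
# threshold value here are hypothetical stand-ins.

from dataclasses import dataclass


@dataclass
class Answer:
    text: str
    confidence: float  # model's self-estimated probability of being correct, in [0, 1]


def answer_with_abstention(question: str, generate, threshold: float = 0.7) -> str:
    """Return the model's answer only if its confidence clears the threshold.

    `generate` is any callable mapping a question to an Answer.
    """
    candidate = generate(question)
    if candidate.confidence < threshold:
        return "Sorry, I don't know."
    return candidate.text


def stub_model(question: str) -> Answer:
    # Toy stand-in: confident on simple arithmetic, unsure otherwise.
    if question == "What is 2 + 2?":
        return Answer(text="4", confidence=0.99)
    return Answer(text="(a plausible-sounding guess)", confidence=0.35)


print(answer_with_abstention("What is 2 + 2?", stub_model))              # -> 4
print(answer_with_abstention("Who won the 2031 World Cup?", stub_model))  # -> Sorry, I don't know.
```

The design choice is simply that declining counts as a valid output: any answer whose confidence falls below the threshold is replaced with a refusal rather than a guess.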

Key points:

🔍 AI chatbots are becoming more powerful, but they are also more likely to lie than to refuse to answer.

📉 The study shows that the larger the language model, the higher the proportion of wrong answers.

🤖 Researchers recommend setting an answer threshold for AI, encouraging it to refuse to answer uncertain questions.