Recently, scientists at the Polytechnic University of Valencia in Spain conducted a study revealing that large language models such as GPT, LLaMA, and BLOOM tend to fabricate plausible-sounding answers rather than admit ignorance when asked questions. The study found that as these models grow larger and more complex, their accuracy on difficult questions declines, and they become more likely to invent answers instead of declining to respond.
The researchers also found that human volunteers struggled to spot these erroneous answers in tests, suggesting that AI-generated falsehoods can slip past human scrutiny and pose a real risk to users. To make AI more reliable, the scientists recommend improving performance on simple questions and encouraging models to decline to answer difficult ones, so that users can more accurately judge when an AI's output can be trusted.
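One common way to implement the "decline to answer" behavior the researchers describe is to gate a model's output behind a confidence threshold. The sketch below is a minimal illustration, not the study's method: the `generate` interface, its per-token log-probabilities, and the 0.75 cutoff are all assumptions made for demonstration.

```python
import math

# Minimal sketch of confidence-gated answering. The `generate`
# callable and its return shape (answer text plus per-token
# log-probabilities) are assumed, not a specific library's API.
ABSTAIN_THRESHOLD = 0.75  # assumed cutoff; would need tuning per model


def answer_or_abstain(generate, question, threshold=ABSTAIN_THRESHOLD):
    """Return the model's answer only if its average token
    confidence clears the threshold; otherwise decline."""
    answer, token_logprobs = generate(question)
    # Geometric mean of token probabilities = exp(mean log-probability).
    confidence = math.exp(sum(token_logprobs) / len(token_logprobs))
    return answer if confidence >= threshold else "I don't know."


# Toy stand-in for a real model, for demonstration only.
def fake_generate(question):
    return "Paris", [math.log(0.9), math.log(0.95)]


print(answer_or_abstain(fake_generate, "What is the capital of France?"))
```

In practice, thresholding raw token probabilities is a crude proxy for knowledge, since models can be confidently wrong, but it conveys the basic trade-off the researchers highlight: abstaining more often in exchange for fewer fabricated answers.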
The findings indicate that when answering questions, large language models may prioritize producing seemingly reasonable answers over admitting gaps in their knowledge. This behavior could erode user trust in AI and, in high-stakes settings, lead to serious consequences. The scientists call on developers and researchers to address the issue by improving how models decide whether to answer, in order to ensure that AI systems remain reliable and safe.