The emergence of large language models (LLMs), and especially the widespread adoption of applications like ChatGPT, has changed the way humans interact with machines. These models can generate remarkably coherent and comprehensive text. However, despite their impressive capabilities, LLMs are prone to "hallucinations": content that appears genuine but is in fact fabricated, meaningless, or inconsistent with the prompt.
Researchers at Harvard University have studied the phenomenon of LLM "hallucinations" in depth and argue that the root cause lies in how LLMs work. An LLM builds a probabilistic model from vast amounts of text data and predicts the next word based on the probability of word co-occurrence. In other words, LLMs do not truly understand the meaning of language; they predict from statistical patterns.
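To make that idea concrete, here is a minimal toy sketch of next-word prediction from co-occurrence counts. It is a simple bigram model, not the architecture of any real LLM (which uses neural networks over subword tokens), but it illustrates the same principle: the next word is chosen by probability, not by understanding.

```python
from collections import Counter, defaultdict
import random

# Toy corpus; a real LLM is trained on vastly more text.
corpus = (
    "the cat sat on the mat . "
    "the dog sat on the rug . "
    "the cat chased the dog ."
).split()

# Count how often each word follows each preceding word.
bigram_counts = defaultdict(Counter)
for prev, nxt in zip(corpus, corpus[1:]):
    bigram_counts[prev][nxt] += 1

def predict_next(word):
    """Sample the next word in proportion to how often it co-occurred."""
    counts = bigram_counts[word]
    words, weights = zip(*counts.items())
    return random.choices(words, weights=weights, k=1)[0]

# Generate a short continuation starting from "the".
word = "the"
generated = [word]
for _ in range(6):
    word = predict_next(word)
    generated.append(word)
print(" ".join(generated))
```

Even this toy model can produce a plausible-looking but never-seen sentence such as "the cat chased the rug", because each step only follows the statistics of what tends to come next, which is the same mechanism that underlies hallucination at scale.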
The researchers compare LLMs to "crowdsourcing," suggesting that an LLM essentially outputs the "consensus of the web." Much like platforms such as Wikipedia or Reddit, LLMs distill information from large volumes of text and generate the most common answers. Because most language use describes the world, the answers LLMs generate are usually accurate.
However, when LLMs encounter topics that are ambiguous, controversial, or lack consensus, "hallucinations" occur. To validate this hypothesis, the researchers designed a series of experiments testing how different LLMs perform across a range of topics. The results showed that LLMs perform well on common topics, but their accuracy declines significantly on ambiguous or controversial ones.
The study indicates that while LLMs are powerful tools, their accuracy depends on the quality and quantity of their training data. When using LLMs, especially on ambiguous or controversial topics, their output should be treated with caution. The research also points to directions for future development, such as improving how LLMs handle ambiguous and controversial topics and enhancing the interpretability of their output.
Paper link: https://dl.acm.org/doi/pdf/10.1145/3688007