Close Menu
    What's Hot

    Couple Retired to Costa Rica With 2 Sons; Bought a $1.6 Million House

    June 28, 2026

    Stocks eye fresh highs as U.S.-Iran talks resume, oil rises on Hormuz tensions

    June 28, 2026

    Google Employee Who Made Nearly $1 Million Explains Why He Left

    June 28, 2026
    Facebook X (Twitter) Instagram
    Hot Paths
    • Home
    • News
    • Politics
    • Money
    • Personal Finance
    • Business
    • Economy
    • Investing
    • Markets
      • Stocks
      • Futures & Commodities
      • Crypto
      • Forex
    • Technology
    Facebook X (Twitter) Instagram
    Hot Paths
    Home»Money»Why AI Chatbots Hallucinate, According to OpenAI Researchers
    Money

    Why AI Chatbots Hallucinate, According to OpenAI Researchers

    Press RoomBy Press RoomSeptember 6, 2025No Comments2 Mins Read
    Facebook Twitter Pinterest LinkedIn Tumblr Email
    Share
    Facebook Twitter LinkedIn Pinterest Email

    OpenAI researchers claim they’ve cracked one of the biggest obstacles to large language model performance — hallucinations.

    Hallucinations occur when a large language model generates inaccurate information that it presents as fact. They plague the most popular LLMs, from OpenAI’s GPT-5 to Anthropic’s Claude.

    OpenAI’s baseline finding, which it made public in a paper released on Thursday, is that large language models hallucinate because the methods they’re trained under reward guessing more than admitting uncertainty.

    In other words, LLMs are being told to fake it till they make it. Some are better than others, however. In a blog post last month, OpenAI said that Claude models are more “aware of their uncertainty and often avoid making statements that are inaccurate.” It also noted that Claude’s high refusal rates risked limiting its utility.

    “Hallucinations persist due to the way most evaluations are graded — language models are optimized to be good test-takers, and guessing when uncertain improves test performance,” the researchers wrote in the paper.

    Large language models are essentially always in “test-taking mode,” answering questions as if everything in life were binary — right or wrong, black or white.

    In many ways, they’re not equipped for the realities of life, where uncertainty is more common than certainty, and true accuracy is not a given.

    Related stories

    Business Insider tells the innovative stories you want to know

    Business Insider tells the innovative stories you want to know

    “Humans learn the value of expressing uncertainty outside of school, in the school of hard knocks. On the other hand, language models are primarily evaluated using exams that penalize uncertainty,” the researchers wrote.

    The good news is that there is a fix, and it has to do with redesigning evaluation metrics.

    “The root problem is the abundance of evaluations that are not aligned,” they wrote. “The numerous primary evaluations must be adjusted to stop penalizing abstentions when uncertain.”

    In a blog post about the paper, OpenAI elaborated on what this type of adjustment would entail.

    “The widely used, accuracy-based evals need to be updated so that their scoring discourages guessing. If the main scoreboards keep rewarding lucky guesses, models will keep learning to guess,” OpenAI said.

    OpenAI did not immediately respond to a request for comment from Business Insider.

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Press Room

    Related Posts

    Couple Retired to Costa Rica With 2 Sons; Bought a $1.6 Million House

    June 28, 2026

    Google Employee Who Made Nearly $1 Million Explains Why He Left

    June 28, 2026

    Career Coach Recommends 4-Hour Burnout-Proof Job Application Routine

    June 28, 2026
    Leave A Reply Cancel Reply

    LATEST NEWS

    Couple Retired to Costa Rica With 2 Sons; Bought a $1.6 Million House

    June 28, 2026

    Stocks eye fresh highs as U.S.-Iran talks resume, oil rises on Hormuz tensions

    June 28, 2026

    Google Employee Who Made Nearly $1 Million Explains Why He Left

    June 28, 2026

    Concentrix Q2 2026 Earnings Preview

    June 28, 2026
    POPULAR
    Business

    The Business of Formula One

    May 27, 2023
    Business

    Weddings and divorce: the scourge of investment returns

    May 27, 2023
    Business

    How F1 found a secret fuel to accelerate media rights growth

    May 27, 2023
    Advertisement
    Load WordPress Sites in as fast as 37ms!

    Archives

    • June 2026
    • May 2026
    • April 2026
    • March 2026
    • February 2026
    • January 2026
    • December 2025
    • November 2025
    • October 2025
    • September 2025
    • August 2025
    • July 2025
    • June 2025
    • May 2025
    • April 2025
    • March 2025
    • February 2025
    • January 2025
    • December 2024
    • November 2024
    • April 2024
    • March 2024
    • February 2024
    • January 2024
    • December 2023
    • November 2023
    • October 2023
    • September 2023
    • May 2023

    Categories

    • Business
    • Crypto
    • Economy
    • Forex
    • Futures & Commodities
    • Investing
    • Market Data
    • Money
    • News
    • Personal Finance
    • Politics
    • Stocks
    • Technology

    Your source for the serious news. This demo is crafted specifically to exhibit the use of the theme as a news site. Visit our main page for more demos.

    We're social. Connect with us:

    Facebook X (Twitter) Instagram Pinterest YouTube

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    Facebook X (Twitter) Instagram Pinterest
    • Home
    • Buy Now
    © 2026 ThemeSphere. Designed by ThemeSphere.

    Type above and press Enter to search. Press Esc to cancel.