Close Menu
    What's Hot

    XRP Stalls Despite Ripple’s OCC Win – Here’s The Institutional Catch

    December 13, 2025

    Taylor Swift Gave Eras Tour Crew Jaw-Dropping Bonus Checks

    December 13, 2025

    China’s DeepSeek AI Predicts the Price of XRP, Solana, Dogecoin by the End of 2025

    December 13, 2025
    Facebook X (Twitter) Instagram
    Hot Paths
    • Home
    • News
    • Politics
    • Money
    • Personal Finance
    • Business
    • Economy
    • Investing
    • Markets
      • Stocks
      • Futures & Commodities
      • Crypto
      • Forex
    • Technology
    Facebook X (Twitter) Instagram
    Hot Paths
    Home»Money»Google Researchers Find the Best AI Model Is 69% Right
    Money

    Google Researchers Find the Best AI Model Is 69% Right

    Press RoomBy Press RoomDecember 13, 2025No Comments2 Mins Read
    Facebook Twitter Pinterest LinkedIn Tumblr Email
    Share
    Facebook Twitter LinkedIn Pinterest Email


    2025-12-12T21:26:30.691Z



    Facebook


    Email


    X



    LinkedIn


    Reddit



    Bluesky


    WhatsApp



    Copy link

    lighning bolt icon An icon in the shape of a lightning bolt.


    Impact Link



    Save
    Saved


    Read in app

    This story is available exclusively to Business Insider
    subscribers. Become an Insider
    and start reading now.

    Have an account? .

    We just got a sobering picture of how often AI models get their facts straight. This week, Google DeepMind introduced the FACTS Benchmark Suite, which measures how reliably AI models produce factually accurate answers.

    It tests models in four areas: answering factoid questions from internal knowledge, using web search effectively, grounding responses in long documents, and interpreting images. The best model, Google’s Gemini 3 Pro, reached 69% accuracy, with other leading models falling well below that.

    For context, if any of the reporters I manage filed stories that were 69% accurate, I would fire them.

    Beyond journalism, this number should matter to businesses betting on AI. While models excel at speed and fluency, their factual reliability still lags far behind human expectations, especially in tasks involving niche knowledge, complex reasoning, or precise grounding in source material.

    Even small factual errors can have outsized consequences in sectors such as finance, healthcare, and the law. This week, my talented colleague Melia Russell looked at how law firms are handling the rise of AI models as a source of legal truth. It’s messy: She recounts how one firm fired an employee because they filed a document riddled with fake cases after using ChatGPT to draft it.

    The FACTS benchmark is a warning but also a roadmap: by quantifying where and how models fail, Google hopes to accelerate progress. But for now, the takeaway is clear: AI is getting better, but it’s still wrong about one-third of the time.

    Sign up for BI’s Tech Memo newsletter here. Reach out to me via email at abarr@businessinsider.com.

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Press Room

    Related Posts

    Taylor Swift Gave Eras Tour Crew Jaw-Dropping Bonus Checks

    December 13, 2025

    Tesla’s Latest Launch Isn’t a Car — It’s a $350 Pickleball Paddle

    December 13, 2025

    How I Got AI to Help Me Sell My Old Couch

    December 13, 2025
    Leave A Reply Cancel Reply

    LATEST NEWS

    XRP Stalls Despite Ripple’s OCC Win – Here’s The Institutional Catch

    December 13, 2025

    Taylor Swift Gave Eras Tour Crew Jaw-Dropping Bonus Checks

    December 13, 2025

    China’s DeepSeek AI Predicts the Price of XRP, Solana, Dogecoin by the End of 2025

    December 13, 2025

    Tesla’s Latest Launch Isn’t a Car — It’s a $350 Pickleball Paddle

    December 13, 2025
    POPULAR
    Business

    The Business of Formula One

    May 27, 2023
    Business

    Weddings and divorce: the scourge of investment returns

    May 27, 2023
    Business

    How F1 found a secret fuel to accelerate media rights growth

    May 27, 2023
    Advertisement
    Load WordPress Sites in as fast as 37ms!

    Archives

    • December 2025
    • November 2025
    • October 2025
    • September 2025
    • August 2025
    • July 2025
    • June 2025
    • May 2025
    • April 2025
    • March 2025
    • February 2025
    • January 2025
    • December 2024
    • November 2024
    • April 2024
    • March 2024
    • February 2024
    • January 2024
    • December 2023
    • November 2023
    • October 2023
    • September 2023
    • May 2023

    Categories

    • Business
    • Crypto
    • Economy
    • Forex
    • Futures & Commodities
    • Investing
    • Market Data
    • Money
    • News
    • Personal Finance
    • Politics
    • Stocks
    • Technology

    Your source for the serious news. This demo is crafted specifically to exhibit the use of the theme as a news site. Visit our main page for more demos.

    We're social. Connect with us:

    Facebook X (Twitter) Instagram Pinterest YouTube

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    Facebook X (Twitter) Instagram Pinterest
    • Home
    • Buy Now
    © 2025 ThemeSphere. Designed by ThemeSphere.

    Type above and press Enter to search. Press Esc to cancel.