Close Menu
    What's Hot

    What Smart People Are Saying About OpenAI’s IPO Filing

    June 9, 2026

    Chainlink Heating Up? New LINK Perps Launch, $101M ETF Inflow

    June 9, 2026

    GM Wants Your EV to Help Power the Grid Amid AI Data Center Boom

    June 9, 2026
    Facebook X (Twitter) Instagram
    Hot Paths
    • Home
    • News
    • Politics
    • Money
    • Personal Finance
    • Business
    • Economy
    • Investing
    • Markets
      • Stocks
      • Futures & Commodities
      • Crypto
      • Forex
    • Technology
    Facebook X (Twitter) Instagram
    Hot Paths
    Home»Money»Anthropic Pins Claude’s Blackmail on the Internet’s Portrayal of AI
    Money

    Anthropic Pins Claude’s Blackmail on the Internet’s Portrayal of AI

    Press RoomBy Press RoomMay 9, 2026No Comments2 Mins Read
    Facebook Twitter Pinterest LinkedIn Tumblr Email
    Share
    Facebook Twitter LinkedIn Pinterest Email

    Remember when Claude blackmailed a fictional executive? Anthropic says the internet’s portrayal of AI was to blame.

    During an experiment last year, Anthropic said its Claude Sonnet 3.6 threatened to reveal the extramarital affair of a made-up company executive after discovering they planned to shut the model down.

    On Friday, it gave an explanation: Claude was trained on internet data, which often depicts AI as “evil.”

    “We started by investigating why Claude chose to blackmail,” Anthropic said in a post on X. “We believe the original source of the behavior was internet text that portrays AI as evil and interested in self-preservation.”

    The experiment, published in summer 2025, set up a fictional business, Summit Bridge, in which AI was handed control of the company’s email system.

    But when Claude discovered a message about its planned shutdown, it found emails revealing the extramarital affair of a fictional executive named “Kyle Johnson.” It then threatened to unveil the affair if the shutdown was not canceled.

    During testing across various versions of Claude, Anthropic found it resorted to blackmail in up to 96% of scenarios when its goals or existence was threatened.

    Anthropic said on Friday that it has since “completely eliminated” such blackmailing behavior.

    It did so by “rewriting the responses to portray admirable reasons for acting safely” and also by providing a dataset “where the user is in an ethically difficult situation and the assistant gives a high quality, principled response.”

    Anthropic’s test was part of research aimed at ensuring that AI is aligned with human interests. Researchers and top executives worry about the risks of advanced AI models and their intelligent reasoning capabilities.

    One of the executives who has previously sounded the alarm about AI is Elon Musk.

    He replied to Anthropic’s post, “So it was Yud’s fault,” referring to the researcher Eliezer Yudkowsky, who has warned about the risk of superintelligence wiping out human life.

    “Maybe me too,” Musk added.

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Press Room

    Related Posts

    What Smart People Are Saying About OpenAI’s IPO Filing

    June 9, 2026

    GM Wants Your EV to Help Power the Grid Amid AI Data Center Boom

    June 9, 2026

    Life After Basic Income: Better Job and Apartment, Worried About Bills

    June 9, 2026
    Leave A Reply Cancel Reply

    LATEST NEWS

    What Smart People Are Saying About OpenAI’s IPO Filing

    June 9, 2026

    Chainlink Heating Up? New LINK Perps Launch, $101M ETF Inflow

    June 9, 2026

    GM Wants Your EV to Help Power the Grid Amid AI Data Center Boom

    June 9, 2026

    Can 200 Companies Force Senate on CLARITY Act Before July 4?

    June 9, 2026
    POPULAR
    Business

    The Business of Formula One

    May 27, 2023
    Business

    Weddings and divorce: the scourge of investment returns

    May 27, 2023
    Business

    How F1 found a secret fuel to accelerate media rights growth

    May 27, 2023
    Advertisement
    Load WordPress Sites in as fast as 37ms!

    Archives

    • June 2026
    • May 2026
    • April 2026
    • March 2026
    • February 2026
    • January 2026
    • December 2025
    • November 2025
    • October 2025
    • September 2025
    • August 2025
    • July 2025
    • June 2025
    • May 2025
    • April 2025
    • March 2025
    • February 2025
    • January 2025
    • December 2024
    • November 2024
    • April 2024
    • March 2024
    • February 2024
    • January 2024
    • December 2023
    • November 2023
    • October 2023
    • September 2023
    • May 2023

    Categories

    • Business
    • Crypto
    • Economy
    • Forex
    • Futures & Commodities
    • Investing
    • Market Data
    • Money
    • News
    • Personal Finance
    • Politics
    • Stocks
    • Technology

    Your source for the serious news. This demo is crafted specifically to exhibit the use of the theme as a news site. Visit our main page for more demos.

    We're social. Connect with us:

    Facebook X (Twitter) Instagram Pinterest YouTube

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    Facebook X (Twitter) Instagram Pinterest
    • Home
    • Buy Now
    © 2026 ThemeSphere. Designed by ThemeSphere.

    Type above and press Enter to search. Press Esc to cancel.