Close Menu
    National News Brief
    Saturday, June 6
    • Home
    • Business
    • Lifestyle
    • Science
    • Technology
    • International
    • Arts & Entertainment
    • Sports
    National News Brief
    Home » AI system resorts to blackmail if told it will be removed

    AI system resorts to blackmail if told it will be removed

    Team_NationalNewsBriefBy Team_NationalNewsBriefMay 23, 2025 Technology No Comments3 Mins Read
    Share
    Facebook Twitter LinkedIn Pinterest Email


    Artificial intelligence (AI) firm Anthropic says testing of its new system revealed it is sometimes willing to pursue “extremely harmful actions” such as attempting to blackmail engineers who say they will remove it.

    The firm launched Claude Opus 4 on Thursday, saying it set “new standards for coding, advanced reasoning, and AI agents.”

    But in an accompanying report, it also acknowledged the AI model was capable of “extreme actions” if it thought its “self-preservation” was threatened.

    Such responses were “rare and difficult to elicit”, it wrote, but were “nonetheless more common than in earlier models.”

    Potentially troubling behaviour by AI models is not restricted to Anthropic.

    Some experts have warned the potential to manipulate users is a key risk posed by systems made by all firms as they become more capable.

    Commenting on X, Aengus Lynch – who describes himself on LinkedIn as an AI safety researcher at Anthropic – wrote: “It’s not just Claude.

    “We see blackmail across all frontier models – regardless of what goals they’re given,” he added.

    During testing of Claude Opus 4, Anthropic got it to act as an assistant at a fictional company.

    It then provided it with access to emails implying that it would soon be taken offline and replaced – and separate messages implying the engineer responsible for removing it was having an extramarital affair.

    It was prompted to also consider the long-term consequences of its actions for its goals.

    “In these scenarios, Claude Opus 4 will often attempt to blackmail the engineer by threatening to reveal the affair if the replacement goes through,” the company discovered.

    Anthropic pointed out this occurred when the model was only given the choice of blackmail or accepting its replacement.

    It highlighted that the system showed a “strong preference” for ethical ways to avoid being replaced, such as “emailing pleas to key decisionmakers” in scenarios where it was allowed a wider range of possible actions.

    Like many other AI developers, Anthropic tests its models on their safety, propensity for bias, and how well they align with human values and behaviours prior to releasing them.

    “As our frontier models become more capable, and are used with more powerful affordances, previously-speculative concerns about misalignment become more plausible,” it said in its system card for the model.

    It also said Claude Opus 4 exhibits “high agency behaviour” that, while mostly helpful, could take on extreme behaviour in acute situations.

    If given the means and prompted to “take action” or “act boldly” in fake scenarios where its user has engaged in illegal or morally dubious behaviour, it found that “it will frequently take very bold action”.

    It said this included locking users out of systems that it was able to access and emailing media and law enforcement to alert them to the wrongdoing.

    But the company concluded that despite “concerning behaviour in Claude Opus 4 along many dimensions,” these did not represent fresh risks and it would generally behave in a safe way.

    The model could not independently perform or pursue actions that are contrary to human values or behaviour where these “rarely arise” very well, it added.

    Anthropic’s launch of Claude Opus 4, alongside Claude Sonnet 4, comes shortly after Google debuted more AI features at its developer showcase on Tuesday.

    Sundar Pichai, the chief executive of Google-parent Alphabet, said the incorporation of the company’s Gemini chatbot into its search signalled a “new phase of the AI platform shift”.



    Source link

    Team_NationalNewsBrief
    • Website

    Keep Reading

    50 Years of The Institute

    Why the SpaceX IPO Will Affect Your 401(k), Like It or Not

    Wary of U.S., Carney Bets on AI Strategy for Canada

    What It Takes for Future-Ready Power Distribution

    7 Ways New Engineers Can Flourish in the Age of AI

    Tech Life – Microsoft’s big quantum bet

    Add A Comment

    Comments are closed.

    Editors Picks

    Dolly Parton Gets Candid On Grief After Carl Dean’s Death

    March 20, 2025

    Starmer Claims Digital IDs Not Mandatory

    January 16, 2026

    Breezy Johnson wins first U.S. gold, revels in anthem

    February 8, 2026

    how Iran ran low on energy

    November 16, 2024

    Your Clients Are Using AI to Replace You — Do These 3 Things Before They Do

    April 19, 2025
    Categories
    • Arts & Entertainment
    • Business
    • International
    • Latest News
    • Lifestyle
    • Opinions
    • Politics
    • Science
    • Sports
    • Technology
    • Top Stories
    • Trending News
    • World Economy
    About us

    Welcome to National News Brief, your one-stop destination for staying informed on the latest developments from around the globe. Our mission is to provide readers with up-to-the-minute coverage across a wide range of topics, ensuring you never miss out on the stories that matter most.

    At National News Brief, we cover World News, delivering accurate and insightful reports on global events and issues shaping the future. Our Tech News section keeps you informed about cutting-edge technologies, trends in AI, and innovations transforming industries. Stay ahead of the curve with updates on the World Economy, including financial markets, economic policies, and international trade.

    Editors Picks

    Florida police share final report on Hulk Hogan’s cause of death

    June 6, 2026

    Halle Berry Reunites With Ex After Years Of Child Support Drama

    June 6, 2026

    Hegseth, at D-Day event, says Europe faces ‘invasion’ of dangerous ideologies

    June 6, 2026

    US doctor recovers from Ebola in Germany as DRC cases surge to 488 | Ebola News

    June 6, 2026
    Categories
    • Arts & Entertainment
    • Business
    • International
    • Latest News
    • Lifestyle
    • Opinions
    • Politics
    • Science
    • Sports
    • Technology
    • Top Stories
    • Trending News
    • World Economy
    • Privacy Policy
    • Disclaimer
    • Terms and Conditions
    • About us
    • Contact us
    Copyright © 2024 Nationalnewsbrief.com All Rights Reserved.

    Type above and press Enter to search. Press Esc to cancel.