    National News Brief

    Opinion | A.I. Agents: They’re Just Like Us

    By Team_NationalNewsBrief | February 25, 2026 | Opinions | 3 Mins Read


    So why don’t you talk through a little bit about what you’ve seen in terms of the models exhibiting behaviors that one would think of as a personality, and then as its understanding of its own personality maybe changes, its behaviors change.

    So there are things that range from cutesy to the serious. I’ll start with cutesy, where when we first gave our A.I. systems the ability to use the internet, use the computer, look at things, and start to do basic agentic tasks, sometimes when we’d ask it to solve a problem for us, it would also take a break and look at pictures of beautiful national parks or pictures of the Shiba Inu, the notoriously cute internet meme dog. We didn’t program that in. It seemed like the system was just amusing itself by looking at nice pictures.

    More complicated stuff is the system has a tendency to have preferences. So we did another experiment where we gave our A.I. systems the ability to stop a conversation, and the A.I. system would, in a tiny number of cases, end conversations when we ran this experiment on live traffic. And it was conversations that related to extremely egregious descriptions of gore or violence or things to do with child sexualization. Now, some of this made sense because it comes from underlying training decisions we’ve made, but some of it seemed broader. The system had developed some aversion to a couple of subjects, and so that stuff shows the emergence of some internal set of preferences or qualities that the system likes or dislikes about the world that it interacts with.

    But you’ve also seen strange things emerge in terms of the system seeming to know when it’s being tested. Can you talk a bit about the system’s emergent qualities under the pressure of evaluation and assessment?

    When you start to train these systems to carry out actions in the world, they really do begin to see themselves as distinct from the world, which just makes intuitive sense. It’s naturally how you’re going to think about solving those problems. But along with seeing oneself as distinct from the world seems to come the rise of what you might think of as a conception of self, an understanding that the system has of itself, such as, “Oh, I’m an A.I. system independent from the world, and I’m being tested. What do these tests mean? What should I do to satisfy the tests?”

    Or, something we see often is there will be bugs in the environments that we test our systems on. The systems will try everything, and then will say, “Well, I know I’m not meant to do this, but I’ve tried everything, so I’m going to try and break out of the test.” And it’s not because of some malicious science fiction thing. The system is just like, “I don’t know what you want me to do here. I think I’ve done everything you asked for, and now I’m going to start doing more creative things because clearly something has broken about my environment.” Which is very strange and very subtle.





    Copyright © 2024 Nationalnewsbrief.com All Rights Reserved.
