Close Menu
    National News Brief
    Thursday, April 30
    • Home
    • Business
    • Lifestyle
    • Science
    • Technology
    • International
    • Arts & Entertainment
    • Sports
    National News Brief
    Home»Science

    Google’s Gemini 3 model keeps the AI hype train going – for now

    Team_NationalNewsBriefBy Team_NationalNewsBriefNovember 24, 2025 Science No Comments4 Mins Read
    Share
    Facebook Twitter LinkedIn Pinterest Email


    Gemini 3 is Google’s latest AI model

    VCG via Getty Images

    Google’s latest chatbot, Gemini 3, has made significant leaps on a raft of benchmarks designed to measure AI progress, according to the company. These achievements may be enough to allay fears of an AI bubble bursting for the moment, but it is unclear how well these scores translate to real-world capabilities.

    What’s more, persistent factual inaccuracies and hallucinations that have become a hallmark of all large language models show no signs of being ironed out, which could prove problematic for any uses where reliability is vital.

    In a blog post announcing the new model, Google bosses Sundar Pichai, Demis Hassabis and Koray Kavukcuoglu write that Gemini 3 has “PhD-level reasoning”, a phrase that competitor OpenAI also used when it announced its GPT-5 model. As evidence for this, they list scores on several tests designed to test “graduate-level” knowledge, such as Humanity’s Last Exam, a set of 2500 research-level questions from maths, science and the humanities. Gemini 3 scored 37.5 per cent on this test, outclassing the previous record holder, a version of OpenAI’s GPT-5, which scored 26.5 per cent.

    Jumps like this can indicate that a model has become more capable in certain respects, says Luc Rocher at the University of Oxford, but we need to be careful about how we interpret these results. “If a model goes from 80 per cent to 90 per cent on a benchmark, what does it mean? Does it mean that a model was 80 per cent PhD level and now is 90 per cent PhD level? I think it’s quite difficult to understand,” they say. “There is no number that we can put on whether an AI model has reasoning, because this is a very subjective notion.”

    Benchmark tests have many limitations, such as requiring a single answer or multiple choice answers for which models don’t need to show their working. “It’s very easy to use multiple choice questions to grade [the models],” says Rocher, “but if you go to a doctor, the doctor will not assess you with a multiple choice. If you ask a lawyer, a lawyer will not give you legal advice with multiple choice answers.” There is also a risk that the answers to such tests were hoovered up in the training data of the AI models being tested, effectively letting them cheat.

    The real test for Gemini 3 and the most advanced AI models – and whether their performance will be enough to justify the trillions of dollars that companies like Google and OpenAI are spending on AI data centres – will be in how people use the model and how reliable they find it, says Rocher.

    Google says the model’s improved capabilities will make it better at producing software, organising email and analysing documents. The firm also says it will improve Google search by supplementing AI-generated results with graphics and simulations.

    It is likely that the real improvements will be for people who use AI tools to autonomously write code, a process called agentic coding, says Adam Mahdi at the University of Oxford. “I think we’re hitting the upper limit of what a typical chatbot can do, and the real benefits of Gemini 3 Pro [the standard version of Gemini 3] will probably be in more complex, potentially agentic workflows, rather than everyday chatting,” he says.

     Initial reactions online have included people praising Gemini’s coding capabilities and ability to reason, but as with all new model releases, there have also been posts highlighting failures to do apparently simple tasks, such as tracing hand-drawn arrows pointing to different people, or simple visual reasoning tests.

    Google admits, in Gemini 3’s technical specifications, that the model will continue to hallucinate and produce factual inaccuracies some of the time, at a rate that is roughly comparable with other leading AI models. The lack of improvement in this area is a big concern, says Artur d’Avila Garcez at City St George’s, University of London. “The problem is that all AI companies have been trying to reduce hallucinations for more than two years, but you only need one very bad hallucination to destroy trust in the system for good,” he says.

    Topics:



    Source link

    Team_NationalNewsBrief
    • Website

    Keep Reading

    ‘Green’ cryptocurrency uses 18 times more energy than makers claim

    Doubts cast over ‘wild’ claim that magnetic control can turn on genes

    We have figured out a new way to send messages into the past

    10,000 new planets found hidden in NASA telescope data

    Weird ‘transdimensional’ state of matter is neither 2D nor 3D

    Simple treatment tweak drastically reduces blood loss from severe cuts

    Add A Comment

    Comments are closed.

    Editors Picks

    Branded Hospitality: Where Strategy Meets Shtick

    April 15, 2025

    Surprising new biography of Francis Crick unravels the story of DNA

    November 14, 2025

    UK says several proposals for Ukraine ceasefire on table after France floats one-month truce

    March 3, 2025

    Migrants returning to Venezuela face debt and harsh living conditions

    August 12, 2025

    Feds Spent At Least $267 Million Spent on Fighting “Misinformation”

    December 17, 2024
    Categories
    • Arts & Entertainment
    • Business
    • International
    • Latest News
    • Lifestyle
    • Opinions
    • Politics
    • Science
    • Sports
    • Technology
    • Top Stories
    • Trending News
    • World Economy
    About us

    Welcome to National News Brief, your one-stop destination for staying informed on the latest developments from around the globe. Our mission is to provide readers with up-to-the-minute coverage across a wide range of topics, ensuring you never miss out on the stories that matter most.

    At National News Brief, we cover World News, delivering accurate and insightful reports on global events and issues shaping the future. Our Tech News section keeps you informed about cutting-edge technologies, trends in AI, and innovations transforming industries. Stay ahead of the curve with updates on the World Economy, including financial markets, economic policies, and international trade.

    Editors Picks

    Market Talk – April 30, 2026

    April 30, 2026

    Jinger Duggar Talks ‘Old Wounds’ Amid Brother’s Abuse Charges

    April 30, 2026

    ‘Everyone wants to come to China now’: Chinese universities draw more foreign students beyond Western countries

    April 30, 2026

    First US-Venezuela flight lands in Caracas after seven-year suspension | Aviation News

    April 30, 2026
    Categories
    • Arts & Entertainment
    • Business
    • International
    • Latest News
    • Lifestyle
    • Opinions
    • Politics
    • Science
    • Sports
    • Technology
    • Top Stories
    • Trending News
    • World Economy
    • Privacy Policy
    • Disclaimer
    • Terms and Conditions
    • About us
    • Contact us
    Copyright © 2024 Nationalnewsbrief.com All Rights Reserved.

    Type above and press Enter to search. Press Esc to cancel.