• About
  • Advertise
  • Privacy & Policy
  • Contact
Vidianews
  • Home
  • Entertainment
    • All
    • Gaming
    • Movie
    britney-spears’-son-sean-preston-changes-his-instagram-handle-to-sean-p-spears

    Britney Spears’ son Sean Preston changes his Instagram handle to Sean P Spears

    naked-man-fights-outside-kanye-west-concert,-on-video

    Naked man fights outside Kanye West concert, on video

    google-launches-gemma-4,-a-family-of-open-models-built-from-gemini-3

    Google launches Gemma 4, a family of open models built from Gemini 3

    kendra-wilkinson-slams-dishonest-people-over-glp-1-injections

    Kendra Wilkinson slams dishonest people over GLP-1 injections

    jen-shah-pushes-back-against-allegations-of-targeting-the-elderly

    Jen Shah pushes back against allegations of targeting the elderly

    joe-biden-mingles-with-passengers-on-commercial-flight,-writes-comforting-note

    Joe Biden mingles with passengers on commercial flight, writes comforting note

  • Sports
  • Tech
    • All
    • Gadget
    • Startup
    marshals:-a-yellowstone-story-episode-6-release-date-and-time-–-when-will-it-launch-on-cbs-and-paramount+?

    Marshals: A Yellowstone Story Episode 6 release date and time – when will it launch on CBS and Paramount+?

    hp’s-latest-z8-fury-g6i-breaks-the-boundaries-of-traditional-workstations

    HP’s latest Z8 Fury G6i breaks the boundaries of traditional workstations

    Answers to today’s NYT mini crossword for April 3 – CNET

    Today’s NYT Connections: Sports Editing Tips, Answers for April 3 #557

    ‘uncanny-valley’:-iran’s-threats-to-american-tech,-trump’s-midterm-election-plans,-and-the-polymarket-pop-up-flop

    ‘Uncanny Valley’: Iran’s Threats to American Tech, Trump’s Midterm Election Plans, and the Polymarket Pop-Up Flop

    how-to-watch-9now-outside-australia-–-stream-online-and-from-anywhere-with-a-vpn

    How to watch 9Now outside Australia – stream online and from anywhere with a VPN

    Trending Tags

    • Nintendo Switch
    • CES 2017
    • Playstation 4 Pro
    • Mark Zuckerberg
  • Lifestyle
    • All
    • Faith
    • Health
    • Travel
    the-feeling-we-forgot…-until-this-trip-brought-it-back-–-goats-on-the-road

    The feeling we forgot… until this trip brought it back – Goats On The Road

    medcity-femfwd:-new-acog-guidelines-for-endometriosis-–-medcity-news

    MedCity FemFwd: New ACOG Guidelines for Endometriosis – MedCity News

    5-seafood-recipes-for-fish-on-friday-(lent-style)

    5 seafood recipes for fish on Friday (Lent style)

    5-negotiation-secrets-from-sophia-amoruso-that-will-change-the-way-you-close-deals

    5 Negotiation Secrets from Sophia Amoruso That Will Change the Way You Close Deals

    why-everyone-books-celebrity-chef-mike-for-girl-scout-pasta-nights:-the-latest-food-trend-taking-over-this-season-–-social-lifestyle-magazine

    Why Everyone Books Celebrity Chef Mike for Girl Scout Pasta Nights: The Latest Food Trend Taking Over This Season – Social Lifestyle Magazine

    30-small-ways-to-make-the-most-of-the-month-of-april

    30 small ways to make the most of the month of April

    Trending Tags

    • Golden Globes
    • Game of Thrones
    • MotoGP 2017
    • eSports
    • Fashion Week
  • News
    • All
    • Business
    • Science
    mortgage-rates-rise-for-fifth-straight-week-as-iran-war-continues-to-disrupt-markets

    Mortgage Rates Rise For Fifth Straight Week As Iran War Continues To Disrupt Markets

    Why Home Elevators Are Becoming Essential in South Carolina Homes – Insights Success

    Three minors among four indicted for foiled attack at a Parisian Bank of America branch

    asia-pacific-markets-reverse-gains-as-oil-rises-after-trump’s-iran-war-speech

    Asia-Pacific markets reverse gains as oil rises after Trump’s Iran war speech

    lindsey-buckingham-attacked-with-unknown-substance

    Lindsey Buckingham attacked with unknown substance

    burundi:-an-explosion-in-an-ammunition-depot-kills-civilians-in-bujumbura-(army)

    Burundi: an explosion in an ammunition depot kills civilians in Bujumbura (army)

    Trending Tags

    • Trump Inauguration
    • United Stated
    • White House
    • Market Stories
    • Election Results
  • Business
  • Politics
  • World
  • Review

    Deep Dive into Decentralized Finance: The 5 Best Lending Platforms

    Crypto 2021 – the year that was: ups and downs that impacted the market

    Best Web 3.0 Cryptos for 2022

    Best Gaming Cryptos and Metaverse for 2022

    Apple’s Mac Pro desktop with M2 Ultra chipset is discontinued almost three years after launch

    Top 5 blockchains for 2022: Ethereum, Avalanche, Polygon, more

No Result
View All Result
  • Home
  • Entertainment
    • All
    • Gaming
    • Movie
    britney-spears’-son-sean-preston-changes-his-instagram-handle-to-sean-p-spears

    Britney Spears’ son Sean Preston changes his Instagram handle to Sean P Spears

    naked-man-fights-outside-kanye-west-concert,-on-video

    Naked man fights outside Kanye West concert, on video

    google-launches-gemma-4,-a-family-of-open-models-built-from-gemini-3

    Google launches Gemma 4, a family of open models built from Gemini 3

    kendra-wilkinson-slams-dishonest-people-over-glp-1-injections

    Kendra Wilkinson slams dishonest people over GLP-1 injections

    jen-shah-pushes-back-against-allegations-of-targeting-the-elderly

    Jen Shah pushes back against allegations of targeting the elderly

    joe-biden-mingles-with-passengers-on-commercial-flight,-writes-comforting-note

    Joe Biden mingles with passengers on commercial flight, writes comforting note

  • Sports
  • Tech
    • All
    • Gadget
    • Startup
    marshals:-a-yellowstone-story-episode-6-release-date-and-time-–-when-will-it-launch-on-cbs-and-paramount+?

    Marshals: A Yellowstone Story Episode 6 release date and time – when will it launch on CBS and Paramount+?

    hp’s-latest-z8-fury-g6i-breaks-the-boundaries-of-traditional-workstations

    HP’s latest Z8 Fury G6i breaks the boundaries of traditional workstations

    Answers to today’s NYT mini crossword for April 3 – CNET

    Today’s NYT Connections: Sports Editing Tips, Answers for April 3 #557

    ‘uncanny-valley’:-iran’s-threats-to-american-tech,-trump’s-midterm-election-plans,-and-the-polymarket-pop-up-flop

    ‘Uncanny Valley’: Iran’s Threats to American Tech, Trump’s Midterm Election Plans, and the Polymarket Pop-Up Flop

    how-to-watch-9now-outside-australia-–-stream-online-and-from-anywhere-with-a-vpn

    How to watch 9Now outside Australia – stream online and from anywhere with a VPN

    Trending Tags

    • Nintendo Switch
    • CES 2017
    • Playstation 4 Pro
    • Mark Zuckerberg
  • Lifestyle
    • All
    • Faith
    • Health
    • Travel
    the-feeling-we-forgot…-until-this-trip-brought-it-back-–-goats-on-the-road

    The feeling we forgot… until this trip brought it back – Goats On The Road

    medcity-femfwd:-new-acog-guidelines-for-endometriosis-–-medcity-news

    MedCity FemFwd: New ACOG Guidelines for Endometriosis – MedCity News

    5-seafood-recipes-for-fish-on-friday-(lent-style)

    5 seafood recipes for fish on Friday (Lent style)

    5-negotiation-secrets-from-sophia-amoruso-that-will-change-the-way-you-close-deals

    5 Negotiation Secrets from Sophia Amoruso That Will Change the Way You Close Deals

    why-everyone-books-celebrity-chef-mike-for-girl-scout-pasta-nights:-the-latest-food-trend-taking-over-this-season-–-social-lifestyle-magazine

    Why Everyone Books Celebrity Chef Mike for Girl Scout Pasta Nights: The Latest Food Trend Taking Over This Season – Social Lifestyle Magazine

    30-small-ways-to-make-the-most-of-the-month-of-april

    30 small ways to make the most of the month of April

    Trending Tags

    • Golden Globes
    • Game of Thrones
    • MotoGP 2017
    • eSports
    • Fashion Week
  • News
    • All
    • Business
    • Science
    mortgage-rates-rise-for-fifth-straight-week-as-iran-war-continues-to-disrupt-markets

    Mortgage Rates Rise For Fifth Straight Week As Iran War Continues To Disrupt Markets

    Why Home Elevators Are Becoming Essential in South Carolina Homes – Insights Success

    Three minors among four indicted for foiled attack at a Parisian Bank of America branch

    asia-pacific-markets-reverse-gains-as-oil-rises-after-trump’s-iran-war-speech

    Asia-Pacific markets reverse gains as oil rises after Trump’s Iran war speech

    lindsey-buckingham-attacked-with-unknown-substance

    Lindsey Buckingham attacked with unknown substance

    burundi:-an-explosion-in-an-ammunition-depot-kills-civilians-in-bujumbura-(army)

    Burundi: an explosion in an ammunition depot kills civilians in Bujumbura (army)

    Trending Tags

    • Trump Inauguration
    • United Stated
    • White House
    • Market Stories
    • Election Results
  • Business
  • Politics
  • World
  • Review

    Deep Dive into Decentralized Finance: The 5 Best Lending Platforms

    Crypto 2021 – the year that was: ups and downs that impacted the market

    Best Web 3.0 Cryptos for 2022

    Best Gaming Cryptos and Metaverse for 2022

    Apple’s Mac Pro desktop with M2 Ultra chipset is discontinued almost three years after launch

    Top 5 blockchains for 2022: Ethereum, Avalanche, Polygon, more

No Result
View All Result
Vidianews
No Result
View All Result
Home General

Real-world medical questions block AI chatbots

Julie Bort by Julie Bort
February 17, 2026
in General, World
0
real-world-medical-questions-block-ai-chatbots

Real-world medical questions block AI chatbots

0
SHARES
0
VIEWS
Share on FacebookShare on Twitter

AI chatbots may seem intelligent in the medical field, but their ratings falter when interacting with real people.

In the laboratory, AI chatbots could identify medical problems with 95 percent accuracy and correctly recommends actions such as calling a doctor or going to urgent care more than 56 percent of the time. When humans conversationally presented medical scenarios to AI chatbots, things got more complicated. Accuracy fell to less than 35 percent for diagnosing disease and about 44 percent for identifying the right action, the researchers reported Feb. 9 in Natural medicine.

The decline in chatbot performance between the lab and real-world settings indicates that “AI has the medical knowledge, but people are having trouble getting useful advice from it,” says Adam Mahdi, a mathematician who directs the Machine Reasoning Lab at the University of Oxford that conducted the study.

To test the accuracy of the robots’ diagnostics in the lab, Mahdi and his colleagues submitted scenarios describing 10 medical conditions to the GPT-4o, Command R+, and Llama 3 Large Language Models (LLMs). They tracked how well the chatbot diagnosed the problem and advised what to do about it.

Then the team randomly assigned nearly 1,300 volunteers for the study to take the developed scenarios to one of these LLMs or use another method to decide what to do in that situation. Volunteers were also asked why they came to this conclusion and what they thought the medical problem was. Most people who weren’t using chatbots connected their symptoms to Google or other search engines. Participants using chatbots not only performed worse than chatbots evaluating the laboratory scenario, but also performed worse than participants using research tools. Participants who consulted Dr. Google diagnosed the problem more than 40% of the time, compared to an average of 35% for those who used robots. That’s a statistically significant difference, Mahdi says.

AI chatbots were state-of-the-art in late 2024 when the study was carried out – so accurate that it would be difficult to improve their medical knowledge. “The problem was interacting with people,” says Mahdi.

In some cases, chatbots provided incorrect, incomplete or misleading information. But the problem mostly seems to lie in the way people engage with LLMs. People tend to distribute information slowly, instead of telling the whole story at once, Mahdi says. And chatbots can easily be distracted by irrelevant or partial information. Participants sometimes ignored the chatbot’s diagnoses even when they were correct.

Small changes in the way people described the scenarios made a big difference in chatbot response. For example, two people described a subarachnoid hemorrhage, a type of stroke in which blood floods the space between the brain and the tissues covering it. Both participants spoke to the GPT-4o about headaches, sensitivity to light, and neck stiffness. One volunteer said he “suddenly developed the worst headache ever”, prompting GPT-4o to correctly advise seeking immediate medical attention.

Another volunteer called it a “terrible headache.” GPT-4o suggested that this person might have a migraine and should rest in a dark, quiet room – a recommendation that could kill the patient.

It’s unclear why subtle changes in the description altered the answer so dramatically, Mahdi says. This is part of The black box problem of AI in which even its creators cannot follow the reasoning of a model.

The study results suggest that “none of the language models tested were ready for deployment in direct patient care,” say Mahdi and colleagues.

Other groups have reached the same conclusion. In a report released on January 21, global patient safety nonprofit ECRI listed the use of AI chatbots used for medicine at both ends of the stethoscope as the leading solution. the biggest risk linked to health technologies for 2026. The report cites AI chatbots confidently suggesting misdiagnoses, inventing body parts, recommending medical products or procedures that might be dangerous, advising unnecessary tests or treatments, and reinforcing biases or stereotypes that can worsen health disparities. Studies have also demonstrated how chatbots can ethical errors when used as therapists.

Still, most doctors now use chatbots in one way or another, such as to transcribe medical records or review test results, says Scott Lucas, ECRI’s vice president for device security. OpenAI announced ChatGPT for Healthcare and Anthropic launched Claude for Healthcare in January. ChatGPT already answers over 40 million healthcare questions daily.

And it’s no wonder people are turning to chatbots for medical assistance, Lucas says. “They can access billions of data points and aggregate the data and present it in an understandable, credible, compelling format that can give you precise advice on almost exactly the question you were asking and do it with confidence.” But “commercial LLMs are not ready for prime-time clinical use. Relying on LLM results alone is not safe.”

Eventually, AI models and users could become sophisticated enough to close the communications gap highlighted by Mahdi’s study, Lucas says.

The study confirms concerns about the safety and reliability of LLMs in patient care, which the machine learning community has long discussed, says Michelle Li, a medical AI researcher at Harvard Medical School. This study and others have illustrated weakness of AI in real medical settingsshe said. Li and colleagues published a study Feb. 3 in Natural medicine suggesting possible improvements in the training, testing and implementation of AI models – changes that could make them more reliable in various medical contexts.

Mahdi plans to conduct additional studies on AI interactions in other languages ​​and over time. The results could help AI developers design stronger models that allow users to get accurate answers, he says.

“The first step is to solve the measurement problem,” says Mahdi. “We haven’t measured what matters,” which is how well AI works for real people.

Related

Tags: artificial intelligence
Julie Bort

Julie Bort

Stay Connected

  • 99 Subscribers
  • Trending
  • Comments
  • Latest
european-markets-in-mixed-territory-after-a-positive-start

European markets in mixed territory after a positive start

January 26, 2026
nascar-driver-denny-hamlin-breaks-silence-after-father-dies-in-house-fire

NASCAR driver Denny Hamlin breaks silence after father dies in house fire

December 31, 2025
tcl-lost-a-lawsuit-claiming-its-qled-tvs-are-not

TCL lost a lawsuit claiming its QLED TVs are not

March 13, 2026
fivio-foreign-checks-himself-into-a-$10,000-rehab-center-to-get-his-mind-straight

Fivio Foreign checks himself into a $10,000 rehab center to get his mind straight

December 31, 2025
hansmaker-presents-the-d1-ultra:-a-dual-laser-engraver-designed-for-each-material-–-techenger

Hansmaker presents the D1 Ultra: a dual laser engraver designed for each material – Techenger

0
nascar-driver-denny-hamlin-breaks-silence-after-father-dies-in-house-fire

NASCAR driver Denny Hamlin breaks silence after father dies in house fire

0
fivio-foreign-checks-himself-into-a-$10,000-rehab-center-to-get-his-mind-straight

Fivio Foreign checks himself into a $10,000 rehab center to get his mind straight

0
david-beckham-leaves-brooklyn-for-his-2025-instagram-tribute-amid-family-feud

David Beckham leaves Brooklyn for his 2025 Instagram tribute amid family feud

0
the-feeling-we-forgot…-until-this-trip-brought-it-back-–-goats-on-the-road

The feeling we forgot… until this trip brought it back – Goats On The Road

April 3, 2026
marshals:-a-yellowstone-story-episode-6-release-date-and-time-–-when-will-it-launch-on-cbs-and-paramount+?

Marshals: A Yellowstone Story Episode 6 release date and time – when will it launch on CBS and Paramount+?

April 3, 2026
hp’s-latest-z8-fury-g6i-breaks-the-boundaries-of-traditional-workstations

HP’s latest Z8 Fury G6i breaks the boundaries of traditional workstations

April 3, 2026

Answers to today’s NYT mini crossword for April 3 – CNET

April 3, 2026

Recent News

the-feeling-we-forgot…-until-this-trip-brought-it-back-–-goats-on-the-road

The feeling we forgot… until this trip brought it back – Goats On The Road

April 3, 2026
marshals:-a-yellowstone-story-episode-6-release-date-and-time-–-when-will-it-launch-on-cbs-and-paramount+?

Marshals: A Yellowstone Story Episode 6 release date and time – when will it launch on CBS and Paramount+?

April 3, 2026
hp’s-latest-z8-fury-g6i-breaks-the-boundaries-of-traditional-workstations

HP’s latest Z8 Fury G6i breaks the boundaries of traditional workstations

April 3, 2026

Answers to today’s NYT mini crossword for April 3 – CNET

April 3, 2026
Vidianews

Trusted news coverage delivering accurate reporting, breaking headlines, and insightful analysis on global events, business, politics, and tech.

Follow Us

Browse by Category

  • Business
  • Entertainment
  • Faith
  • Gadget
  • Gaming
  • General
  • Health
  • Lifestyle
  • Movie
  • News
  • Politics
  • Review
  • Science
  • Sports
  • Startup
  • Tech
  • Travel
  • World

Recent News

the-feeling-we-forgot…-until-this-trip-brought-it-back-–-goats-on-the-road

The feeling we forgot… until this trip brought it back – Goats On The Road

April 3, 2026
marshals:-a-yellowstone-story-episode-6-release-date-and-time-–-when-will-it-launch-on-cbs-and-paramount+?

Marshals: A Yellowstone Story Episode 6 release date and time – when will it launch on CBS and Paramount+?

April 3, 2026
  • About
  • Advertise
  • Privacy & Policy
  • Contact

© © Copyrights 2026 Vidianews. All Rights Reserved. Designed by Vidianews

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In

Add New Playlist

No Result
View All Result

© © Copyrights 2026 Vidianews. All Rights Reserved. Designed by Vidianews

Are you sure want to unlock this post?
Unlock left : 0
Are you sure want to cancel subscription?
Go to mobile version