AI tools face challenges in real-world medical conversations, says study

HJ Suroy

Posted on January 9, 2025

BOSTON, United States — Artificial intelligence (AI) tools are increasingly being explored for use in healthcare, offering potential solutions to alleviate clinician workloads by triaging patients, taking medical histories, and even providing preliminary diagnoses.

However, a recent study led by researchers from Harvard Medical School (HMS) and Stanford University reveals that while these AI models excel in standardized medical tests, they struggle significantly in real-world medical conversations.

CRAFT-MD: A new benchmark for AI clinicians

Published in Nature Medicine, the study introduces CRAFT-MD (Conversational Reasoning Assessment Framework for Testing in Medicine), a novel evaluation framework designed to simulate real-world doctor-patient interactions. Unlike traditional multiple-choice tests, CRAFT-MD assesses how well large language models (LLMs) can gather patient information through open-ended conversations and provide accurate diagnoses.

Researchers tested four LLMs across 2,000 clinical scenarios spanning primary care and 12 specialties. While the models performed well on exam-style questions, their diagnostic accuracy declined sharply when engaging in dynamic, conversational exchanges.

“Our work reveals a striking paradox — while these AI models excel at medical board exams, they struggle with the basic back-and-forth of a doctor’s visit,” said Pranav Rajpurkar, senior author of the study.

The real-world gap in AI diagnostic skills

The study highlights several challenges faced by AI clinicians:

Difficulty asking relevant questions during patient history-taking
Failure to catch critical information scattered throughout conversations
Trouble synthesizing unstructured data into accurate diagnoses
Reduced performance in dynamic exchanges compared to structured formats

These limitations point to the need for more realistic training and evaluation methods before AI tools are deployed in clinical settings.

Strategies to improve AI’s clinical performance

To address these gaps, the researchers propose several strategies for optimizing AI tools:

Training models with open-ended, conversational datasets to reflect real-world interactions
Enhancing capabilities to extract key information from unstructured inputs
Developing systems that integrate textual data with non-textual inputs like images or lab results
Incorporating nonverbal cues such as tone and body language into AI design

CRAFT-MD itself relies on AI: it uses an AI agent to simulate patient interactions and evaluate diagnostic accuracy. This method processed thousands of conversations within hours while minimizing risks to real patients.

“As a physician-scientist, I am interested in AI models that can augment clinical practice effectively and ethically,” said co-senior author Roxana Daneshjou from Stanford University.

The study highlights the importance of aligning AI tools with the complexities of actual medical practice before widespread deployment. By addressing these challenges, researchers hope to pave the way for more reliable and effective AI applications in healthcare settings.
