Is clinical note data reliably extracted by AI? Research indicates not.

Is it possible for generative artificial intelligence (AI) to scan clinical notes and accurately and quickly extract pertinent data to aid in patient care or scientific research?

According to a recent study from the Mailman School of Public Health at Columbia University, not yet.

The study, conducted from 2019 to 2022, involved 54,569 ER visits by patients hurt while riding a bicycle, scooter, or other micromobility conveyance. Researchers used ChatGPT-4 to read the medical records and determine whether the injured riders had been wearing helmets.

When its results were compared with those of a text string-search-based approach, the large language model (LLM) performed well only when the prompt contained all of the text strings used in that search. It also struggled to replicate its work across trials run on five consecutive days, and it was better at reproducing its hallucinations than its accurate work. It had the most trouble with negated phrases, reading “w/o helmet” or “unhelmeted” and reporting that the patient wore a helmet.
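A plain string search has to handle the same negation problem explicitly. The following is a minimal sketch of how that might look, not the researchers' method: the search strings, the `helmet_status` function, and the three labels are all assumptions made for illustration, since the study's actual strings are not given here.

```python
import re

# Hypothetical negated phrases, for illustration only; the study's real
# search strings are not listed in this article.
NEGATED = re.compile(
    r"\bunhelmeted\b"
    r"|\bw/o\s+(?:a\s+)?helmet"
    r"|\bwithout\s+(?:a\s+)?helmet"
    r"|\bno\s+helmet"
    r"|\bnot\s+wearing\s+(?:a\s+)?helmet",
    re.IGNORECASE,
)

# Any remaining mention of "helmet" or "helmeted" is treated as helmet use.
HELMET = re.compile(r"\bhelmet(?:ed)?\b", re.IGNORECASE)


def helmet_status(note: str) -> str:
    """Label a clinical note as 'no helmet', 'helmet', or 'unknown'."""
    # Negated phrases are checked first so that "w/o helmet" is not
    # mistaken for evidence that a helmet was worn.
    if NEGATED.search(note):
        return "no helmet"
    if HELMET.search(note):
        return "helmet"
    return "unknown"


print(helmet_status("Pt fell off scooter, w/o helmet"))  # no helmet
print(helmet_status("Rider was wearing a helmet"))       # helmet
```

Checking the negated phrases before the plain match is the design choice that matters here: reversing the order would reproduce the error described above, where “unhelmeted” is reported as helmet use.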

A significant amount of the medically relevant data kept in electronic medical records takes the form of written clinical notes, a type of unstructured data. Research would greatly benefit from efficient ways to read these notes and extract information from them.

At the moment, data from these clinical notes can be retrieved with simple string-matching text searches or with more sophisticated artificial intelligence (AI)-based techniques such as natural language processing. The hope was that newer LLMs, like ChatGPT-4, would extract the data more quickly and reliably.

“While we see potential efficiency gains in using the generative AI LLM for information extraction tasks, issues of reliability and hallucinations currently limit its utility,” said Andrew Rundle, DrPH, professor of epidemiology at Columbia Mailman School and the study’s senior author.

“There were days when ChatGPT-4 was able to extract precise data from the clinical notes, when we used extremely specific prompts that contained every text string pertaining to helmets,” Rundle added. “However, the amount of time needed to define and test every string that had to be in the prompt, and ChatGPT-4’s inability to consistently reproduce its work, suggest that ChatGPT-4 is not yet ready for this task.”
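One simple way to quantify that kind of day-to-day reproducibility is the share of notes that receive the same label in every trial. The sketch below assumes that measure; the `replication_rate` function and the labels are invented for illustration and are neither the study's metric nor its data.

```python
from typing import Sequence


def replication_rate(trials: Sequence[Sequence[str]]) -> float:
    """Return the fraction of notes given the same label in every trial."""
    if not trials or not trials[0]:
        raise ValueError("need at least one trial with at least one note")
    n_notes = len(trials[0])
    if any(len(trial) != n_notes for trial in trials):
        raise ValueError("every trial must label the same set of notes")
    # zip(*trials) groups the labels each note received across all trials.
    stable = sum(1 for labels in zip(*trials) if len(set(labels)) == 1)
    return stable / n_notes


# Invented labels for four notes over five daily trials (not study data).
trials = [
    ["helmet", "no helmet", "unknown", "helmet"],
    ["helmet", "helmet", "unknown", "helmet"],
    ["helmet", "no helmet", "unknown", "helmet"],
    ["helmet", "no helmet", "unknown", "helmet"],
    ["helmet", "helmet", "unknown", "helmet"],
]
print(replication_rate(trials))  # 0.75
```

Stability is not the same as accuracy. A note that is mislabeled identically on all five days still counts as replicated, so agreement with a reference method would have to be checked separately.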

The latest study builds on the researchers’ earlier work on injury prevention strategies for micromobility users, such as scooter, e-bike, and bicycle riders.

“Although wearing a helmet reduces the severity of an injury, information about helmet use is typically hidden in the clinical notes that the attending physician or emergency medical services provider writes in emergency department medical records and incident reports,” said Kathryn Burford, the lead author of the work and a postdoctoral associate in the Mailman School’s Department of Epidemiology. She added that there is a great need for researchers to be able to access this information rapidly and reliably.

“Our study investigated the potential of an LLM for information extraction from clinical notes, a rich source of information for health professionals and researchers,” Rundle said. “However, when we used ChatGPT-4, it was unable to provide us with data in a dependable manner.”

The study’s results have been published in JAMA Network Open. Nicole G. Itzkowitz from Columbia Mailman School of Public Health, Ashley G. Ortega from Columbia Population Research Center, and Julien O. Teitler from Columbia School of Social Work are co-authors of this work.