Events Calendar

Mon
Tue
Wed
Thu
Fri
Sat
Sun
M
T
W
T
F
S
S
29
1
2
3
4
6
7
8
9
10
11
13
14
15
16
17
7:30 AM - HLTH 2025
18
19
20
22
23
24
25
26
27
28
29
30
31
1
2
12:00 AM - NextGen UGM 2025
TigerConnect + eVideon Unite Healthcare Communications
2025-09-30    
10:00 am
TigerConnect’s acquisition of eVideon represents a significant step forward in our mission to unify healthcare communications. By combining smart room technology with advanced clinical collaboration [...]
Pathology Visions 2025
2025-10-05 - 2025-10-07    
8:00 am - 5:00 pm
Elevate Patient Care: Discover the Power of DP & AI Pathology Visions unites 800+ digital pathology experts and peers tackling today's challenges and shaping tomorrow's [...]
AHIMA25  Conference
2025-10-12 - 2025-10-14    
9:00 am - 10:00 pm
Register for AHIMA25  Conference Today! HI professionals—Minneapolis is calling! Join us October 12-14 for AHIMA25 Conference, the must-attend HI event of the year. In a city known for its booming [...]
HLTH 2025
2025-10-17 - 2025-10-22    
7:30 am - 12:00 pm
One of the top healthcare innovation events that brings together healthcare startups, investors, and other healthcare innovators. This is comparable to say an investor and [...]
Federal EHR Annual Summit
2025-10-21 - 2025-10-23    
9:00 am - 10:00 pm
The Federal Electronic Health Record Modernization (FEHRM) office brings together clinical staff from the Department of Defense, Department of Veterans Affairs, Department of Homeland Security’s [...]
NextGen UGM 2025
2025-11-02 - 2025-11-05    
12:00 am
NextGen UGM 2025 is set to take place in Nashville, TN, from November 2 to 5 at the Gaylord Opryland Resort & Convention Center. This [...]
Events on 2025-10-05
Events on 2025-10-12
AHIMA25  Conference
12 Oct 25
Minnesota
Events on 2025-10-17
HLTH 2025
17 Oct 25
Nevada
Events on 2025-10-21
Events on 2025-11-02
NextGen UGM 2025
2 Nov 25
TN
Articles News

Is clinical note data reliably extracted by AI? Research indicates not.

Is it possible for generative artificial intelligence (AI) to scan clinical notes and accurately and quickly extract pertinent data to aid in patient care or scientific research?

According to recent studies from the Mailman School of Public Health at Columbia University, not yet.

A study conducted from 2019 to 2022 that involved 54,569 ER visits from patients hurt while riding a bicycle, scooter, or other micromobility conveyance. Researchers utilized ChatGPT-4 to read medical records and ascertain whether or not injured riders of bicycles and scooters were wearing helmets.

Can generative artificial intelligence (AI) be used to swiftly and correctly extract relevant data from clinical notes to support scientific study or patient care?

Not yet, per recent research from Columbia University’s Mailman School of Public Health.

A study involving 54,569 ER visits from patients injured while riding a bicycle, scooter, or other micromobility conveyance was carried out between 2019 and 2022. Researchers reviewed medical records using ChatGPT-4 to determine whether or not injured scooter and bicycle users had on helmets.

According to reports, the text string-search-based approach’s LLM only functioned properly when the prompt contained all of the text. Additionally, over the course of five consecutive days, it struggled to replicate its work across trials; it performed better at simulating its hallucinations than its precise work. It had the most trouble reading negated phrases like “w/o helmet” or “unhelmeted” and reporting that the patient wore a helmet.

Written clinical notes, a form of unstructured data, contain a significant amount of medically relevant data that is kept in electronic medical records. Research would greatly benefit from effective methods for reading these notes and extracting information.

At the moment, data from these clinical notes can be retrieved by artificial intelligence (AI)-based techniques like natural language processing or more complex methods like string-matching text search methodology. New LLM, like ChatGPT-4, were supposed to be able to extract data more quickly and accurately.

Professor of epidemiology at Columbia Mailman School and senior author Andrew Rundle, DrPH, stated, “While we see potential efficiency gains in using the generative AI LLM for information extraction tasks, issues of reliability and hallucinations currently limit its utility.”

There were days when ChatGPT-4 was able to extract precise data from the clinical notes when we utilized extremely specific prompts that contained every text string pertaining to helmets. However, the amount of time needed to define and test every word that needed to be in the prompt and ChatGPT-4’s incapacity to consistently reproduce its work suggest that ChatGPT-4 was not yet ready for this assignment.

The most recent study expands on their earlier research on injury prevention strategies for micromobility users, such as scooter, e-bike, and bicycle riders.

“Although wearing a helmet reduces the severity of an injury, information about helmet use is typically hidden in the clinical notes that the attending physician or emergency medical services provider writes in emergency department medical records and incident reports. The lead author of the work and a post-doctoral associate in the Mailman School’s Department of Epidemiology, Kathryn Burford, stated that there is a great need for research to be able to rapidly and reliably access this information.

According to Rundle, “our study investigated the potential of an LLM for information extraction from clinical notes, a rich source of information for health professionals and researchers.”

However, ChatGPT-4 was unable to provide us with data in a dependable manner when we used it.

The study’s results have been published in JAMA Network Open. Nicole G. Itzkowitz from Columbia Mailman School of Public Health, Ashley G. Ortega from Columbia Population Research Center, and Julien O. Teitler from Columbia School of Social Work are co-authors of this work.