Events Calendar

Mon
Tue
Wed
Thu
Fri
Sat
Sun
M
T
W
T
F
S
S
1
2
5
6
8
11
12
13
14
15
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
1
2
3
4
Forbes Healthcare Summit
2014-12-03    
All Day
Forbes Healthcare Summit: Smart Data Transforming Lives How big will the data get? This year we may collect more data about the human body than [...]
Customer Analytics & Engagement in Health Insurance
2014-12-04 - 2014-12-05    
All Day
Using Data Analytics, Product Experience & Innovation to Build a Profitable Customer-Centric Strategy Takeaway business ROI: Drive business value with customer analytics: learn what every business [...]
mHealth Summit
DECEMBER 7-11, 2014 The mHealth Summit, the largest event of its kind, convenes a diverse international delegation to explore the limits of mobile and connected [...]
The 26th Annual IHI National Forum
Overview ​2014 marks the 26th anniversary of an event that has shaped the course of health care quality in profound, enduring ways — the Annual [...]
Why A Risk Assessment is NOT Enough
2014-12-09    
2:00 pm - 3:30 pm
A common misconception is that  “A risk assessment makes me HIPAA compliant” Sadly this thought can cost your practice more than taking no action at [...]
iHT2 Health IT Summit
2014-12-10 - 2014-12-11    
All Day
Each year, the Institute hosts a series of events & programs which promote improvements in the quality, safety, and efficiency of health care through information technology [...]
Design a premium health insurance plan that engages customers, retains subscribers and understands behaviors
2014-12-16    
11:30 am - 12:30 pm
Wed, Dec 17, 2014 1:00 AM - 2:00 AM IST Join our webinar with John Mills - UPMC, Tim Gilchrist - Columbia University HITLAP, and [...]
Events on 2014-12-03
Forbes Healthcare Summit
3 Dec 14
New York City
Events on 2014-12-04
Events on 2014-12-07
mHealth Summit
7 Dec 14
Washington
Events on 2014-12-09
Events on 2014-12-10
iHT2 Health IT Summit
10 Dec 14
Houston
Articles

Large models identify social determinants in records

Social determinants of health (SDoH) significantly influence patient outcomes, yet their documentation is frequently incomplete or absent in the structured data of electronic health records (EHRs). The utilization of large language models (LLMs) holds promise in efficiently extracting SDoH from EHRs, contributing to both research and clinical care. However, challenges such as class imbalance and data limitations arise when handling this sparsely documented yet vital information.

In our investigation, we explored effective approaches to leverage LLMs for extracting six distinct SDoH categories from narrative EHR text. The standout performers included the fine-tuned Flan-T5 XL, achieving a macro-F1 of 0.71 for any SDoH mentions, and Flan-T5 XXL, attaining a macro-F1 of 0.70 for adverse SDoH mentions. The incorporation of LLM-generated synthetic data during training had varying effects across models and architectures but notably improved the performance of smaller Flan-T5 models (delta F1 + 0.12 to +0.23).

Our best-fine-tuned models outperformed zero- and few-shot performance of ChatGPT-family models in their respective settings, except for GPT4 with 10-shot prompting for adverse SDoH. These fine-tuned models exhibited a reduced likelihood of changing predictions when race/ethnicity and gender descriptors were introduced to the text, indicating diminished algorithmic bias (p < 0.05). Notably, our models identified 93.8% of patients with adverse SDoH, a significant improvement compared to the mere 2.0% captured by ICD-10 codes. These results highlight the potential of LLMs in enhancing real-world evidence related to SDoH and in identifying patients who could benefit from additional resource support.