Events Calendar

Mon
Tue
Wed
Thu
Fri
Sat
Sun
M
T
W
T
F
S
S
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
19
11:00 AM - Charmalot 2025
20
21
22
23
24
25
26
27
29
30
1
2
3
4
5
AI Leadership Strategy Summit
2025-09-18 - 2025-09-19    
12:00 am
AI is reshaping healthcare, but for executive leaders, adoption is only part of the equation. Success also requires making informed investments, establishing strong governance, and [...]
Charmalot 2025
2025-09-19 - 2025-09-21    
11:00 am
This is the CharmHealth annual user conference which also includes the CharmHealth Innovation Challenge. We enjoyed the event last year and we’re excited to be [...]
Civitas 2025 Annual Conference
2025-09-28 - 2025-09-30    
8:00 am
Civitas’ Annual Conference gathers hundreds of dedicated industry leaders, decision-makers, implementers, and innovators to explore key topics such as interoperability, data-driven quality improvement, social determinants [...]
Pathology Visions 2025
2025-10-05 - 2025-10-07    
8:00 am - 5:00 pm
Elevate Patient Care: Discover the Power of DP & AI Pathology Visions unites 800+ digital pathology experts and peers tackling today's challenges and shaping tomorrow's [...]
Events on 2025-09-18
Events on 2025-09-19
Charmalot 2025
19 Sep 25
CA
Events on 2025-09-28
Civitas 2025 Annual Conference
28 Sep 25
California
Events on 2025-10-05

Events

Articles

Large models identify social determinants in records

Social determinants of health (SDoH) significantly influence patient outcomes, yet their documentation is frequently incomplete or absent in the structured data of electronic health records (EHRs). The utilization of large language models (LLMs) holds promise in efficiently extracting SDoH from EHRs, contributing to both research and clinical care. However, challenges such as class imbalance and data limitations arise when handling this sparsely documented yet vital information.

In our investigation, we explored effective approaches to leverage LLMs for extracting six distinct SDoH categories from narrative EHR text. The standout performers included the fine-tuned Flan-T5 XL, achieving a macro-F1 of 0.71 for any SDoH mentions, and Flan-T5 XXL, attaining a macro-F1 of 0.70 for adverse SDoH mentions. The incorporation of LLM-generated synthetic data during training had varying effects across models and architectures but notably improved the performance of smaller Flan-T5 models (delta F1 + 0.12 to +0.23).

Our best-fine-tuned models outperformed zero- and few-shot performance of ChatGPT-family models in their respective settings, except for GPT4 with 10-shot prompting for adverse SDoH. These fine-tuned models exhibited a reduced likelihood of changing predictions when race/ethnicity and gender descriptors were introduced to the text, indicating diminished algorithmic bias (p < 0.05). Notably, our models identified 93.8% of patients with adverse SDoH, a significant improvement compared to the mere 2.0% captured by ICD-10 codes. These results highlight the potential of LLMs in enhancing real-world evidence related to SDoH and in identifying patients who could benefit from additional resource support.