Events Calendar

Mon
Tue
Wed
Thu
Fri
Sat
Sun
M
T
W
T
F
S
S
27
28
29
30
31
1
2
3
4
5
6
7
8
9
10
11
12
14
15
16
17
18
19
20
21
23
24
25
26
28
29
San Jose Health IT Summit
2017-04-13 - 2017-04-14    
All Day
About Health IT Summits U.S. healthcare is at an inflection point right now, as policy mandates and internal healthcare system reform begin to take hold, [...]
Annual IHI Summit
2017-04-20 - 2017-04-22    
All Day
The Office Practice & Community Improvement Conference ​​​​​​The 18th Annual Summit on Improving Patient Care in the Office Practice and the Community taking place April 20–22, 2017, in Orlando, FL, brings together 1,000 health improvers from around the globe, in [...]
Stanford Medicine X | ED
2017-04-22 - 2017-04-23    
All Day
Stanford Medicine X | ED is a conference on the future of medical education at the intersections of people, technology and design. As an Everyone [...]
2017 Health Datapalooza
2017-04-27 - 2017-04-28    
All Day
Health Datapalooza brings together a diverse audience of over 1,600 people from the public and private sectors to learn how health and health care can [...]
The 14th Annual World Health Care Congress
2017-04-30 - 2017-05-03    
All Day
The 14th Annual World Health Care Congress April 30 - May 3, 2017 • Washington, DC • The Marriott Wardman Park Hotel Connecting and Preparing [...]
Events on 2017-04-13
San Jose Health IT Summit
13 Apr 17
San Jose
Events on 2017-04-20
Annual IHI Summit
20 Apr 17
Orlando
Events on 2017-04-22
Events on 2017-04-27
2017 Health Datapalooza
27 Apr 17
Washington, D.C
Events on 2017-04-30
Articles

Large models identify social determinants in records

Social determinants of health (SDoH) significantly influence patient outcomes, yet their documentation is frequently incomplete or absent in the structured data of electronic health records (EHRs). The utilization of large language models (LLMs) holds promise in efficiently extracting SDoH from EHRs, contributing to both research and clinical care. However, challenges such as class imbalance and data limitations arise when handling this sparsely documented yet vital information.

In our investigation, we explored effective approaches to leverage LLMs for extracting six distinct SDoH categories from narrative EHR text. The standout performers included the fine-tuned Flan-T5 XL, achieving a macro-F1 of 0.71 for any SDoH mentions, and Flan-T5 XXL, attaining a macro-F1 of 0.70 for adverse SDoH mentions. The incorporation of LLM-generated synthetic data during training had varying effects across models and architectures but notably improved the performance of smaller Flan-T5 models (delta F1 + 0.12 to +0.23).

Our best-fine-tuned models outperformed zero- and few-shot performance of ChatGPT-family models in their respective settings, except for GPT4 with 10-shot prompting for adverse SDoH. These fine-tuned models exhibited a reduced likelihood of changing predictions when race/ethnicity and gender descriptors were introduced to the text, indicating diminished algorithmic bias (p < 0.05). Notably, our models identified 93.8% of patients with adverse SDoH, a significant improvement compared to the mere 2.0% captured by ICD-10 codes. These results highlight the potential of LLMs in enhancing real-world evidence related to SDoH and in identifying patients who could benefit from additional resource support.