Events Calendar

Mon
Tue
Wed
Thu
Fri
Sat
Sun
M
T
W
T
F
S
S
1
2
3
4
5
6
7
8
10
11
12
13
14
15
16
19
11:00 AM - Charmalot 2025
20
21
22
23
24
25
26
27
29
1
2
3
4
5
Oracle Health and Life Sciences Summit 2025
2025-09-09 - 2025-09-11    
12:00 am
The largest gathering of Oracle Health (Formerly Cerner) users. It seems like Oracle Health has learned that it’s not enough for healthcare users to be [...]
MEDITECH Live 2025
2025-09-17 - 2025-09-19    
8:00 am - 4:30 pm
This is the MEDITECH user conference hosted at the amazing MEDITECH conference venue in Foxborough (just outside Boston). We’ll be covering all of the latest [...]
AI Leadership Strategy Summit
2025-09-18 - 2025-09-19    
12:00 am
AI is reshaping healthcare, but for executive leaders, adoption is only part of the equation. Success also requires making informed investments, establishing strong governance, and [...]
OMD Educates: Digital Health Conference 2025
2025-09-18 - 2025-09-19    
7:00 am - 5:00 pm
Why Attend? This is a one-of-a-kind opportunity to get tips from experts and colleagues on how to use your EMR and other innovative health technology [...]
Charmalot 2025
2025-09-19 - 2025-09-21    
11:00 am - 9:00 pm
This is the CharmHealth annual user conference which also includes the CharmHealth Innovation Challenge. We enjoyed the event last year and we’re excited to be [...]
Civitas 2025 Annual Conference
2025-09-28 - 2025-09-30    
8:00 am
Civitas Networks for Health 2025 Annual Conference: From Data to Doing Civitas’ Annual Conference convenes hundreds of industry leaders, decision-makers, and innovators to explore interoperability, [...]
TigerConnect + eVideon Unite Healthcare Communications
2025-09-30    
10:00 am
TigerConnect’s acquisition of eVideon represents a significant step forward in our mission to unify healthcare communications. By combining smart room technology with advanced clinical collaboration [...]
Pathology Visions 2025
2025-10-05 - 2025-10-07    
8:00 am - 5:00 pm
Elevate Patient Care: Discover the Power of DP & AI Pathology Visions unites 800+ digital pathology experts and peers tackling today's challenges and shaping tomorrow's [...]
Events on 2025-09-09
Events on 2025-09-17
MEDITECH Live 2025
17 Sep 25
MA
Events on 2025-09-18
OMD Educates: Digital Health Conference 2025
18 Sep 25
Toronto Congress Centre
Events on 2025-09-19
Charmalot 2025
19 Sep 25
CA
Events on 2025-09-28
Civitas 2025 Annual Conference
28 Sep 25
California
Events on 2025-10-05
Articles News

Using machine learning to transform the handling of missing data in EHRs

EMR Industry

A thorough systematic review assessing methods for dealing with missing data in electronic health records (EHRs) was carried out by researchers from Peking University’s National Institute of Health Data Science and Peking University People’s Hospital’s Department of Clinical Epidemiology and Biostatistics. The study, which was published in Health Data Science, emphasizes how machine learning techniques are becoming more and more crucial than conventional statistical methods for handling missing data situations.

Because they allow for analysis of clinical trials, treatment effectiveness studies, and genetic association research, electronic health records have emerged as a key component of contemporary healthcare research. Missing data, however, continues to be a problem since it can introduce bias and compromise the validity of results. This study examined 46 research papers from 2010 to 2024, methodically contrasting the effectiveness of contemporary machine learning techniques like k-Nearest Neighbors (KNN) and Generative Adversarial Networks (GANs) with more conventional statistical techniques like Multiple Imputation by Chained Equations (MICE).

The results show that while addressing both longitudinal and cross-sectional datasets, machine learning techniques—in particular, GAN-based methods and context-aware time-series imputation (CATSI)—consistently performed better than conventional statistical approaches. While probabilistic principle component analysis (PCA) and MICE performed better for cross-sectional datasets, Med.KNN and CATSI performed better for longitudinal data.

The potential of machine learning techniques to solve missing data in EHRs is substantial. The necessity for uniform benchmarking analyses across various datasets and missingness circumstances is highlighted by the fact that no single method provides a solution that is generally applicable.

Associate Professor Dr. Huixin Liu of Peking University People’s Hospital

The opacity of machine learning models, the variability of EHR datasets, and the absence of common standards for evaluating technique success are some of the major issues the report also highlights. Future studies seek to create benchmarking datasets for thorough assessment and standardize the process for managing missing EHR data.

According to Dr. Shenda Hong, an assistant professor at Peking University’s National Institute of Health Data Science, “our ultimate goal is to create a universally accepted protocol for handling missing data in electronic health records, ensuring more reliable and reproducible findings across medical research,” she added.

By providing insights that can aid in bridging the gap between robust analysis and data paucity, this research represents a big step toward tackling one of the most critical difficulties in digital healthcare research.