
Using machine learning to transform the handling of missing data in EHRs

EMR Industry

Researchers from Peking University’s National Institute of Health Data Science and the Department of Clinical Epidemiology and Biostatistics at Peking University People’s Hospital have carried out a systematic review of methods for handling missing data in electronic health records (EHRs). The study, published in Health Data Science, finds that machine learning techniques are becoming increasingly important relative to conventional statistical methods for handling missing data.

Electronic health records have become a key component of contemporary healthcare research because they support the analysis of clinical trials, treatment effectiveness studies, and genetic association research. Missing data, however, remains a problem because it can introduce bias and compromise the validity of results. The review examined 46 research papers published from 2010 to 2024, systematically comparing machine learning techniques such as k-Nearest Neighbors (KNN) and Generative Adversarial Networks (GANs) with conventional statistical techniques such as Multiple Imputation by Chained Equations (MICE).
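For readers unfamiliar with imputation, the basic idea behind KNN can be sketched in a few lines of Python: a missing value is filled in with the average of that field in the most similar records. This is only an illustrative sketch using the standard library, not the Med.KNN method or any other method evaluated in the review. The function name `knn_impute`, the toy records, and the choice of distance are all assumptions made for the example.

```python
import math


def knn_impute(rows, k=2):
    """Return a copy of rows with None entries filled by the mean of that
    column in the k nearest rows (hypothetical helper, for illustration)."""

    def distance(a, b):
        # Compare only the fields both records observed, and average the
        # squared differences so different overlaps stay comparable.
        shared = [(x, y) for x, y in zip(a, b) if x is not None and y is not None]
        if not shared:
            return math.inf
        return math.sqrt(sum((x - y) ** 2 for x, y in shared) / len(shared))

    imputed = [list(row) for row in rows]
    for i, row in enumerate(rows):
        for j, value in enumerate(row):
            if value is not None:
                continue
            # Donors are other records that observed column j and share
            # at least one observed field with the current record.
            donors = []
            for m, other in enumerate(rows):
                if m == i or other[j] is None:
                    continue
                d = distance(row, other)
                if d != math.inf:
                    donors.append((d, other[j]))
            donors.sort(key=lambda pair: pair[0])
            nearest = donors[:k]
            if nearest:
                imputed[i][j] = sum(v for _, v in nearest) / len(nearest)
    return imputed


# Toy records with made-up values; the second record is missing its third field.
records = [
    [1.0, 2.0, 3.0],
    [1.1, 2.1, None],
    [5.0, 6.0, 9.0],
    [1.2, 1.9, 3.4],
]
filled = knn_impute(records, k=2)
print(filled[1])
```

Here the two records closest to the incomplete one are the first and the fourth, so the gap is filled with the mean of 3.0 and 3.4, roughly 3.2. Real EHR fields sit on very different scales, so a practical version would normally standardize each column before computing distances.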

The results show that machine learning techniques, in particular GAN-based methods and context-aware time-series imputation (CATSI), consistently performed better than conventional statistical approaches across both longitudinal and cross-sectional datasets. Med.KNN and CATSI performed better for longitudinal data, while probabilistic principal component analysis (PCA) and MICE performed better for cross-sectional datasets.

The potential of machine learning techniques to solve missing data in EHRs is substantial. The necessity for uniform benchmarking analyses across various datasets and missingness circumstances is highlighted by the fact that no single method provides a solution that is generally applicable.

Associate Professor Dr. Huixin Liu of Peking University People’s Hospital

The report also highlights several major issues: the opacity of machine learning models, the variability of EHR datasets, and the absence of common standards for evaluating how well a technique performs. Future work aims to create benchmarking datasets for thorough assessment and to standardize the process for managing missing EHR data.

“Our ultimate goal is to create a universally accepted protocol for handling missing data in electronic health records, ensuring more reliable and reproducible findings across medical research,” said Dr. Shenda Hong, an assistant professor at Peking University’s National Institute of Health Data Science.

By providing insights that can help researchers carry out robust analyses despite incomplete data, this research represents a significant step toward tackling one of the most critical challenges in digital healthcare research.