Events Calendar

Mon
Tue
Wed
Thu
Fri
Sat
Sun
M
T
W
T
F
S
S
29
1
2
3
4
6
7
8
9
10
11
13
14
15
16
17
7:30 AM - HLTH 2025
18
19
20
22
23
24
25
26
27
28
29
30
31
1
2
12:00 AM - NextGen UGM 2025
TigerConnect + eVideon Unite Healthcare Communications
2025-09-30    
10:00 am
TigerConnect’s acquisition of eVideon represents a significant step forward in our mission to unify healthcare communications. By combining smart room technology with advanced clinical collaboration [...]
Pathology Visions 2025
2025-10-05 - 2025-10-07    
8:00 am - 5:00 pm
Elevate Patient Care: Discover the Power of DP & AI Pathology Visions unites 800+ digital pathology experts and peers tackling today's challenges and shaping tomorrow's [...]
AHIMA25  Conference
2025-10-12 - 2025-10-14    
9:00 am - 10:00 pm
Register for AHIMA25  Conference Today! HI professionals—Minneapolis is calling! Join us October 12-14 for AHIMA25 Conference, the must-attend HI event of the year. In a city known for its booming [...]
HLTH 2025
2025-10-17 - 2025-10-22    
7:30 am - 12:00 pm
One of the top healthcare innovation events that brings together healthcare startups, investors, and other healthcare innovators. This is comparable to say an investor and [...]
Federal EHR Annual Summit
2025-10-21 - 2025-10-23    
9:00 am - 10:00 pm
The Federal Electronic Health Record Modernization (FEHRM) office brings together clinical staff from the Department of Defense, Department of Veterans Affairs, Department of Homeland Security’s [...]
NextGen UGM 2025
2025-11-02 - 2025-11-05    
12:00 am
NextGen UGM 2025 is set to take place in Nashville, TN, from November 2 to 5 at the Gaylord Opryland Resort & Convention Center. This [...]
Events on 2025-10-05
Events on 2025-10-12
AHIMA25  Conference
12 Oct 25
Minnesota
Events on 2025-10-17
HLTH 2025
17 Oct 25
Nevada
Events on 2025-10-21
Events on 2025-11-02
NextGen UGM 2025
2 Nov 25
TN

Events

Articles

Cluster analysis, EHRs visualize, detect rare genetic

The study utilized a dataset comprising deidentified structured medical records from approximately 1.28 million patients across three healthcare institutions under the Singapore Health Services (SingHealth) cluster. This dataset covered a 3-year period from January 1, 2018, to March 1, 2022, and included the National Heart Centre Singapore, KK Women’s and Children’s Hospital, and Singapore General Hospital. The research adhered to relevant guidelines and regulations, receiving approval from the SingHealth Data Governance committee, with the SingHealth Centralised Institutional Review Board waiving the need for informed consent.

Data extraction involved collecting information from diverse sources within the SingHealth Database, such as laboratory results, radiology reports, pathology records, diagnoses, and detailed patient information. To mitigate privacy risks, only structured data was extracted initially, excluding free-text fields. Sensitive data fields were pseudonymized based on the “SingHealth Policy for Data Anonymisation” through a trusted third party. The pseudonymized data were then transferred to the Office of Insights and Analytics High-Performance Computer Lab, ensuring strict security measures to restrict access to authorized personnel only.

Post-deidentification, the structured data underwent normalization and standardization using the Population Builder tool, a third-party platform. Value sets in Population Builder facilitated grouping codes related to the same disease/phenotype, streamlining the filtering process. Two rare diseases, Fabry Disease and Familial Hypercholesterolemia (FH), were selected for the pilot project due to well-defined diagnostic criteria and extractable data from health records.

The diagnostic criteria for Fabry Disease and FH were outlined, and value sets were created to identify patients with known diagnoses. Data wrangling involved specific metrics examination for each patient cohort, retrieving relevant data using SQL queries, and subsequent manipulation in RStudio for analysis.

Data analysis encompassed visualization and statistical testing. The tidyverse and lubridate R packages were employed for visualizing demographic data through pie charts, scatterplots, boxplots, bar graphs, and a Venn diagram. Statistical testing involved a two-sample t-test to assess the difference in mean LDL-C levels between FH True Positives (TP) and suspects.

In summary, the study employed rigorous methods for data extraction, deidentification, and analysis, aiming to identify undiagnosed patients with rare genetic diseases through cluster analysis and visualization of electronic health records data.