Events Calendar

Mon
Tue
Wed
Thu
Fri
Sat
Sun
M
T
W
T
F
S
S
1
2
5
6
8
11
12
13
14
15
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
1
2
3
4
Forbes Healthcare Summit
2014-12-03    
All Day
Forbes Healthcare Summit: Smart Data Transforming Lives How big will the data get? This year we may collect more data about the human body than [...]
Customer Analytics & Engagement in Health Insurance
2014-12-04 - 2014-12-05    
All Day
Using Data Analytics, Product Experience & Innovation to Build a Profitable Customer-Centric Strategy Takeaway business ROI: Drive business value with customer analytics: learn what every business [...]
mHealth Summit
DECEMBER 7-11, 2014 The mHealth Summit, the largest event of its kind, convenes a diverse international delegation to explore the limits of mobile and connected [...]
The 26th Annual IHI National Forum
Overview ​2014 marks the 26th anniversary of an event that has shaped the course of health care quality in profound, enduring ways — the Annual [...]
Why A Risk Assessment is NOT Enough
2014-12-09    
2:00 pm - 3:30 pm
A common misconception is that  “A risk assessment makes me HIPAA compliant” Sadly this thought can cost your practice more than taking no action at [...]
iHT2 Health IT Summit
2014-12-10 - 2014-12-11    
All Day
Each year, the Institute hosts a series of events & programs which promote improvements in the quality, safety, and efficiency of health care through information technology [...]
Design a premium health insurance plan that engages customers, retains subscribers and understands behaviors
2014-12-16    
11:30 am - 12:30 pm
Wed, Dec 17, 2014 1:00 AM - 2:00 AM IST Join our webinar with John Mills - UPMC, Tim Gilchrist - Columbia University HITLAP, and [...]
Events on 2014-12-03
Forbes Healthcare Summit
3 Dec 14
New York City
Events on 2014-12-04
Events on 2014-12-07
mHealth Summit
7 Dec 14
Washington
Events on 2014-12-09
Events on 2014-12-10
iHT2 Health IT Summit
10 Dec 14
Houston
Articles

Cluster analysis, EHRs visualize, detect rare genetic

The study utilized a dataset comprising deidentified structured medical records from approximately 1.28 million patients across three healthcare institutions under the Singapore Health Services (SingHealth) cluster. This dataset covered a 3-year period from January 1, 2018, to March 1, 2022, and included the National Heart Centre Singapore, KK Women’s and Children’s Hospital, and Singapore General Hospital. The research adhered to relevant guidelines and regulations, receiving approval from the SingHealth Data Governance committee, with the SingHealth Centralised Institutional Review Board waiving the need for informed consent.

Data extraction involved collecting information from diverse sources within the SingHealth Database, such as laboratory results, radiology reports, pathology records, diagnoses, and detailed patient information. To mitigate privacy risks, only structured data was extracted initially, excluding free-text fields. Sensitive data fields were pseudonymized based on the “SingHealth Policy for Data Anonymisation” through a trusted third party. The pseudonymized data were then transferred to the Office of Insights and Analytics High-Performance Computer Lab, ensuring strict security measures to restrict access to authorized personnel only.

Post-deidentification, the structured data underwent normalization and standardization using the Population Builder tool, a third-party platform. Value sets in Population Builder facilitated grouping codes related to the same disease/phenotype, streamlining the filtering process. Two rare diseases, Fabry Disease and Familial Hypercholesterolemia (FH), were selected for the pilot project due to well-defined diagnostic criteria and extractable data from health records.

The diagnostic criteria for Fabry Disease and FH were outlined, and value sets were created to identify patients with known diagnoses. Data wrangling involved specific metrics examination for each patient cohort, retrieving relevant data using SQL queries, and subsequent manipulation in RStudio for analysis.

Data analysis encompassed visualization and statistical testing. The tidyverse and lubridate R packages were employed for visualizing demographic data through pie charts, scatterplots, boxplots, bar graphs, and a Venn diagram. Statistical testing involved a two-sample t-test to assess the difference in mean LDL-C levels between FH True Positives (TP) and suspects.

In summary, the study employed rigorous methods for data extraction, deidentification, and analysis, aiming to identify undiagnosed patients with rare genetic diseases through cluster analysis and visualization of electronic health records data.