Events Calendar

Mon
Tue
Wed
Thu
Fri
Sat
Sun
M
T
W
T
F
S
S
30
2
4
5
6
8
9
10
11
12
13
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
1
2
3
World Congress on Medical Toxicology
2020-12-01 - 2020-12-02    
12:00 am
World Congress on Medical Toxicology Medical Toxicology Pharma 2020 provides a global platform to meet and develop interpersonal relationship with the world’s leading toxicologists, pharmacologists, [...]
01 Dec
2020-12-01 - 2020-12-02    
All Day
International Conference on Food Technology & Beverages” at Kyoto, Japan in the course of Kyoto, Japan, December, 01-02, 2020 Theme of the Food Tech 2020 [...]
Biomedical, Bio Pharma and Clinical Research
2020-12-03 - 2020-12-04    
12:00 am
Biomedical, Bio Pharma and Clinical Research Conference Series LLC LTD cordially invites you to be a part of “2nd International Conference on Biomedical, Bio Pharma [...]
NODE Health 4th Annual Digital Medicine Conference
2020-12-07 - 2020-12-12    
12:00 am
NODE.Health is delighted to announce the 4th Annual Digital Medicine Conference - Evidence Matters. Never before has the transformation of our healthcare system been more [...]
2020 Global Digital Health Forum
2020-12-07 - 2020-12-09    
12:00 am
Organized by Global Digital Health Network Digital health can be the great leveler – it can give anyone access to information about health and disease. [...]
International Conference on Cancer Treatment and Prevention
2020-12-14 - 2020-12-15    
12:00 am
Cancer Treatment Forum 2020 regards each one of the individuals to go to the "Cancer Treatment Forum 2020" amidst December 15, 2020 UK-Time Zone( GMT [...]
International Conference on Neurology and Neural Disorders
2020-12-14 - 2020-12-15    
12:00 am
International Conference on Neurology and Neural Disorders Neurology Research 2020 will join world-class professors, scientists, researchers, students, perfusionist, neurologist to discuss methodology for ailment remediation [...]
Events on 2020-12-03
Articles

Cluster analysis, EHRs visualize, detect rare genetic

The study utilized a dataset comprising deidentified structured medical records from approximately 1.28 million patients across three healthcare institutions under the Singapore Health Services (SingHealth) cluster. This dataset covered a 3-year period from January 1, 2018, to March 1, 2022, and included the National Heart Centre Singapore, KK Women’s and Children’s Hospital, and Singapore General Hospital. The research adhered to relevant guidelines and regulations, receiving approval from the SingHealth Data Governance committee, with the SingHealth Centralised Institutional Review Board waiving the need for informed consent.

Data extraction involved collecting information from diverse sources within the SingHealth Database, such as laboratory results, radiology reports, pathology records, diagnoses, and detailed patient information. To mitigate privacy risks, only structured data was extracted initially, excluding free-text fields. Sensitive data fields were pseudonymized based on the “SingHealth Policy for Data Anonymisation” through a trusted third party. The pseudonymized data were then transferred to the Office of Insights and Analytics High-Performance Computer Lab, ensuring strict security measures to restrict access to authorized personnel only.

Post-deidentification, the structured data underwent normalization and standardization using the Population Builder tool, a third-party platform. Value sets in Population Builder facilitated grouping codes related to the same disease/phenotype, streamlining the filtering process. Two rare diseases, Fabry Disease and Familial Hypercholesterolemia (FH), were selected for the pilot project due to well-defined diagnostic criteria and extractable data from health records.

The diagnostic criteria for Fabry Disease and FH were outlined, and value sets were created to identify patients with known diagnoses. Data wrangling involved specific metrics examination for each patient cohort, retrieving relevant data using SQL queries, and subsequent manipulation in RStudio for analysis.

Data analysis encompassed visualization and statistical testing. The tidyverse and lubridate R packages were employed for visualizing demographic data through pie charts, scatterplots, boxplots, bar graphs, and a Venn diagram. Statistical testing involved a two-sample t-test to assess the difference in mean LDL-C levels between FH True Positives (TP) and suspects.

In summary, the study employed rigorous methods for data extraction, deidentification, and analysis, aiming to identify undiagnosed patients with rare genetic diseases through cluster analysis and visualization of electronic health records data.