Events Calendar

Mon
Tue
Wed
Thu
Fri
Sat
Sun
M
T
W
T
F
S
S
27
28
29
30
1
2
3
4
5
6
7
9
11
12
13
14
15
16
18
19
20
21
22
23
25
27
28
29
30
31
1
2
3
4
5
6
Pollution Control & Sustainable 2021
2021-04-26 - 2021-04-27    
All Day
Pollution Control 2021 conference is organizing with the theme of “Accelerating Innovations for Environmental Sustainability” Conference Series llc LTD organizes environmental conferences series 1000+ Global [...]
Food and Beverages
2021-05-05 - 2021-05-06    
All Day
Conference Series LLC Ltd Organizes 3000+Global Events inclusive of 600+ Conferences, 1200+ Workshops and 1200+ Symposiums every year across USA, Europe & Asia with support [...]
Dental Public Health and Dental Diseases
2021-05-08 - 2021-05-09    
All Day
Conference series LLC would like to take the immense pleasure to announce the “ International Conference on Dental Public Health and Dental Diseases” (Dental Public [...]
10 May
2021-05-10 - 2021-05-11    
All Day
Are you planning to start a new business?? Don't have any background?? Want some useful tips from the successful Entrepreneurs then come and participate in [...]
Climate Change and Ecosystem 2021
2021-05-17 - 2021-05-18    
All Day
Conference Series LLC Ltd in conjunction with its institutional partners and whereas Advisory board members are delighted to invite you all to the World Congress [...]
Machine Learning and Deep learning 2021
2021-05-24 - 2021-05-25    
All Day
Looking for a moment to learn something new and need a short break for professional life. Both are possible by attending the Machine Learning 2021 [...]
Artificial Intelligence and Neural Networks
2021-05-24 - 2021-05-25    
All Day
The year 2020 hasn’t turned out the way people expected, we all aware of Covid-19 pandemic. As countries around the world started to open its [...]
Asia Pacific Entrepreneurship Congress
2021-05-26 - 2021-05-27    
All Day
We welcome all the Business Tycoons, Women Entrepreneurs, and enthusiastic youth, Academic Entrepreneurs, Small-scale Industrial People to come and participate in our conference and take [...]
Events on 2021-04-26
Events on 2021-05-05
Events on 2021-05-08
Events on 2021-05-10
Events on 2021-05-17
Events on 2021-05-26
Articles

Cluster analysis, EHRs visualize, detect rare genetic

The study utilized a dataset comprising deidentified structured medical records from approximately 1.28 million patients across three healthcare institutions under the Singapore Health Services (SingHealth) cluster. This dataset covered a 3-year period from January 1, 2018, to March 1, 2022, and included the National Heart Centre Singapore, KK Women’s and Children’s Hospital, and Singapore General Hospital. The research adhered to relevant guidelines and regulations, receiving approval from the SingHealth Data Governance committee, with the SingHealth Centralised Institutional Review Board waiving the need for informed consent.

Data extraction involved collecting information from diverse sources within the SingHealth Database, such as laboratory results, radiology reports, pathology records, diagnoses, and detailed patient information. To mitigate privacy risks, only structured data was extracted initially, excluding free-text fields. Sensitive data fields were pseudonymized based on the “SingHealth Policy for Data Anonymisation” through a trusted third party. The pseudonymized data were then transferred to the Office of Insights and Analytics High-Performance Computer Lab, ensuring strict security measures to restrict access to authorized personnel only.

Post-deidentification, the structured data underwent normalization and standardization using the Population Builder tool, a third-party platform. Value sets in Population Builder facilitated grouping codes related to the same disease/phenotype, streamlining the filtering process. Two rare diseases, Fabry Disease and Familial Hypercholesterolemia (FH), were selected for the pilot project due to well-defined diagnostic criteria and extractable data from health records.

The diagnostic criteria for Fabry Disease and FH were outlined, and value sets were created to identify patients with known diagnoses. Data wrangling involved specific metrics examination for each patient cohort, retrieving relevant data using SQL queries, and subsequent manipulation in RStudio for analysis.

Data analysis encompassed visualization and statistical testing. The tidyverse and lubridate R packages were employed for visualizing demographic data through pie charts, scatterplots, boxplots, bar graphs, and a Venn diagram. Statistical testing involved a two-sample t-test to assess the difference in mean LDL-C levels between FH True Positives (TP) and suspects.

In summary, the study employed rigorous methods for data extraction, deidentification, and analysis, aiming to identify undiagnosed patients with rare genetic diseases through cluster analysis and visualization of electronic health records data.