Among 108,714 patients, mean age of 81 years and predominant female (61%) population was observed. 21 features were included in the study using K-Means, Hierarchical and DBSCAN algorithm and 5, 6, 4 clusters were observed respectively. K-means with MCA gave most consistent subgroups based on distance measures cosine and Eigen values.
Hypertension was identified as prominent risk factor and was present in all the clusters. Cluster 1 (Mild) had 49.5k patients with hypertension~85% and diabetes~34%. Cluster 2 (Severe) had 31.8k patients with hypertension~97%, diabetes~67%, heart failure~63%, coronary artery disease~78%, Kidney-disease~69%, atrial-fibliration~53%. Cluster 3 (Moderate) had 5k patients with hypertension~91%, diabetes~50%, fall~20%. Cluster 4 (Onset) had 6k patients with no significant comorbidities. Cluster 5 (Caution) had 16k patients with hypertension~ 92%, fall~85%, confusion~54%, depression~45%, memory loss~ 41%. Further, we leveraged clinical notes to identify the impact of presence of APOE alleles in different clusters.