Scientific Reports (May 2021)
Machine learning based models for prediction of subtype diagnosis of primary aldosteronism using blood test
Abstract
Abstract Primary aldosteronism (PA) is associated with an increased risk of cardiometabolic diseases, especially in unilateral subtype. Despite its high prevalence, the case detection rate of PA is limited, partly because of no clinical models available in general practice to identify patients highly suspicious of unilateral subtype of PA, who should be referred to specialized centers. The aim of this retrospective cross-sectional study was to develop a predictive model for subtype diagnosis of PA based on machine learning methods using clinical data available in general practice. Overall, 91 patients with unilateral and 138 patients with bilateral PA were randomly assigned to the training and test cohorts. Four supervised machine learning classifiers; logistic regression, support vector machines, random forests (RF), and gradient boosting decision trees, were used to develop predictive models from 21 clinical variables. The accuracy and the area under the receiver operating characteristic curve (AUC) for predicting of subtype diagnosis of PA in the test cohort were compared among the optimized classifiers. Of the four classifiers, the accuracy and AUC were highest in RF, with 95.7% and 0.990, respectively. Serum potassium, plasma aldosterone, and serum sodium levels were highlighted as important variables in this model. For feature-selected RF with the three variables, the accuracy and AUC were 89.1% and 0.950, respectively. With an independent external PA cohort, we confirmed a similar accuracy for feature-selected RF (accuracy: 85.1%). Machine learning models developed using blood test can help predict subtype diagnosis of PA in general practice.