Journal of Biostatistics and Epidemiology (Jul 2022)

Statistical Considerations in Combining Multiple Biomarkers for Diagnostic Classification

  • Karimollah Hajian-tilaki,
  • Zahra Geraili,
  • Vahid Nassiri

Journal volume & issue
Vol. 8, no. 2

Abstract

Read online

Introduction: In clinical practices, multiple biomarkers are frequently used on the same subjects for diagnosis of an adverse outcome. This study compares two alternative multiple linear regression approaches as the logistic regression model and the discriminant function score in combing several markers. Methods: Ten thousand simulated data sets were generated from binormal and non-binormal pairs of distributions with different sample sizes and correlation structures. Each dataset underwent a logistic regression and the discriminant analysis simultaneously. The ROC analysis was performed with each marker alone and also their combining scores. For two alternative approaches, the average of AUC and its root mean square error (RMSE) were estimated over 10000 replications trials for all configurations and sample sizes used. The practical utility of the two methods is further illustrated with a clinical example of real data as well. Results: The two approaches yielded identical accuracy in particular with binormal data. With non- binormal data, the logistic regression risk score produced an equal or a slightly better accuracy than the discriminate function score. Conclusion: Overall, the two approaches yield rather identical results. However, adopting the logistic regression model may incorporate slightly better accuracy index than discriminant analysis with non-binormal data.

Keywords