Scientific Reports (Jun 2024)

Performance evaluation of different regression models: application in a breast cancer patient data

  • Mona Mahmoud Abo El Nasr,
  • Alaa A. Abdelmegaly,
  • Doaa A. Abdo

DOI
https://doi.org/10.1038/s41598-024-62627-6
Journal volume & issue
Vol. 14, no. 1
pp. 1 – 15

Abstract

This paper provides a comprehensive analysis of linear regression models, focusing on multicollinearity in breast cancer patient data. Five regression approaches (GAM, Beta, GAM Beta, Ridge, and Beta Ridge) are compared using two statistical criteria, the AIC and the BIC. In the study, conducted with R software, the Beta regression model performs strongly, achieving a BIC of −5520.416, while the Ridge regression model attains the best AIC at −8002.647. The findings underscore the practical application of these models in real-world scenarios and emphasize the Beta regression model’s superior ability to handle multicollinearity. The preference for AIC over BIC in Generalized Additive Models (GAMs) is rooted in how the AIC is calculated, which makes it effective at capturing the complexity and flexibility inherent in GAMs.
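To make the two criteria concrete, the sketch below computes them from their standard definitions, AIC = 2k − 2 ln L and BIC = k ln(n) − 2 ln L, where k is the number of estimated parameters, n the number of observations, and ln L the maximized log-likelihood. The model names, log-likelihoods, parameter counts, and sample size are hypothetical values chosen for illustration and are not taken from the paper, whose analysis was run in R. Negative scores like those reported in the abstract arise when the log-likelihood is positive, which can happen for continuous densities.

```python
import math


def aic(log_likelihood: float, n_params: int) -> float:
    """Akaike information criterion: 2k - 2 ln L (lower is better)."""
    return 2 * n_params - 2 * log_likelihood


def bic(log_likelihood: float, n_params: int, n_obs: int) -> float:
    """Bayesian information criterion: k ln(n) - 2 ln L (lower is better)."""
    return n_params * math.log(n_obs) - 2 * log_likelihood


# Hypothetical fits: (maximized log-likelihood, number of parameters).
# These numbers are illustrative assumptions, not results from the paper.
candidates = {
    "simple": (1000.0, 5),
    "flexible": (1010.0, 12),
}
n_obs = 500  # assumed sample size

scores = {
    name: (aic(ll, k), bic(ll, k, n_obs))
    for name, (ll, k) in candidates.items()
}

best_by_aic = min(scores, key=lambda name: scores[name][0])
best_by_bic = min(scores, key=lambda name: scores[name][1])

for name, (a, b) in scores.items():
    print(f"{name}: AIC={a:.3f}, BIC={b:.3f}")
print("Best by AIC:", best_by_aic)
print("Best by BIC:", best_by_bic)
```

With these assumed inputs the two criteria disagree: the AIC selects the more flexible model, while the BIC selects the simpler one. The BIC charges ln(n) per parameter instead of 2, so it penalizes extra parameters more heavily whenever n exceeds about 7. That difference is one reason the AIC tends to be more lenient toward flexible models such as GAMs.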