Journal of Allergy and Clinical Immunology: Global (Aug 2024)

Predictive genetic panel for adult asthma using machine learning methods

  • Luciano Gama da Silva Gomes, PhD,
  • Álvaro Augusto Souza da Cruz, MD, PhD,
  • Maria Borges Rabêlo de Santana, PhD,
  • Gabriela Pimentel Pinheiro, PhD,
  • Cinthia Vila Nova Santana, PhD,
  • Carolina Barbosa Souza Santos, PhD,
  • Meher Preethi Boorgula, MS,
  • Monica Campbell, MS,
  • Adelmir de Souza Machado, MD, PhD,
  • Rafael Valente Veiga, PhD,
  • Kathleen C. Barnes, PhD,
  • Ryan dos Santos Costa, PhD,
  • Camila Alexandrina Figueiredo, PhD

Journal volume & issue
Vol. 3, no. 3
p. 100282

Abstract

Read online

Background: Asthma is a chronic inflammatory disease of the airways that is heterogeneous and multifactorial, making its accurate characterization a complex process. Therefore, identifying the genetic variations associated with asthma and discovering the molecular interactions between the omics that confer risk of developing this disease will help us to unravel the biological pathways involved in its pathogenesis. Objective: We sought to develop a predictive genetic panel for asthma using machine learning methods. Methods: We tested 3 variable selection methods: Boruta’s algorithm, the top 200 genome-wide association study markers according to their respective P values, and an elastic net regression. Ten different algorithms were chosen for the classification tests. A predictive panel was built on the basis of joint scores between the classification algorithms. Results: Two variable selection methods, Boruta and genome-wide association studies, were statistically similar in terms of the average accuracies generated, whereas elastic net had the worst overall performance. The predictive genetic panel was completed with 155 single-nucleotide variants, with 91.18% accuracy, 92.75% sensitivity, and 89.55% specificity using the support vector machine algorithm. The markers used range from known single-nucleotide variants to those not previously described in the literature. Our study shows potential in creating genetic prediction panels with tailored penalties per marker, aiding in the identification of optimal machine learning methods for intricate results. Conclusions: This method is able to classify asthma and nonasthma effectively, proving its potential utility in clinical prediction and diagnosis.

Keywords