BioMedInformatics (Feb 2024)

Machine Learning Approach to Identify Case-Control Studies on ApoE Gene Mutations Linked to Alzheimer’s Disease in Italy

  • Giorgia Francesca Saraceno,
  • Diana Marisol Abrego-Guandique,
  • Roberto Cannataro,
  • Maria Cristina Caroleo,
  • Erika Cione

DOI
https://doi.org/10.3390/biomedinformatics4010033
Journal volume & issue
Vol. 4, no. 1
pp. 600 – 622

Abstract

Read online

Background: An application of artificial intelligence is machine learning, which allows computer programs to learn and create data. Methods: In this work, we aimed to evaluate the performance of the MySLR machine learning platform, which implements the Latent Dirichlet Allocation (LDA) algorithm in the identification and screening of papers present in the literature that focus on mutations of the apolipoprotein E (ApoE) gene in Italian Alzheimer’s Disease patients. Results: MySLR excludes duplicates and creates topics. MySLR was applied to analyze a set of 164 scientific publications. After duplicate removal, the results allowed us to identify 92 papers divided into two relevant topics characterizing the investigated research area. Topic 1 contains 70 papers, and topic 2 contains the remaining 22. Despite the current limitations, the available evidence suggests that articles containing studies on Italian Alzheimer’s Disease (AD) patients were 65.22% (n = 60). Furthermore, the presence of papers about mutations, including single nucleotide polymorphisms (SNPs) ApoE gene, the primary genetic risk factor of AD, for the Italian population was 5.4% (n = 5). Conclusion: The results show that the machine learning platform helped to identify case-control studies on ApoE gene mutations, including SNPs, but not only conducted in Italy.

Keywords