Genes (Dec 2023)

Machine Learning to Advance Human Genome-Wide Association Studies

  • Rafaella E. Sigala,
  • Vasiliki Lagou,
  • Aleksey Shmeliov,
  • Sara Atito,
  • Samaneh Kouchaki,
  • Muhammad Awais,
  • Inga Prokopenko,
  • Adam Mahdi,
  • Ayse Demirkan

DOI
https://doi.org/10.3390/genes15010034
Journal volume & issue
Vol. 15, no. 1
p. 34

Abstract

Read online

Machine learning, including deep learning, reinforcement learning, and generative artificial intelligence are revolutionising every area of our lives when data are made available. With the help of these methods, we can decipher information from larger datasets while addressing the complex nature of biological systems in a more efficient way. Although machine learning methods have been introduced to human genetic epidemiological research as early as 2004, those were never used to their full capacity. In this review, we outline some of the main applications of machine learning to assigning human genetic loci to health outcomes. We summarise widely used methods and discuss their advantages and challenges. We also identify several tools, such as Combi, GenNet, and GMSTool, specifically designed to integrate these methods for hypothesis-free analysis of genetic variation data. We elaborate on the additional value and limitations of these tools from a geneticist’s perspective. Finally, we discuss the fast-moving field of foundation models and large multi-modal omics biobank initiatives.

Keywords