Frontiers in Systems Biology (Jun 2024)

De novo prediction of functional effects of genetic variants from DNA sequences based on context-specific molecular information

  • Jiaxin Yang,
  • Sikta Das Adhikari,
  • Sikta Das Adhikari,
  • Hao Wang,
  • Binbin Huang,
  • Wenjie Qi,
  • Wenjie Qi,
  • Yuehua Cui,
  • Jianrong Wang

DOI
https://doi.org/10.3389/fsysb.2024.1402664
Journal volume & issue
Vol. 4

Abstract

Read online

Deciphering the functional effects of noncoding genetic variants stands as a fundamental challenge in human genetics. Traditional approaches, such as Genome-Wide Association Studies (GWAS), Transcriptome-Wide Association Studies (TWAS), and Quantitative Trait Loci (QTL) studies, are constrained by obscured the underlying molecular-level mechanisms, making it challenging to unravel the genetic basis of complex traits. The advent of Next-Generation Sequencing (NGS) technologies has enabled context-specific genome-wide measurements, encompassing gene expression, chromatin accessibility, epigenetic marks, and transcription factor binding sites, to be obtained across diverse cell types and tissues, paving the way for decoding genetic variation effects directly from DNA sequences only. The de novo predictions of functional effects are pivotal for enhancing our comprehension of transcriptional regulation and its disruptions caused by the plethora of noncoding genetic variants linked to human diseases and traits. This review provides a systematic overview of the state-of-the-art models and algorithms for genetic variant effect predictions, including traditional sequence-based models, Deep Learning models, and the cutting-edge Foundation Models. It delves into the ongoing challenges and prospective directions, presenting an in-depth perspective on contemporary developments in this domain.

Keywords