Genome Medicine (Jan 2024)

MAGPIE: accurate pathogenic prediction for multiple variant types using machine learning approach

  • Yicheng Liu,
  • Tianyun Zhang,
  • Ningyuan You,
  • Sai Wu,
  • Ning Shen

DOI
https://doi.org/10.1186/s13073-023-01274-4
Journal volume & issue
Vol. 16, no. 1
pp. 1 – 19

Abstract

Identifying pathogenic variants from the vast majority of nucleotide variation remains a challenge. We present a method named Multimodal Annotation Generated Pathogenic Impact Evaluator (MAGPIE) that predicts the pathogenicity of multi-type variants. MAGPIE uses the ClinVar dataset for training and demonstrates superior performance in both the independent test set and multiple orthogonal validation datasets, accurately predicting variant pathogenicity. Notably, MAGPIE performs best in predicting the pathogenicity of rare variants and highly imbalanced datasets. Overall, results underline the robustness of MAGPIE as a valuable tool for predicting pathogenicity in various types of human genome variations. MAGPIE is available at https://github.com/shenlab-genomics/magpie.

Keywords