Journal of the American Heart Association: Cardiovascular and Cerebrovascular Disease (Sep 2023)

Evolutionary Action–Machine Learning Model Identifies Candidate Genes Associated With Early‐Onset Coronary Artery Disease

  • Dillon Shapiro,
  • Kwanghyuk Lee,
  • Jennifer Asmussen,
  • Thomas Bourquard,
  • Olivier Lichtarge

DOI
https://doi.org/10.1161/JAHA.122.029103
Journal volume & issue
Vol. 12, no. 17

Abstract

Background Coronary artery disease is a primary cause of death around the world, with both genetic and environmental risk factors. Although genome‐wide association studies have linked >100 unique loci to its genetic basis, these only explain a fraction of disease heritability.

Methods and Results To find additional gene drivers of coronary artery disease, we applied machine learning to quantitative evolutionary information on the impact of coding variants in whole exomes from the Myocardial Infarction Genetics Consortium. Using ensemble‐based supervised learning, the Evolutionary Action–Machine Learning framework ranked each gene's ability to classify case and control samples and identified 79 significant associations. These were connected to known risk loci; enriched in cardiovascular processes like lipid metabolism, blood clotting, and inflammation; and enriched for cardiovascular phenotypes in knockout mouse models. Among them, INPP5F and MST1R are examples of potentially novel coronary artery disease risk genes that modulate immune signaling in response to cardiac stress.

Conclusions We concluded that machine learning on the functional impact of coding variants, based on a massive amount of evolutionary information, has the power to suggest novel coronary artery disease risk genes for mechanistic and therapeutic discoveries in cardiovascular biology, and should also apply in other complex polygenic diseases.

Keywords