Machine learning modelling of blood lipid biomarkers in familial hypercholesterolaemia versus polygenic/environmental dyslipidaemia

Marta Correia; Eva Kagenaar; Daniël Bernardus van Schalkwijk; Mafalda Bourbon; Margarida Gama-Carvalho

doi:10.1038/s41598-021-83392-w

Scientific Reports (Feb 2021)

Affiliations

Marta Correia: University of Lisboa, Faculty of Sciences, BioISI—Biosystems & Integrative Sciences Institute
Eva Kagenaar: Amsterdam University College
Daniël Bernardus van Schalkwijk: Amsterdam University College
Mafalda Bourbon: University of Lisboa, Faculty of Sciences, BioISI—Biosystems & Integrative Sciences Institute
Margarida Gama-Carvalho: University of Lisboa, Faculty of Sciences, BioISI—Biosystems & Integrative Sciences Institute

DOI: https://doi.org/10.1038/s41598-021-83392-w
Journal volume & issue: Vol. 11, no. 1
pp. 1 – 9

Abstract

Familial hypercholesterolaemia (FH) increases circulating LDL-C levels and leads to premature cardiovascular disease when undiagnosed or untreated. Current guidelines support genetic testing in patients who meet clinical diagnostic criteria, followed by cascade screening of their family members. However, most hyperlipidaemic subjects do not carry pathogenic variants in the known disease genes and most likely suffer from polygenic hypercholesterolaemia, which translates into a relatively low yield of genetic screening programs. This study aims to identify new biomarkers and develop new approaches to improve the identification of individuals carrying monogenic causative variants. Using a machine-learning approach on a paediatric dataset of individuals who had been tested for disease-causing genes and had an extended lipid profile, we developed new models able to classify FH patients with much higher specificity than currently used methods. The best-performing models incorporated parameters absent from the most common FH clinical criteria, namely apoB/apoA-I, TG/apoB and LDL1. These parameters were found to contribute to an improved identification of monogenic individuals. Furthermore, models using only TC and LDL-C levels presented a higher specificity of classification than simple cut-offs. Our results can be applied to improve the yield of genetic screening programs and their corresponding costs.
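The abstract compares classifiers by their specificity and names two ratio parameters, apoB/apoA-I and TG/apoB. As a reading aid only, the sketch below shows how such quantities are commonly computed. It is not the authors' pipeline: the function names, the label convention (1 = monogenic FH, 0 = not) and the threshold argument are illustrative assumptions, and no values from the study are used.

```python
from typing import Dict, Sequence


def specificity(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Fraction of true negatives (label 0) that are also predicted 0."""
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    if tn + fp == 0:
        raise ValueError("y_true contains no negative cases")
    return tn / (tn + fp)


def sensitivity(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Fraction of true positives (label 1) that are also predicted 1."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    if tp + fn == 0:
        raise ValueError("y_true contains no positive cases")
    return tp / (tp + fn)


def ratio_features(apob: float, apoa1: float, tg: float) -> Dict[str, float]:
    """Derive the two ratio parameters named in the abstract.

    Assumes all inputs are positive and expressed in mutually
    consistent units (an assumption of this sketch).
    """
    return {"apoB/apoA-I": apob / apoa1, "TG/apoB": tg / apob}


def cutoff_rule(ldl_c: float, threshold: float) -> int:
    """Simple cut-off classifier: predict 1 when LDL-C reaches the threshold.

    The threshold is a caller-supplied, hypothetical parameter.
    """
    return 1 if ldl_c >= threshold else 0
```

A higher specificity means fewer non-monogenic individuals are flagged for genetic testing, which is how an improved classifier would raise the yield of a screening program.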

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/
