Identifying and characterizing disease subpopulations that most benefit from polygenic risk scores

Monica Isgut; Felipe Giuste; Logan Gloster; Aniketh Swain; Katherine Choi; Andrew Hornback; Shriprasad R. Deshpande; May D. Wang

doi:10.1038/s41598-024-63705-5

Scientific Reports (Sep 2024)

Identifying and characterizing disease subpopulations that most benefit from polygenic risk scores

Monica Isgut,
Felipe Giuste,
Logan Gloster,
Aniketh Swain,
Katherine Choi,
Andrew Hornback,
Shriprasad R. Deshpande,
May D. Wang

Affiliations

Monica Isgut: Department of Bioinformatics, Georgia Institute of Technology
Felipe Giuste: School of Biomedical Engineering, Georgia Institute of Technology and Emory University
Logan Gloster: Department of Bioinformatics, Georgia Institute of Technology
Aniketh Swain: School of Biomedical Engineering, Georgia Institute of Technology and Emory University
Katherine Choi: School of Biomedical Engineering, Georgia Institute of Technology and Emory University
Andrew Hornback: School of Biomedical Engineering, Georgia Institute of Technology and Emory University
Shriprasad R. Deshpande: Advanced Cardiac Therapies and Heart Transplant Program, Children’s National Hospital
May D. Wang: School of Biomedical Engineering, Georgia Institute of Technology and Emory University

DOI: https://doi.org/10.1038/s41598-024-63705-5
Journal volume & issue: Vol. 14, no. 1
pp. 1 – 13

Abstract

Read online

Abstract Polygenic risk scores (PRSs) hold promise in their potential translation into clinical settings to improve disease risk prediction. An important consideration in integrating PRSs into clinical settings is to gain an understanding of how to identify which subpopulations of individuals most benefit from PRSs for risk prediction. In this study, using the UK Biobank dataset, we trained logistic regression models to predict the 10 year incident risk of myocardial infarction, breast cancer, and schizophrenia using either just clinical features or clinical features combined with PRSs. For each disease, we identified the top 10% subgroup with the greatest magnitude of improvement in risk prediction accuracy attributed to PRSs in the multi-modal model. Using up to ~ 3.6 k demographic, lifestyle, diagnostic, lab, and physical measurement features from the UK Biobank dataset of ~ 500 k individuals, we characterized these subgroups based on various clinical, lifestyle, and demographic characteristics. The incident cases in the top 10% subgroup for each disease represent distinct phenotypes that differ from other cases and that are strongly correlated with genetic predisposition. Our findings provide insights into disease subtypes and can encourage future studies aimed at classifying these individuals to enhance the targeting of polygenic risk scoring in practice.

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/

About the journal