Polygenic prediction across populations is influenced by ancestry, genetic architecture, and methodology
Ying Wang,
Masahiro Kanai,
Taotao Tan,
Mireille Kamariza,
Kristin Tsuo,
Kai Yuan,
Wei Zhou,
Yukinori Okada,
Hailiang Huang,
Patrick Turley,
Elizabeth G. Atkinson,
Alicia R. Martin
Affiliations
Ying Wang
Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA; Stanley Center for Psychiatric Research and Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA; Corresponding author
Masahiro Kanai
Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA; Stanley Center for Psychiatric Research and Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA; Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA; Department of Statistical Genetics, Osaka University Graduate School of Medicine, Suita, Japan
Taotao Tan
Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
Mireille Kamariza
Society of Fellows, Harvard University, Cambridge, MA 02138, USA
Kristin Tsuo
Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA; Stanley Center for Psychiatric Research and Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
Kai Yuan
Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA; Stanley Center for Psychiatric Research and Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
Wei Zhou
Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA; Stanley Center for Psychiatric Research and Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
Yukinori Okada
Department of Statistical Genetics, Osaka University Graduate School of Medicine, Suita, Japan; Laboratory for Systems Genetics, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan; Laboratory of Statistical Immunology, Immunology Frontier Research Center (WPI-IFReC), Center for Infectious Disease Education and Research (CiDER), and Integrated Frontier Research for Medical Science Division, Institute for Open and Transdisciplinary Research Initiatives, Osaka University, Suita 565-0871, Japan; Department of Genome Informatics, Graduate School of Medicine, the University of Tokyo, Tokyo 113-0033, Japan
Hailiang Huang
Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA; Stanley Center for Psychiatric Research and Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
Patrick Turley
Department of Economics, and Center for Economic and Social Research, University of Southern California, Los Angeles, CA, USA
Elizabeth G. Atkinson
Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
Alicia R. Martin
Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA; Stanley Center for Psychiatric Research and Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA; Corresponding author
Summary: Polygenic risk scores (PRSs) developed from multi-ancestry genome-wide association studies (GWASs), PRSmulti, hold promise for improving PRS accuracy and generalizability across populations. To establish best practices for leveraging the increasing diversity of genomic studies, we investigated how various factors affect the performance of PRSmulti compared with PRSs constructed from single-ancestry GWASs (PRSsingle). Through extensive simulations and empirical analyses, we showed that PRSmulti overall outperformed PRSsingle in understudied populations, except when the understudied population represented a small proportion of the multi-ancestry GWAS. Furthermore, integrating PRSs based on local ancestry-informed GWASs and large-scale, European-based PRSs improved predictive performance in understudied African populations, especially for less polygenic traits with large-effect ancestry-enriched variants. Our work highlights the importance of diversifying genomic studies to achieve equitable PRS performance across ancestral populations and provides guidance for developing PRSs from multiple studies.