PLoS ONE (Jan 2022)

Identifying genetic variants associated with the ICD10 (International Classification of Diseases10)-based diagnosis of cerebrovascular disease using a large-scale biomedical database

  • Fahad Alkhalfan,
  • Alex Gyftopoulos,
  • Yi-Ju Chen,
  • Charles H. Williams,
  • James A. Perry,
  • Charles C. Hong

Journal volume & issue
Vol. 17, no. 8

Abstract

Read online

Objectives To utilize the UK Biobank to identify genetic variants associated with the ICD10 (International Classification of Diseases10)-based diagnosis of cerebrovascular disease (CeVD). Background Cerebrovascular disease occurs because of a complex interplay between vascular, environmental, and genetic factors. It is the second leading cause of disability worldwide. Understanding who may be genetically predisposed to cerebrovascular disease can help guide preventative efforts. Moreover, there is considerable interest in the use of real-world data, such as EHR (electronic health records) to better understand disease mechanisms and to discover new treatment strategies, but whether ICD10-based diagnosis can be used to study CeVD genetics is unknown. Methods Using the UK Biobank, we conducted a genome-wide association study (GWAS) where we analyzed the genomes of 11,155 cases and 122,705 controls who were sex, age and ancestry-matched in a 1:11 case: control design. Genetic variants were identified by Plink’s firth logistic regression and assessed for association with the ICD10 codes corresponding to CeVD. Results We identified two groups of SNPs closely linked to PITX2 and LRRTM4 that were significantly associated with CeVD in this study (p 0.5%. Discussion Disease assignment based on ICD10 codes may underestimate prevalence; however, for CeVD, this does not appear to be the case. Compared to the age- and sex-matched control population, individuals with CeVD were more frequently diagnosed with comorbid conditions, such as hypertension, hyperlipidemia & atrial fibrillation or flutter, confirming their contribution to CeVD. The UK Biobank based ICD10 study identified 2 groups of variants that were associated with CeVD. The association between PITX2 and CeVD is likely explained by the increased rates of atrial fibrillation and flutter. While the mechanism explaining the relationship between LRRTM4 and CeVD is unclear, this has been documented in previous studies.