Frontiers in Pharmacology (Mar 2022)

Patient-Patient Similarity-Based Screening of a Clinical Data Warehouse to Support Ciliopathy Diagnosis

  • Xiaoyi Chen,
  • Xiaoyi Chen,
  • Xiaoyi Chen,
  • Carole Faviez,
  • Carole Faviez,
  • Marc Vincent,
  • Luis Briseño-Roa,
  • Hassan Faour,
  • Jean-Philippe Annereau,
  • Stanislas Lyonnet,
  • Mohamad Zaidan,
  • Sophie Saunier,
  • Nicolas Garcelon,
  • Nicolas Garcelon,
  • Nicolas Garcelon,
  • Anita Burgun,
  • Anita Burgun,
  • Anita Burgun

DOI
https://doi.org/10.3389/fphar.2022.786710
Journal volume & issue
Vol. 13

Abstract

Read online

A timely diagnosis is a key challenge for many rare diseases. As an expanding group of rare and severe monogenic disorders with a broad spectrum of clinical manifestations, ciliopathies, notably renal ciliopathies, suffer from important underdiagnosis issues. Our objective is to develop an approach for screening large-scale clinical data warehouses and detecting patients with similar clinical manifestations to those from diagnosed ciliopathy patients. We expect that the top-ranked similar patients will benefit from genetic testing for an early diagnosis. The dependence and relatedness between phenotypes were taken into account in our similarity model through medical concept embedding. The relevance of each phenotype to each patient was also considered by adjusted aggregation of phenotype similarity into patient similarity. A ranking model based on the best-subtype-average similarity was proposed to address the phenotypic overlapping and heterogeneity of ciliopathies. Our results showed that using less than one-tenth of learning sources, our language and center specific embedding provided comparable or better performances than other existing medical concept embeddings. Combined with the best-subtype-average ranking model, our patient-patient similarity-based screening approach was demonstrated effective in two large scale unbalanced datasets containing approximately 10,000 and 60,000 controls with kidney manifestations in the clinical data warehouse (about 2 and 0.4% of prevalence, respectively). Our approach will offer the opportunity to identify candidate patients who could go through genetic testing for ciliopathy. Earlier diagnosis, before irreversible end-stage kidney disease, will enable these patients to benefit from appropriate follow-up and novel treatments that could alleviate kidney dysfunction.

Keywords