Scientific Reports (Jan 2024)

Next generation phenotyping for diagnosis and phenotype–genotype correlations in Kabuki syndrome

  • Quentin Hennocq,
  • Marjolaine Willems,
  • Jeanne Amiel,
  • Stéphanie Arpin,
  • Tania Attie-Bitach,
  • Thomas Bongibault,
  • Thomas Bouygues,
  • Valérie Cormier-Daire,
  • Pierre Corre,
  • Klaus Dieterich,
  • Maxime Douillet,
  • Jean Feydy,
  • Eva Galliani,
  • Fabienne Giuliano,
  • Stanislas Lyonnet,
  • Arnaud Picard,
  • Thantrira Porntaveetus,
  • Marlène Rio,
  • Flavien Rouxel,
  • Vorasuk Shotelersuk,
  • Annick Toutain,
  • Kevin Yauy,
  • David Geneviève,
  • Roman H. Khonsari,
  • Nicolas Garcelon

DOI
https://doi.org/10.1038/s41598-024-52691-3
Journal volume & issue
Vol. 14, no. 1
pp. 1 – 15

Abstract

Read online

Abstract The field of dysmorphology has been changed by the use Artificial Intelligence (AI) and the development of Next Generation Phenotyping (NGP). The aim of this study was to propose a new NGP model for predicting KS (Kabuki Syndrome) on 2D facial photographs and distinguish KS1 (KS type 1, KMT2D-related) from KS2 (KS type 2, KDM6A-related). We included retrospectively and prospectively, from 1998 to 2023, all frontal and lateral pictures of patients with a molecular confirmation of KS. After automatic preprocessing, we extracted geometric and textural features. After incorporation of age, gender, and ethnicity, we used XGboost (eXtreme Gradient Boosting), a supervised machine learning classifier. The model was tested on an independent validation set. Finally, we compared the performances of our model with DeepGestalt (Face2Gene). The study included 1448 frontal and lateral facial photographs from 6 centers, corresponding to 634 patients (527 controls, 107 KS); 82 (78%) of KS patients had a variation in the KMT2D gene (KS1) and 23 (22%) in the KDM6A gene (KS2). We were able to distinguish KS from controls in the independent validation group with an accuracy of 95.8% (78.9–99.9%, p < 0.001) and distinguish KS1 from KS2 with an empirical Area Under the Curve (AUC) of 0.805 (0.729–0.880, p < 0.001). We report an automatic detection model for KS with high performances (AUC 0.993 and accuracy 95.8%). We were able to distinguish patients with KS1 from KS2, with an AUC of 0.805. These results outperform the current commercial AI-based solutions and expert clinicians.