Genome Biology (Mar 2022)

TADA—a machine learning tool for functional annotation-based prioritisation of pathogenic CNVs

  • Jakob Hertzberg,
  • Stefan Mundlos,
  • Martin Vingron,
  • Giuseppe Gallone

DOI
https://doi.org/10.1186/s13059-022-02631-z
Journal volume & issue
Vol. 23, no. 1
pp. 1 – 21

Abstract

Read online

Abstract Few methods have been developed to investigate copy number variants (CNVs) based on their predicted pathogenicity. We introduce TADA, a method to prioritise pathogenic CNVs through assisted manual filtering and automated classification, based on an extensive catalogue of functional annotation supported by rigourous enrichment analysis. We demonstrate that our classifiers are able to accurately predict pathogenic CNVs, outperforming current alternative methods, and produce a well-calibrated pathogenicity score. Our results suggest that functional annotation-based prioritisation of pathogenic CNVs is a promising approach to support clinical diagnostics and to further the understanding of mechanisms controlling the disease impact of larger genomic alterations.

Keywords