Communications Biology (May 2023)

Copy number alteration features in pan-cancer homologous recombination deficiency prediction and biology

  • Huizi Yao,
  • Huimin Li,
  • Jinyu Wang,
  • Tao Wu,
  • Wei Ning,
  • Kaixuan Diao,
  • Chenxu Wu,
  • Guangshuai Wang,
  • Ziyu Tao,
  • Xiangyu Zhao,
  • Jing Chen,
  • Xiaoqin Sun,
  • Xue-Song Liu

DOI
https://doi.org/10.1038/s42003-023-04901-3
Journal volume & issue
Vol. 6, no. 1
pp. 1 – 9

Abstract

Read online

Abstract Homologous recombination deficiency (HRD) renders cancer cells vulnerable to unrepaired double-strand breaks and is an important therapeutic target as exemplified by the clinical efficacy of poly ADP-ribose polymerase (PARP) inhibitors as well as the platinum chemotherapy drugs applied to HRD patients. However, it remains a challenge to predict HRD status precisely and economically. Copy number alteration (CNA), as a pervasive trait of human cancers, can be extracted from a variety of data sources, including whole genome sequencing (WGS), SNP array, and panel sequencing, and thus can be easily applied clinically. Here we systematically evaluate the predictive performance of various CNA features and signatures in HRD prediction and build a gradient boosting machine model (HRDCNA) for pan-cancer HRD prediction based on these CNA features. CNA features BP10MB[1] (The number of breakpoints per 10MB of DNA is 1) and SS[ > 7 & <=8] (The log10-based size of segments is greater than 7 and less than or equal to 8) are identified as the most important features in HRD prediction. HRDCNA suggests the biallelic inactivation of BRCA1, BRCA2, PALB2, RAD51C, RAD51D, and BARD1 as the major genetic basis for human HRD, and may also be applied to effectively validate the pathogenicity of BRCA1/2 variants of uncertain significance (VUS). Together, this study provides a robust tool for cost-effective HRD prediction and also demonstrates the applicability of CNA features and signatures in cancer precision medicine.