Scientific Reports (Feb 2024)
A gene expression-based classifier for HER2-low breast cancer
Abstract
Abstract In clinical trials evaluating antibody-conjugated drugs (ADCs), HER2-low breast cancer is defined through protein immunohistochemistry scoring (IHC) 1+ or 2+ without gene amplification. However, in daily practice, the accuracy of IHC is compromised by inter-observer variability. Herein, we aimed to identify HER2-low breast cancer primary tumors by leveraging gene expression profiling. A discovery approach was applied to gene expression profile of institutional INT1 (n = 125) and INT2 (n = 84) datasets. We identified differentially expressed genes (DEGs) in each specific HER2 IHC category 0, 1+, 2+ and 3+. Principal Component Analysis was used to generate a HER2-low signature whose performance was evaluated in the independent INT3 (n = 95), and in the publicly available TCGA and GSE81538 datasets. The association between the HER2-low signature and HER2 IHC categories was evaluated by Kruskal–Wallis test with post hoc pair-wise comparisons. The HER2-low signature discriminatory capability was assessed by estimating the area under the receiver operating characteristic curve (AUC). Gene Ontology and KEGG analyses were performed to evaluate the HER2-low signature genes functional enrichment. A HER2-low signature was computed based on HER2 IHC category-specific DEGs. The twenty genes included in the signature were significantly enriched with lipid and steroid metabolism pathways, peptidase regulation, and humoral immune response. The HER2-low signature values showed a bell-shaped distribution across IHC categories (low values in 0 and 3+; high values in 1+ and 2+), effectively distinguishing HER2-low from 0 (p < 0.001) to 3+ (p < 0.001). Notably, the signature values were higher in tumors scored with 1+ as compared to 0. The HER2-low signature association with IHC categories and its bell-shaped distribution was confirmed in the independent INT3, TCGA and GSE81538 datasets. In the combined INT1 and INT3 datasets, the HER2-low signature achieved an AUC value of 0.74 (95% confidence interval, CI 0.67–0.81) in distinguishing HER2-low vs. the other categories, outperforming the individual ERBB2 mRNA AUC value of 0.52 (95% CI 0.43–0.60). These results represent a proof-of-concept for an observer-independent gene-expression-based classifier of HER2-low status. The herein identified 20-gene signature shows promise in distinguishing between HER2 0 and HER2-low expressing tumors, including those scored as 1+ at IHC, and in developing a selection approach for ADCs candidates.