Choice of High-Throughput Proteomics Method Affects Data Integration with Transcriptomics and the Potential Use in Biomarker Discovery

Sergio Mosquim Junior; Valentina Siino; Lisa Rydén; Johan Vallon-Christersson; Fredrik Levander

doi:10.3390/cancers14235761

Cancers (Nov 2022)

Choice of High-Throughput Proteomics Method Affects Data Integration with Transcriptomics and the Potential Use in Biomarker Discovery

Sergio Mosquim Junior,
Valentina Siino,
Lisa Rydén,
Johan Vallon-Christersson,
Fredrik Levander

Affiliations

Sergio Mosquim Junior: Department of Immunotechnology, Lund University, 223 81 Lund, Sweden
Valentina Siino: Department of Immunotechnology, Lund University, 223 81 Lund, Sweden
Lisa Rydén: Division of Surgery, Department of Clinical Sciences Lund, Lund University, 223 81 Lund, Sweden
Johan Vallon-Christersson: Division of Oncology, Department of Clinical Sciences Lund, Lund University, 223 81 Lund, Sweden
Fredrik Levander: Department of Immunotechnology, Lund University, 223 81 Lund, Sweden

DOI: https://doi.org/10.3390/cancers14235761
Journal volume & issue: Vol. 14, no. 23
p. 5761

Abstract

Read online

In recent years, several advances have been achieved in breast cancer (BC) classification and treatment. However, overdiagnosis, overtreatment, and recurrent disease are still significant causes of complication and death. Here, we present the development of a protocol aimed at parallel transcriptome and proteome analysis of BC tissue samples using mass spectrometry, via Data Dependent and Independent Acquisitions (DDA and DIA). Protein digestion was semi-automated and performed on flowthroughs after RNA extraction. Data for 116 samples were acquired in DDA and DIA modes and processed using MaxQuant, EncyclopeDIA, or DIA-NN. DIA-NN showed an increased number of identified proteins, reproducibility, and correlation with matching RNA-seq data, therefore representing the best alternative for this setup. Gene Set Enrichment Analysis pointed towards complementary information being found between transcriptomic and proteomic data. A decision tree model, designed to predict the intrinsic subtypes based on differentially abundant proteins across different conditions, selected protein groups that recapitulate important clinical features, such as estrogen receptor status, HER2 status, proliferation, and aggressiveness. Taken together, our results indicate that the proposed protocol performed well for the application. Additionally, the relevance of the selected proteins points to the possibility of using such data as a biomarker discovery tool for personalized medicine.

Published in Cancers

ISSN: 2072-6694 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Medicine: Internal medicine: Neoplasms. Tumors. Oncology. Including cancer and carcinogens
Website: https://www.mdpi.com/journal/cancers/

About the journal

Abstract

Keywords