Scientific Reports (May 2024)

Validating reference-based algorithms to determine cell-type heterogeneity in ovarian cancer DNA methylation studies

  • Edyta Biskup,
  • Joanna Lopacinska-Jørgensen,
  • Lau Kræsing Vestergaard,
  • Estrid Høgdall

DOI
https://doi.org/10.1038/s41598-024-61857-y
Journal volume & issue
Vol. 14, no. 1
pp. 1 – 15

Abstract

Read online

Abstract Information about cell composition in tissue samples is crucial for biomarker discovery and prognosis. Specifically, cancer tissue samples present challenges in deconvolution studies due to mutations and genetic rearrangements. Here, we optimized a robust, DNA methylation-based protocol, to be used for deconvolution of ovarian cancer samples. We compared several state-of-the-art methods (HEpiDISH, MethylCIBERSORT and ARIC) and validated the proposed protocol in an in-silico mixture and in an external dataset containing samples from ovarian cancer patients and controls. The deconvolution protocol we eventually implemented is based on MethylCIBERSORT. Comparing deconvolution methods, we paid close attention to the role of a reference panel. We postulate that a possibly high number of samples (in our case: 247) should be used when building a reference panel to ensure robustness and to compensate for biological and technical variation between samples. Subsequently, we tested the performance of the validated protocol in our own study cohort, consisting of 72 patients with malignant and benign ovarian disease as well as in five external cohorts. In conclusion, we refined and validated a reference-based algorithm to determine cell type composition of ovarian cancer tissue samples to be used in cancer biology studies in larger cohorts.

Keywords