Frontiers in Oncology (Apr 2024)
Exploration of identifying individual tumor tissue based on probabilistic model
Abstract
Variations in the tumor genome can result in allelic changes compared to the reference profile of its homogenous body source on genetic markers. This brings a challenge to source identification of tumor samples, such as clinically collected pathological paraffin-embedded tissue and sections. In this study, a probabilistic model was developed for calculating likelihood ratio (LR) to tackle this issue, which utilizes short tandem repeat (STR) genotyping data. The core of the model is to consider tumor tissue as a mixture of normal and tumor cells and introduce the incidence of STR variants (φ) and the percentage of normal cells (Mxn) as a priori parameters when performing calculations. The relationship between LR values and φ or Mxn was also investigated. Analysis of tumor samples and reference blood samples from 17 colorectal cancer patients showed that all samples had Log10(LR) values greater than 1014. In the non-contributor test, 99.9% of the quartiles had Log10(LR) values less than 0. When the defense’s hypothesis took into account the possibility that the tumor samples came from the patient’s relatives, LR greater than 0 was still obtained. Furthermore, this study revealed that LR values increased with decreasing φ and increasing Mxn. Finally, LR interval value was provided for each tumor sample by considering the confidence interval of Mxn. The probabilistic model proposed in this paper could deal with the possibility of tumor allele variability and offers an evaluation of the strength of evidence for determining tumor origin in clinical practice and forensic identification.
Keywords