Molecular Systems Biology (Mar 2020)

Improved detection of differentially represented DNA barcodes for high‐throughput clonal phenomics

  • Yevhen Akimov,
  • Daria Bulanova,
  • Sanna Timonen,
  • Krister Wennerberg,
  • Tero Aittokallio

DOI
https://doi.org/10.15252/msb.20199195
Journal volume & issue
Vol. 16, no. 3
pp. n/a – n/a

Abstract

Read online

Abstract Cellular DNA barcoding has become a popular approach to study heterogeneity of cell populations and to identify clones with differential response to cellular stimuli. However, there is a lack of reliable methods for statistical inference of differentially responding clones. Here, we used mixtures of DNA‐barcoded cell pools to generate a realistic benchmark read count dataset for modelling a range of outcomes of clone‐tracing experiments. By accounting for the statistical properties intrinsic to the DNA barcode read count data, we implemented an improved algorithm that results in a significantly lower false‐positive rate, compared to current RNA‐seq data analysis algorithms, especially when detecting differentially responding clones in experiments with strong selection pressure. Building on the reliable statistical methodology, we illustrate how multidimensional phenotypic profiling enables one to deconvolute phenotypically distinct clonal subpopulations within a cancer cell line. The mixture control dataset and our analysis results provide a foundation for benchmarking and improving algorithms for clone‐tracing experiments.

Keywords