Bioinformatics Division, Walter and Eliza Hall Institute of Medical Research, Parkville, Australia; School of Mathematics and Statistics, The University of Melbourne, Melbourne, Australia
The Walter and Eliza Hall Institute of Medical Research, Parkville, Australia; Department of Medical Biology, The University of Melbourne, Parkville, Australia
Tania Tan
The Walter and Eliza Hall Institute of Medical Research, Parkville, Australia; Department of Medical Biology, The University of Melbourne, Parkville, Australia
Lawrence Leong
Bioinformatics Division, Walter and Eliza Hall Institute of Medical Research, Parkville, Australia; School of Mathematics and Statistics, The University of Melbourne, Melbourne, Australia
The Walter and Eliza Hall Institute of Medical Research, Parkville, Australia; Department of Medical Biology, The University of Melbourne, Parkville, Australia
Bioinformatics Division, Walter and Eliza Hall Institute of Medical Research, Parkville, Australia; School of Mathematics and Statistics, The University of Melbourne, Melbourne, Australia
Mass cytometry (CyTOF) is a technology that has revolutionised single-cell biology. By detecting over 40 proteins on millions of single cells, CyTOF allows the characterisation of cell subpopulations in unprecedented detail. However, most CyTOF studies require the integration of data from multiple CyTOF batches usually acquired on different days and possibly at different sites. To date, the integration of CyTOF datasets remains a challenge due to technical differences arising in multiple batches. To overcome this limitation, we developed an approach called CytofRUV for analysing multiple CyTOF batches, which includes an R-Shiny application with diagnostic plots. CytofRUV can correct for batch effects and integrate data from large numbers of patients and conditions across batches, to confidently compare cellular changes and correlate these with clinically relevant outcomes.