DEcancer: Machine learning framework tailored to liquid biopsy based cancer detection and biomarker signature selection
Andreas Halner,
Luke Hankey,
Zhu Liang,
Francesco Pozzetti,
Daniel Szulc,
Ella Mi,
Geoffrey Liu,
Benedikt M Kessler,
Junetha Syed,
Peter Jianrui Liu
Affiliations
Andreas Halner
Oxford Cancer Analytics Ltd, 696, BioEscalator, Innovation Building, Old Road Campus, Roosevelt Drive, Headington, Oxford, UK; Corresponding author
Luke Hankey
Oxford Cancer Analytics Ltd, 696, BioEscalator, Innovation Building, Old Road Campus, Roosevelt Drive, Headington, Oxford, UK
Zhu Liang
Oxford Cancer Analytics Ltd, 696, BioEscalator, Innovation Building, Old Road Campus, Roosevelt Drive, Headington, Oxford, UK
Francesco Pozzetti
Oxford Cancer Analytics Ltd, 696, BioEscalator, Innovation Building, Old Road Campus, Roosevelt Drive, Headington, Oxford, UK
Daniel Szulc
Oxford Cancer Analytics Ltd, 696, BioEscalator, Innovation Building, Old Road Campus, Roosevelt Drive, Headington, Oxford, UK
Ella Mi
Oxford Cancer Analytics Ltd, 696, BioEscalator, Innovation Building, Old Road Campus, Roosevelt Drive, Headington, Oxford, UK
Geoffrey Liu
Princess Margaret Cancer Centre, University Health Network, Temerty Faculty of Medicine, University of Toronto, Toronto, ON, Canada
Benedikt M Kessler
Target Discovery Institute, Center for Medicines Discovery, Nuffield Department of Medicine, University of Oxford, Roosevelt Drive, Oxford, OX3 7FZ, UK
Junetha Syed
Oxford Cancer Analytics Ltd, 696, BioEscalator, Innovation Building, Old Road Campus, Roosevelt Drive, Headington, Oxford, UK
Peter Jianrui Liu
Oxford Cancer Analytics Ltd, 696, BioEscalator, Innovation Building, Old Road Campus, Roosevelt Drive, Headington, Oxford, UK; Corresponding author
Summary: Cancer is a leading cause of mortality worldwide. Over 50% of cancers are diagnosed late, rendering many treatments ineffective. Existing liquid biopsy studies demonstrate a minimally invasive and inexpensive approach for disease detection but lack parsimonious biomarker selection, exhibit poor cancer detection performance and lack appropriate validation and testing. We established a tailored machine learning pipeline, DEcancer, for liquid biopsy analysis that addresses these limitations and improved performance. In a test set from a published cohort of 1,005 patients including 8 cancer types and 812 cancer-free individuals, DEcancer increased stage 1 cancer detection sensitivity across cancer types from 48 to 90%. In addition, with a test set cohort of patients from a high dimensional proteomics dataset of 61 lung cancer patients and 80 cancer-free individuals, DEcancer’s performance using a 14-43 protein panel was comparable to 1,000 original proteins. DEcancer is a promising tool which may facilitate improved cancer detection and management.