Communications Biology (May 2023)

Single-cell subcellular protein localisation using novel ensembles of diverse deep architectures

  • Syed Sameed Husain,
  • Eng-Jon Ong,
  • Dmitry Minskiy,
  • Mikel Bober-Irizar,
  • Amaia Irizar,
  • Miroslaw Bober

DOI
https://doi.org/10.1038/s42003-023-04840-z
Journal volume & issue
Vol. 6, no. 1, pp. 1–14

Abstract

Unravelling protein distributions within individual cells is vital to understanding their function and state and indispensable to developing new treatments. Here we present the Hybrid subCellular Protein Localiser (HCPL), which learns from weakly labelled data to robustly localise single-cell subcellular protein patterns. It comprises innovative DNN architectures exploiting wavelet filters and learnt parametric activations that successfully tackle drastic cell variability. HCPL features correlation-based ensembling of novel architectures that boosts performance and aids generalisation. Large-scale data annotation is made feasible by our AI-trains-AI approach, which determines the visual integrity of cells and emphasises reliable labels for efficient training. In the Human Protein Atlas context, we demonstrate that HCPL is best performing in the single-cell classification of protein localisation patterns. To better understand the inner workings of HCPL and assess its biological relevance, we analyse the contributions of each system component and dissect the emergent features from which the localisation predictions are derived.