Computational and Structural Biotechnology Journal (Dec 2024)

Decoding phenotypic screening: A comparative analysis of image representations

  • Adriana Borowa,
  • Dawid Rymarczyk,
  • Marek Żyła,
  • Maciej Kańdula,
  • Ana Sánchez-Fernández,
  • Krzysztof Rataj,
  • Łukasz Struski,
  • Jacek Tabor,
  • Bartosz Zieliński

Journal volume & issue
Vol. 23
pp. 1181 – 1188

Abstract

Read online

Biomedical imaging techniques such as high content screening (HCS) are valuable for drug discovery, but high costs limit their use to pharmaceutical companies. To address this issue, The JUMP-CP consortium released a massive open image dataset of chemical and genetic perturbations, providing a valuable resource for deep learning research. In this work, we aim to utilize the JUMP-CP dataset to develop a universal representation model for HCS data, mainly data generated using U2OS cells and CellPainting protocol, using supervised and self-supervised learning approaches. We propose an evaluation protocol that assesses their performance on mode of action and property prediction tasks using a popular phenotypic screening dataset. Results show that the self-supervised approach that uses data from multiple consortium partners provides representation that is more robust to batch effects whilst simultaneously achieving performance on par with standard approaches. Together with other conclusions, it provides recommendations on the training strategy of a representation model for HCS images.

Keywords