A tool for federated training of segmentation models on whole slide images

Brendon Lutnick; David Manthey; Jan U. Becker; Jonathan E. Zuckerman; Luis Rodrigues; Kuang-Yu Jen; Pinaki Sarder

doi:10.1016/j.jpi.2022.100101

Journal of Pathology Informatics (Jan 2022)

A tool for federated training of segmentation models on whole slide images

Brendon Lutnick,
David Manthey,
Jan U. Becker,
Jonathan E. Zuckerman,
Luis Rodrigues,
Kuang-Yu Jen,
Pinaki Sarder

Affiliations

Brendon Lutnick: Department of Pathology and Anatomical Sciences, SUNY Buffalo, Buffalo, NY, USA
David Manthey: Kitware Incorporated, Clifton Park, NY, USA
Jan U. Becker: Institute of Pathology, University Hospital Cologne, Cologne, Germany
Jonathan E. Zuckerman: Department of Pathology and Laboratory Medicine, University of California at Los Angeles, Los Angeles, CA, USA
Luis Rodrigues: University Clinic of Nephrology, Faculty of Medicine, University of Coimbra, Portugal
Kuang-Yu Jen: University of California, Davis School of Medicine, Sacramento, CA, USA
Pinaki Sarder: Department of Pathology and Anatomical Sciences, SUNY Buffalo, Buffalo, NY, USA; Corresponding author.

DOI: https://doi.org/10.1016/j.jpi.2022.100101
Journal volume & issue: Vol. 13
p. 100101

Abstract

Read online

The largest bottleneck to the development of convolutional neural network (CNN) models in the computational pathology domain is the collection and curation of diverse training datasets. Training CNNs requires large cohorts of image data, and model generalizability is dependent on training data heterogeneity. Including data from multiple centers enhances the generalizability of CNN-based models, but this is hindered by the logistical challenges of sharing medical data. In this paper, we explore the feasibility of training our recently developed cloud-based segmentation tool (Histo-Cloud) using federated learning. Using a dataset of renal tissue biopsies we show that federated training to segment interstitial fibrosis and tubular atrophy (IFTA) using datasets from three institutions is not found to be different from a training by pooling the data on one server when tested on a fourth (holdout) institution’s data. Further, training a model to segment glomeruli for a federated dataset (split by staining) demonstrates similar performance.

Published in Journal of Pathology Informatics

ISSN: 2229-5089 (Print); 2153-3539 (Online)
Publisher: Elsevier
Country of publisher: United States
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics; Medicine: Pathology
Website: https://www.journals.elsevier.com/journal-of-pathology-informatics

About the journal

Abstract

Keywords