Wide kernels and their DCT compression in convolutional networks for nuclei segmentation

Vincent Andrearczyk; Valentin Oreiller; Adrien Depeursinge

Informatics in Medicine Unlocked (Jan 2023)

Wide kernels and their DCT compression in convolutional networks for nuclei segmentation

Vincent Andrearczyk,
Valentin Oreiller,
Adrien Depeursinge

Affiliations

Vincent Andrearczyk: Corresponding author at: University of Applied Sciences of Western Switzerland HES-SO Valais, Rue de Technopole 3, 3960 Sierre, Switzerland.; University of Applied Sciences of Western Switzerland HES-SO Valais, Rue de Technopole 3, 3960 Sierre, Switzerland; Service of Nuclear Medicine and Molecular Imaging, CHUV, Lausanne, Switzerland
Valentin Oreiller: University of Applied Sciences of Western Switzerland HES-SO Valais, Rue de Technopole 3, 3960 Sierre, Switzerland; Service of Nuclear Medicine and Molecular Imaging, CHUV, Lausanne, Switzerland
Adrien Depeursinge: University of Applied Sciences of Western Switzerland HES-SO Valais, Rue de Technopole 3, 3960 Sierre, Switzerland; Service of Nuclear Medicine and Molecular Imaging, CHUV, Lausanne, Switzerland

Journal volume & issue: Vol. 43
p. 101403

Abstract

Read online

The locality and spatial field of view of image operators have played a major role in image analysis, from hand-crafted to deep learning methods. In Convolutional Neural Networks (CNNs), the field of view is traditionally set to very small values (e.g. 3 × 3 pixels) for individual kernels and grown throughout the network by cascading layers. Automatically learning or adapting the best spatial support of the kernels can be done by using large kernels. Due to the computation requirements of standard CNN architectures, this has been little investigated in the literature. However, if large receptive fields are needed to capture wider contextual information on a given task, it could be learned from the data. Obtaining an optimal receptive field with few layers is very relevant in applications with a limited amount of annotated training data, e.g. in medical imaging.We show that CNNs (2D U-Nets) with large kernels outperform similar models with standard small kernels on the task of nuclei segmentation in histopathology images. We observe that the large kernels mostly capture low-frequency information, which motivates the need for large kernels and their efficient compression via the Discrete Cosine Transform (DCT). Following this idea, we develop a U-Net model with wide and compressed DCT kernels that leads to similar performance and trends to the standard U-Net, with reduced complexity. Visualizations of the kernels in the spatial and frequency domains, as well as the effective receptive fields, provide insights into the models’ behaviors and the learned features.

Published in Informatics in Medicine Unlocked

ISSN: 2352-9148 (Online)
Publisher: Elsevier
Country of publisher: United Kingdom
LCC subjects: Medicine: Medicine (General): Computer applications to medicine. Medical informatics
Website: https://www.journals.elsevier.com/informatics-in-medicine-unlocked/

About the journal

Abstract

Keywords