Multi-Modal Convolutional Parameterisation Network for Guided Image Inverse Problems

Mikolaj Czerkawski; Priti Upadhyay; Christopher Davison; Robert Atkinson; Craig Michie; Ivan Andonovic; Malcolm Macdonald; Javier Cardona; Christos Tachtatzis

doi:10.3390/jimaging10030069

Journal of Imaging (Mar 2024)

Affiliations

Mikolaj Czerkawski: Department of Electronic and Electrical Engineering, University of Strathclyde, Glasgow G1 1XW, UK
Priti Upadhyay: Department of Electronic and Electrical Engineering, University of Strathclyde, Glasgow G1 1XW, UK
Christopher Davison: Department of Electronic and Electrical Engineering, University of Strathclyde, Glasgow G1 1XW, UK
Robert Atkinson: Department of Electronic and Electrical Engineering, University of Strathclyde, Glasgow G1 1XW, UK
Craig Michie: Department of Electronic and Electrical Engineering, University of Strathclyde, Glasgow G1 1XW, UK
Ivan Andonovic: Department of Electronic and Electrical Engineering, University of Strathclyde, Glasgow G1 1XW, UK
Malcolm Macdonald: Department of Electronic and Electrical Engineering, University of Strathclyde, Glasgow G1 1XW, UK
Javier Cardona: Department of Chemical Engineering, University of Strathclyde, Glasgow G1 1XJ, UK
Christos Tachtatzis: Department of Electronic and Electrical Engineering, University of Strathclyde, Glasgow G1 1XW, UK

DOI: https://doi.org/10.3390/jimaging10030069
Journal volume & issue: Vol. 10, no. 3, p. 69

Abstract

Several image inverse tasks, such as inpainting or super-resolution, can be solved using deep internal learning, a paradigm in which a deep neural network finds a solution by learning from the sample itself rather than from a dataset. For example, Deep Image Prior is a technique that fits a convolutional neural network to reproduce the known parts of the image (such as the non-inpainted regions or a low-resolution version of the image). However, this approach is not well suited to samples composed of multiple modalities. In some domains, such as satellite image processing, accommodating multi-modal representations could be beneficial or even essential. In this work, the Multi-Modal Convolutional Parameterisation Network (MCPN) is proposed, in which a convolutional neural network approximates the information shared between multiple modes by combining a core shared network with modality-specific head networks. The results demonstrate that these approaches can significantly outperform the single-mode adoption of a convolutional parameterisation network on the guided image inverse problems of inpainting and super-resolution.
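
The structure described in the abstract can be illustrated with a toy sketch: one shared core produces features, each modality has its own head, and the fitting loss is computed only over the known pixels of each modality. Everything below is an assumption made for illustration and is not the authors' implementation: the scalar `core` and `head` functions stand in for convolutional networks, the modality names `optical` and `sar` are hypothetical, and the images are flattened to short lists.

```python
def masked_mse(pred, target, mask):
    """Mean squared error over the pixels where mask == 1 (the known pixels)."""
    known = sum(mask)
    if known == 0:
        return 0.0  # no known pixels, so nothing to fit
    return sum(m * (p - t) ** 2 for p, t, m in zip(pred, target, mask)) / known


def core(z, weight):
    """Shared core: a scalar scale followed by ReLU (stand-in for a conv network)."""
    return [max(0.0, weight * v) for v in z]


def head(features, scale, bias):
    """Modality-specific head: an affine map of the shared features."""
    return [scale * f + bias for f in features]


def total_loss(z, params, targets, masks):
    """Sum of per-modality masked losses; all heads read the same core features."""
    features = core(z, params["core"])
    loss = 0.0
    for name, target in targets.items():
        scale, bias = params["heads"][name]
        loss += masked_mse(head(features, scale, bias), target, masks[name])
    return loss


# Hypothetical two-modality sample with four "pixels" per modality.
z = [1.0, -2.0, 3.0, 0.5]  # fixed input code
params = {"core": 2.0, "heads": {"optical": (0.5, 0.0), "sar": (1.0, 1.0)}}
targets = {
    "optical": [1.0, 9.0, 3.0, 0.5],  # second pixel is unknown (to be inpainted)
    "sar": [3.0, 1.0, 7.0, 4.0],      # fully observed guiding modality
}
masks = {"optical": [1, 0, 1, 1], "sar": [1, 1, 1, 1]}

loss = total_loss(z, params, targets, masks)
print(loss)
```

In an actual fitting loop, the core and head parameters would be optimised to reduce this loss, and the output of the `optical` head at the masked pixel would then serve as the inpainted value. Because both heads depend on the same core features, the fully observed modality constrains the solution for the incomplete one.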

Published in Journal of Imaging

ISSN: 2313-433X (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Photography; Medicine: Medicine (General): Computer applications to medicine. Medical informatics; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: http://www.mdpi.com/journal/jimaging
