Evaluation of Deformable Convolution: An Investigation in Image and Video Classification

Andrea Burgos Madrigal; Victor Romero Bautista; Raquel Díaz Hernández; Leopoldo Altamirano Robles

doi:10.3390/math12162448

Mathematics (Aug 2024)

Evaluation of Deformable Convolution: An Investigation in Image and Video Classification

Andrea Burgos Madrigal,
Victor Romero Bautista,
Raquel Díaz Hernández,
Leopoldo Altamirano Robles

Affiliations

Andrea Burgos Madrigal: Instituto Nacional de Astrofísica, Óptica y Electrónica, Puebla 72840, Mexico
Victor Romero Bautista: Instituto Nacional de Astrofísica, Óptica y Electrónica, Puebla 72840, Mexico
Raquel Díaz Hernández: Instituto Nacional de Astrofísica, Óptica y Electrónica, Puebla 72840, Mexico
Leopoldo Altamirano Robles: Instituto Nacional de Astrofísica, Óptica y Electrónica, Puebla 72840, Mexico

DOI: https://doi.org/10.3390/math12162448
Journal volume & issue: Vol. 12, no. 16
p. 2448

Abstract

Read online

Convolutional Neural Networks (CNNs) present drawbacks for modeling geometric transformations, caused by the convolution operation’s locality. Deformable convolution (DCON) is a mechanism that solves these drawbacks and improves the robustness. In this study, we clarify the optimal way to replace the standard convolution with its deformable counterpart in a CNN model. To this end, we conducted several experiments using DCONs applied in the layers that conform a small four-layer CNN model and on the four-layers of several ResNets with depths 18, 34, 50, and 101. The models were tested in binary balanced classes with 2D and 3D data. If DCON is used on the first layers of the proposal of model, the computational resources will tend to increase and produce bigger misclassification than the standard CNN. However, if the DCON is used at the end layers, the quantity of Flops will decrease, and the classification accuracy will improve by up to 20% about the base model. Moreover, it gains robustness because it can adapt to the object of interest. Also, the best kernel size of the DCON is three. With these results, we propose a guideline and contribute to understanding the impact of DCON on the robustness of CNNs.

Published in Mathematics

ISSN: 2227-7390 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Science: Mathematics
Website: http://www.mdpi.com/journal/mathematics

About the journal

Abstract

Keywords