Masked Feature Compression for Object Detection

Chengjie Dai; Tiantian Song; Yuxuan Jin; Yixiang Ren; Bowei Yang; Guanghua Song

doi:10.3390/math12121848

Mathematics (Jun 2024)

Affiliations

Chengjie Dai: The School of Aeronautics and Astronautics, Zhejiang University, Hangzhou 310027, China
Tiantian Song: The Department of Mathematics, The University of Manchester, Manchester M13 9PL, UK
Yuxuan Jin: The School of Aeronautics and Astronautics, Zhejiang University, Hangzhou 310027, China
Yixiang Ren: The School of Aeronautics and Astronautics, Zhejiang University, Hangzhou 310027, China
Bowei Yang: The School of Aeronautics and Astronautics, Zhejiang University, Hangzhou 310027, China
Guanghua Song: The School of Aeronautics and Astronautics, Zhejiang University, Hangzhou 310027, China

DOI: https://doi.org/10.3390/math12121848
Journal volume & issue: Vol. 12, no. 12, p. 1848

Abstract

Deploying high-accuracy detection models on lightweight edge devices (e.g., drones) is challenging because of hardware constraints. A common solution is to compress the images and transmit them to a cloud server, where powerful models can be used. However, compressing images for transmission may reduce detection accuracy. In this paper, we propose a feature compression method tailored for object detection that can be easily integrated with existing learned image compression models. Encoding consists of two steps. First, a feature extractor obtains a low-level feature, and a mask generator produces an object mask that selects the regions containing objects. Second, a neural network encoder compresses the masked feature. For decoding, a neural network decoder restores the compressed representation into a feature that can be fed directly into the object detection model. Experimental results demonstrate that our method surpasses existing compression techniques. Specifically, compared with TCM2023, one of the leading methods, our approach achieves a 25.3% reduction in compressed file size and a 6.9% increase in mAP0.5.
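
The data flow described in the abstract (extract a feature, mask it, encode, decode) can be illustrated with a toy sketch. This is not the authors' implementation: the paper uses neural networks for every stage, whereas the stand-ins here (2x2 average pooling, a fixed threshold, uniform quantization) and all function names are hypothetical, chosen only to show why masking leaves fewer symbols to code.

```python
from typing import List

Grid = List[List[float]]


def extract_feature(image: Grid) -> Grid:
    """Stand-in feature extractor: 2x2 average pooling (assumption)."""
    h, w = len(image), len(image[0])
    return [
        [
            (image[r][c] + image[r][c + 1]
             + image[r + 1][c] + image[r + 1][c + 1]) / 4.0
            for c in range(0, w - 1, 2)
        ]
        for r in range(0, h - 1, 2)
    ]


def generate_mask(feature: Grid, threshold: float) -> List[List[int]]:
    """Stand-in mask generator: 1 where the activation exceeds a threshold."""
    return [[1 if v > threshold else 0 for v in row] for row in feature]


def apply_mask(feature: Grid, mask: List[List[int]]) -> Grid:
    """Zero out every feature cell that the mask marks as background."""
    return [[v * m for v, m in zip(f_row, m_row)]
            for f_row, m_row in zip(feature, mask)]


def encode(feature: Grid, step: float) -> List[List[int]]:
    """Stand-in encoder: uniform scalar quantization to integer symbols."""
    return [[round(v / step) for v in row] for row in feature]


def decode(symbols: List[List[int]], step: float) -> Grid:
    """Stand-in decoder: map integer symbols back to feature values."""
    return [[s * step for s in row] for row in symbols]


def nonzero_count(symbols: List[List[int]]) -> int:
    """Crude proxy for coded size: number of nonzero symbols."""
    return sum(1 for row in symbols for s in row if s != 0)


# Hypothetical 4x4 "image": a bright object in the top-left corner
# on a dim background.
image = [
    [1.0, 1.0, 0.25, 0.25],
    [1.0, 1.0, 0.25, 0.25],
    [0.25, 0.25, 0.25, 0.25],
    [0.25, 0.25, 0.25, 0.25],
]

feature = extract_feature(image)
mask = generate_mask(feature, threshold=0.5)
masked_symbols = encode(apply_mask(feature, mask), step=0.25)
plain_symbols = encode(feature, step=0.25)
restored = decode(masked_symbols, step=0.25)

print("nonzero symbols without mask:", nonzero_count(plain_symbols))
print("nonzero symbols with mask:   ", nonzero_count(masked_symbols))
```

In this toy setting the masked feature keeps the object cell intact after decoding while the background cells quantize to zero, which is the intuition behind spending bits only on regions that contain objects. The actual rate and accuracy figures reported in the abstract come from the learned models, not from anything resembling this sketch.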

Published in Mathematics

ISSN: 2227-7390 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Science: Mathematics
Website: http://www.mdpi.com/journal/mathematics