Long-Tailed Metrics and Object Detection in Camera Trap Datasets

Wentong He; Ze Luo; Xinyu Tong; Xiaoyi Hu; Can Chen; Zufei Shu

doi:10.3390/app13106029

Applied Sciences (May 2023)

Long-Tailed Metrics and Object Detection in Camera Trap Datasets

Wentong He,
Ze Luo,
Xinyu Tong,
Xiaoyi Hu,
Can Chen,
Zufei Shu

Affiliations

Wentong He: Computer Network Information Center, Chinese Academy of Sciences, Beijing 100190, China
Ze Luo: Computer Network Information Center, Chinese Academy of Sciences, Beijing 100190, China
Xinyu Tong: Computer Network Information Center, Chinese Academy of Sciences, Beijing 100190, China
Xiaoyi Hu: Computer Network Information Center, Chinese Academy of Sciences, Beijing 100190, China
Can Chen: Computer Network Information Center, Chinese Academy of Sciences, Beijing 100190, China
Zufei Shu: Guangdong Chebaling National Nature Reserve, Shaoguan 512528, China

DOI: https://doi.org/10.3390/app13106029
Journal volume & issue: Vol. 13, no. 10
p. 6029

Abstract

Read online

With their advantages in wildlife surveys and biodiversity monitoring, camera traps are widely used, and have been used to gather massive amounts of animal images and videos. The application of deep learning techniques has greatly promoted the analysis and utilization of camera trap data in biodiversity management and conservation. However, the long-tailed distribution of the camera trap dataset can degrade the deep learning performance. In this study, for the first time, we quantified the long-tailedness of class and object/box-level scale imbalance of camera trap datasets. In the camera trap dataset, the imbalance problem is prevalent and severe, in terms of class and object/box-level scale. The camera trap dataset has worse object/box-level scale imbalance, and too few samples of small objects, making deep learning more challenging. Furthermore, we used the BatchFormer module to exploit sample relationships, and improved the performance of the general object detection model, DINO, by up to 2.9% and up to 3.3% in terms of class imbalance and object/box-level scale imbalance. The experimental results showed that the sample relationship was simple and effective, improving detection performance in terms of class and object/box-level scale imbalance, but that it could not make up for the low number of small objects in the camera trap dataset.

Published in Applied Sciences

ISSN: 2076-3417 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Engineering (General). Civil engineering (General); Science: Biology (General); Science: Physics; Science: Chemistry
Website: http://www.mdpi.com/journal/applsci

About the journal

Abstract

Keywords