MRSNet: Multi-Resolution Scale Feature Fusion-Based Universal Density Counting Network

Yi Zhang; Wei Song; Mingyue Shao; Xiangchun Liu

doi:10.3390/s24185974

Sensors (Sep 2024)

MRSNet: Multi-Resolution Scale Feature Fusion-Based Universal Density Counting Network

Yi Zhang,
Wei Song,
Mingyue Shao,
Xiangchun Liu

Affiliations

Yi Zhang: School of Information and Engineering, Minzu University of China, Beijing 100081, China
Wei Song: School of Information and Engineering, Minzu University of China, Beijing 100081, China
Mingyue Shao: School of Information and Engineering, Minzu University of China, Beijing 100081, China
Xiangchun Liu: School of Information and Engineering, Minzu University of China, Beijing 100081, China

DOI: https://doi.org/10.3390/s24185974
Journal volume & issue: Vol. 24, no. 18
p. 5974

Abstract

Read online

This study focuses on the problem of dense object counting. In dense scenes, variations in object scales and uneven distributions greatly hinder counting accuracy. The current methods, whether CNNs with fixed convolutional kernel sizes or Transformers with fixed attention sizes, struggle to handle such variability effectively. Lower-resolution features are more sensitive to larger objects closer to the camera, while higher-resolution features are more efficient for smaller objects further away. Thus, preserving features that carry the most relevant information at each scale is crucial for improving counting precision. Motivated by this, we propose a multi-resolution scale feature fusion-based universal density counting network (MRSNet). It utilizes independent modules to process high- and low-resolution features, adaptively adjusts receptive field sizes, and incorporates dynamic sparse attention mechanisms to optimize feature information at each resolution, by integrating optimal features across multiple scales into density maps for counting evaluation. Our proposed network effectively mitigates issues caused by large variations in object scales, thereby enhancing counting accuracy. Furthermore, extensive quantitative analyses on six public datasets demonstrate the algorithm’s strong generalization ability in handling diverse object scale variations.

Published in Sensors

ISSN: 1424-8220 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Chemical technology
Website: http://www.mdpi.com/journal/sensors

About the journal

Abstract

Keywords