Deep learning in crowd counting: A survey

Lijia Deng; Qinghua Zhou; Shuihua Wang; Juan Manuel Górriz; Yudong Zhang

doi:10.1049/cit2.12241

CAAI Transactions on Intelligence Technology (Oct 2024)

Deep learning in crowd counting: A survey

Lijia Deng,
Qinghua Zhou,
Shuihua Wang,
Juan Manuel Górriz,
Yudong Zhang

Affiliations

Lijia Deng: School of Computing and Mathematical Sciences University of Leicester Leicester UK
Qinghua Zhou: School of Computing and Mathematical Sciences University of Leicester Leicester UK
Shuihua Wang: School of Computing and Mathematical Sciences University of Leicester Leicester UK
Juan Manuel Górriz: Department of Signal Theory Networking and Communications University of Granada Granada Spain
Yudong Zhang: School of Computing and Mathematical Sciences University of Leicester Leicester UK

DOI: https://doi.org/10.1049/cit2.12241
Journal volume & issue: Vol. 9, no. 5
pp. 1043 – 1077

Abstract

Read online

Abstract Counting high‐density objects quickly and accurately is a popular area of research. Crowd counting has significant social and economic value and is a major focus in artificial intelligence. Despite many advancements in this field, many of them are not widely known, especially in terms of research data. The authors proposed a three‐tier standardised dataset taxonomy (TSDT). The Taxonomy divides datasets into small‐scale, large‐scale and hyper‐scale, according to different application scenarios. This theory can help researchers make more efficient use of datasets and improve the performance of AI algorithms in specific fields. Additionally, the authors proposed a new evaluation index for the clarity of the dataset: average pixel occupied by each object (APO). This new evaluation index is more suitable for evaluating the clarity of the dataset in the object counting task than the image resolution. Moreover, the authors classified the crowd counting methods from a data‐driven perspective: multi‐scale networks, single‐column networks, multi‐column networks, multi‐task networks, attention networks and weak‐supervised networks and introduced the classic crowd counting methods of each class. The authors classified the existing 36 datasets according to the theory of three‐tier standardised dataset taxonomy and discussed and evaluated these datasets. The authors evaluated the performance of more than 100 methods in the past five years on different levels of popular datasets. Recently, progress in research on small‐scale datasets has slowed down. There are few new datasets and algorithms on small‐scale datasets. The studies focused on large or hyper‐scale datasets appear to be reaching a saturation point. The combined use of multiple approaches began to be a major research direction. The authors discussed the theoretical and practical challenges of crowd counting from the perspective of data, algorithms and computing resources. The field of crowd counting is moving towards combining multiple methods and requires fresh, targeted datasets. Despite advancements, the field still faces challenges such as handling real‐world scenarios and processing large crowds in real‐time. Researchers are exploring transfer learning to overcome the limitations of small datasets. The development of effective algorithms for crowd counting remains a challenging and important task in computer vision and AI, with many opportunities for future research.

Published in CAAI Transactions on Intelligence Technology

ISSN: 2468-2322 (Online)
Publisher: Wiley
Country of publisher: United Kingdom
LCC subjects: Language and Literature: Philology. Linguistics: Computational linguistics. Natural language processing; Science: Mathematics: Instruments and machines: Electronic computers. Computer science: Computer software
Website: https://ietresearch.onlinelibrary.wiley.com/journal/24682322

About the journal

Abstract

Keywords