CRxK dataset: a multi-view surveillance video dataset for re-enacted crimes in Korea

Chaehee An; Minyoung Lee; Eunil Park

doi:10.1038/s41598-025-15058-w

Scientific Reports (Aug 2025)

CRxK dataset: a multi-view surveillance video dataset for re-enacted crimes in Korea

Chaehee An,
Minyoung Lee,
Eunil Park

Affiliations

Chaehee An: Department of Applied Artificial Intelligence, Sungkyunkwan University
Minyoung Lee: Department of Applied Artificial Intelligence, Sungkyunkwan University
Eunil Park: Department of Applied Artificial Intelligence, Sungkyunkwan University

DOI: https://doi.org/10.1038/s41598-025-15058-w
Journal volume & issue: Vol. 15, no. 1
pp. 1 – 14

Abstract

Read online

Abstract We introduce a novel benchmark dataset, CRxK Dataset, featuring surveillance images based on meticulously re-enacted crime events with curated annotations. CRxK dataset includes a collection of crime re-enacted videos and images categorized into 13 different categories, encompassing a set of offenses such as assault, intoxication, swoon, and more. In addition, we cover a separate normal dataset extracted from scenes occurring five seconds before the crime event and spanning a 10-second duration. Among 13 categories, we employ six core categories, which are the most frequent occurrences in the collected dataset. These categories are assault, robbery, swooning, kidnapping, burglary, and normal scenarios. We conducted experiments using a total of 2,054,013 frames randomly selected and shuffled from the videos. Our training and validation involved four convolutional neural network (CNN) models and a single transformer model. We utilized a smaller sub-dataset, CRxK-6 dataset, containing 8,500 frames randomly sampled from each category video, resulting in 51,000 frames. Despite employing a train-test split ratio of 1:40 and applying face masking using RetinaFace, the dataset exhibited excellent performance with common CNN models, achieving an accuracy exceeding 0.940 for each model. However, it presented some challenges for the Transformer model.

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/

About the journal

Abstract

Keywords