Electronics (Jun 2023)

Learning Strategies for Sensitive Content Detection

  • Daniel Povedano Álvarez
  • Ana Lucila Sandoval Orozco
  • Javier Portela García-Miguel
  • Luis Javier García Villalba

DOI
https://doi.org/10.3390/electronics12112496
Journal volume & issue
Vol. 12, no. 11
p. 2496

Abstract

Currently, the volume of sensitive content on the Internet, such as pornography and child pornography, together with the amount of time that people (especially children) spend online, has led to an increase in the distribution of such content (e.g., images of children being sexually abused, real-time videos of such abuse, and grooming activities). Effective IT tools that automate the detection and blocking of this type of material are therefore essential, as manual filtering of such huge volumes of data is practically impossible. The goal of this study is to carry out a comprehensive review of the learning strategies for sensitive-content detection available in the literature, from the most conventional techniques to the most cutting-edge deep learning algorithms, highlighting the strengths and weaknesses of each, as well as the datasets used. The performance and scalability of the strategies reviewed in this work depend on the heterogeneity of the dataset, the feature extraction techniques (hashes, visual, audio, etc.), and the learning algorithms. Finally, new lines of research in sensitive-content detection are presented.

Keywords