Proceedings of the XXth Conference of Open Innovations Association FRUCT (May 2021)

The Best Model of Convolutional Neural Networks Combined with LSTM for the Detection of Interpersonal Physical Violence in Videos

  • Hugo David Calderon Vilca,
  • Kent Jhunior Cuadros Ramos,
  • Elmer Y. Diaz Quiroz,
  • Jorge Alexander Angeles Rojas,
  • Rene Alfredo Calderon Vilca,
  • Alejandro Apaza Tarqui

DOI
https://doi.org/10.23919/FRUCT52173.2021.9435563
Journal volume & issue
Vol. 29, no. 1
pp. 81 – 86

Abstract

Read online

Citizen insecurity is directly related to interpersonal physical violence, there are algorithms that allow detecting violence in videos, therefore it is necessary to know which is the best model for detecting violence. We compared three convolutional neural network models Xception, InceptionV3 and VGG16 each together with a recurring LSTM network, to find out which of the models is the best for the detection of interpersonal violence in videos. We train the three models using the Real Life Violence Situations data set, then we classify violence and non-violence, as a result, the InceptionV3 model is the best model, managing to classify with an accuracy of 94% compared to the VGG16 and Xception models, which obtained 88% and 93% respectively. Therefore, we recommend the InceptionV3 model for the detection of interpersonal physical violence in citizen security videos.

Keywords