ICTACT Journal on Image and Video Processing (May 2022)

IMAGE CLASSIFICATION USING MODEL ENSEMBLING

  • Debabrata Datta,
  • Anweshan Mukherjee,
  • Soumen Mukherjee,
  • Arup Kr. Bhattacharjee,
  • Anal Acharya

DOI
https://doi.org/10.21917/ijivp.2022.0381
Journal volume & issue
Vol. 12, no. 4
pp. 2679 – 2692

Abstract

Read online

Classifying images efficiently using various algorithms is very useful now-a-days given that the field of computer vision is growing rapidly. The research work highlighted in this paper focuses on the independent use of various models to classify images and then combining them together to form a better model in terms of performance than each of the individual models. The dataset used consists of 200 classes with 90,000 training images, 10,000 validation images and 10,000 test images. The data preparation step in this work involves resizing the images (data), shuffling them and transforming them into a data generator to provide input to the models. The images were also augmented using two different sets of image transformation effects to get more data for the models to train on. These data were then used to train five different models (one model trained from scratch and four other models using pre-trained weights and transfer learning) independently. The performance of each model was judged by checking two evaluation metrics –f1-score and categorical accuracy. The models were also tried to be fine-tuned to get a better performance, and finally the models were ensembled together to get a better categorical accuracy and f1-score on unseen (validation and test) data.

Keywords