IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing (Jan 2022)

Bounding Box-Free Instance Segmentation Using Semi-Supervised Iterative Learning for Vehicle Detection

  • Osmar Luiz Ferreira de Carvalho,
  • Osmar Abilio de Carvalho Junior,
  • Anesmar Olino de Albuquerque,
  • Nickolas Castro Santana,
  • Renato Fontes Guimaraes,
  • Roberto Arnaldo Trancoso Gomes,
  • Dibio Leandro Borges

DOI
https://doi.org/10.1109/JSTARS.2022.3169128
Journal volume & issue
Vol. 15
pp. 3403 – 3420

Abstract

Read online

Vehicle classification is a hot computer vision topic, with studies ranging from ground-view to top-view imagery. Top-view images allow understanding city patterns, traffic management, among others. However, there are some difficulties for pixel-wise classification: most vehicle classification studies use object detection methods, and most publicly available datasets are designed for this task, creating instance segmentation datasets is laborious, and traditional instance segmentation methods underperform on this task since the objects are small. Thus, the present research objectives are as follows: first, propose a novel semisupervised iterative learning approach using the geographic information system software, second, propose a box-free instance segmentation approach, and third, provide a city-scale vehicle dataset. The iterative learning procedure considered the following: first, labeling a few vehicles from the entire scene, second, choosing training samples near those areas, third, training the deep learning model (U-net with efficient-net-B7 backbone), fourth, classifying the whole scene, fifth, converting the predictions into shapefile, sixth, correcting areas with wrong predictions, seventh, including them in the training data, eighth repeating until results are satisfactory. We considered vehicle interior and borders to separate instances using a semantic segmentation model. When removing the borders, the vehicle interior becomes isolated, allowing for unique object identification. Our procedure is very efficient and accurate for generating data iteratively, which resulted in 122 567 mapped vehicles. Metrics-wise, our method presented higher intersection over union when compared to box-based methods (82% against 72%), and per-object metrics surpassed 90% for precision and recall.

Keywords