Известия высших учебных заведений. Поволжский регион:Технические науки (Mar 2022)

A method for reconstructing 3D scenes based on the use of convolutional neural networks, filtering by distance and using an “octree”

  • Yu.V. Dubenko,
  • E.E. Dyshkant,
  • N.N. Timchenko,
  • N.A. Rudeshko

DOI
https://doi.org/10.21685/2072-3059-2021-4-4
Journal volume & issue
no. 4

Abstract

Read online

Background. The object of research is machine vision systems. The subject of research is the methods of reconstruction of three-dimensional scenes. The aim of the work is to develop a method for reconstruction of three-dimensional scenes, including the stages of identification of three-dimensional objects, segmentation (selection) and filtering of their constituent points in the original cloud. Materials and methods. The Block-Matching Algorithm method used to generate a depth map, the Mask R-CNN convolutional neural network – to identify and segment objects in the external environment, the OctTree method – to filter the point cloud, the Delaunay triangulation method - to generate a threedimensional model. Results. Based on the proposed method for reconstructing threedimensional scenes, a software product was developed in the python programming languages (TensorFlow virtual environment for implementing the Mask RCNN convolutional network) and C #, which implements the formation of a three-dimensional model of the road surface. Conclusions. The resulting three-dimensional model of the road surface can then be used to determine the following parameters: boundaries, axis of the road surface, geometric dimensions of defects (potholes, waves, depressions, chipping, sweating, protrusions, cracks), longitudinal evenness index (IRI).

Keywords