Technologies (Apr 2023)

Towards Safe Visual Navigation of a Wheelchair Using Landmark Detection

  • Christos Sevastopoulos,
  • Mohammad Zaki Zadeh,
  • Michail Theofanidis,
  • Sneh Acharya,
  • Nishi Patel,
  • Fillia Makedon

DOI
https://doi.org/10.3390/technologies11030064
Journal volume & issue
Vol. 11, no. 3
p. 64

Abstract

Read online

This article presents a method for extracting high-level semantic information through successful landmark detection using 2D RGB images. In particular, the focus is placed on the presence of particular labels (open path, humans, staircase, doorways, obstacles) in the encountered scene, which can be a fundamental source of information enhancing scene understanding and paving the path towards the safe navigation of the mobile unit. Experiments are conducted using a manual wheelchair to gather image instances from four indoor academic environments consisting of multiple labels. Afterwards, the fine-tuning of a pretrained vision transformer (ViT) is conducted, and the performance is evaluated through an ablation study versus well-established state-of-the-art deep architectures for image classification such as ResNet. Results show that the fine-tuned ViT outperforms all other deep convolutional architectures while achieving satisfactory levels of generalization.

Keywords