International Journal of Advanced Robotic Systems (Sep 2024)

Development of a human-following scheme using point-voxel RCNN-based 3D human leg detection for the robust human-following of mobile robots in cluttered environments

  • Jaehong Park,
  • Jun Hyeong Jo,
  • Chang-bae Moon

DOI
https://doi.org/10.1177/17298806241287313
Journal volume & issue
Vol. 21

Abstract

Read online

Mobile robots often follow humans in warehouse environments or indoor office spaces. A mobile robot requires a control system that stably follows humans without colliding with them. A human-following robot is composed of target detection, target tracking, and control. Conventional 2D laser-based human following systems exploit information from 2D planar laser data to detect human legs through machine learning methods, such as support vector data description and random forest. However, in crowded or cluttered environments, 2D LiDAR data is limited by the lack of features that distinguish human legs from obstacles. Due to the lack of features, false-positive detection is a problem in crowded or cluttered environments. Recent studies using 3D LiDAR have used sensors mounted at an elevated height to measure the overall shape of a person to extract features. Typical mobile robots are mounted in the bottom due to the vibration of the sensors, so there is a cost problem that additional sensors are required. We propose a framework for human following using 3D LiDAR. Our method is able to detect human legs using a LiDAR sensor attached at a low position without additional sensors. This study proposes a 3D laser-based human leg detection and tracking framework to improve the robustness of human-following for autonomous mobile robots. With a deep learning-based human leg detector, using the Point-Voxel-RCNN model, the proposed 3D human leg tracking system can help robots robustly follow humans in cluttered and crowded environments. Additionally, we demonstrate the robustness of our method in a practical cluttered environment by comparing the performance of a conventional human leg detection and following system.