International Journal of Advanced Robotic Systems (Dec 2015)
Vision-Based Recognition of Activities by a Humanoid Robot
Abstract
We present an autonomous assistive robotic system for human activity recognition from video sequences. Because video captured from a moving robot exhibits far greater variability than video from a fixed camera, and because the robot's computing resources are limited, the implementation was guided by robustness to this variability and by efficiency in memory and computing speed. To accommodate variability in motion speed across users, we encode motion using dense interest point trajectories. Our recognition model combines the resulting dense interest point bag-of-words representation with an intersection kernel SVM, which better accommodates the large intra-class variability that arises when a robot operates in different locations and conditions. To put the engine as implemented on the robot in context, we compare it with recent approaches to human action recognition evaluated on public (non-robot) datasets, including a novel approach of our own based on a two-layer SVM-hidden conditional random field sequential recognition model, whose performance is among the best in the recent state of the art. We show that our robot-based recognition engine, while less accurate than the sequential model, nonetheless performs well, especially given the robot's adverse test conditions relative to those of a fixed camera.
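As an illustration of the classification step described above, the following is a minimal sketch of a histogram intersection kernel used with an SVM over bag-of-words histograms. It is not the paper's implementation: the data, dimensions, and names (intersection_kernel, train_bow, and so on) are hypothetical, and scikit-learn's SVC with a precomputed Gram matrix is assumed as the classifier.

    import numpy as np
    from sklearn.svm import SVC

    def intersection_kernel(X, Y):
        """Histogram intersection kernel K(x, y) = sum_i min(x_i, y_i),
        evaluated for every pair of rows of X and Y."""
        return np.minimum(X[:, None, :], Y[None, :, :]).sum(axis=2)

    # Hypothetical bag-of-words histograms (rows = video clips,
    # columns = visual codewords) and activity labels.
    rng = np.random.default_rng(0)
    train_bow = rng.random((40, 500))
    train_labels = rng.integers(0, 5, size=40)   # five activity classes
    test_bow = rng.random((10, 500))

    # Train an SVM on the precomputed train-vs-train Gram matrix,
    # then classify new clips via their kernel values against the training set.
    clf = SVC(kernel="precomputed")
    clf.fit(intersection_kernel(train_bow, train_bow), train_labels)
    predictions = clf.predict(intersection_kernel(test_bow, train_bow))

The kernel scores two clips by the bin-wise overlap of their histograms, so a test clip that only partially matches a training example still receives credit for the codewords they share; this tolerance to partial mismatch is one reason intersection kernels suit the high intra-class variability described in the abstract.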