A Multi-Modal Egocentric Activity Recognition Approach towards Video Domain Generalization

Antonios Papadakis; Evaggelos Spyrou

doi:10.3390/s24082491

Sensors (Apr 2024)

A Multi-Modal Egocentric Activity Recognition Approach towards Video Domain Generalization

Antonios Papadakis,
Evaggelos Spyrou

Affiliations

Antonios Papadakis: Department of Informatics and Telecommunications, National Kapodistrian University of Athens, 15772 Athens, Greece
Evaggelos Spyrou: Department of Informatics and Telecommunications, University of Thessaly, 35100 Lamia, Greece

DOI: https://doi.org/10.3390/s24082491
Journal volume & issue: Vol. 24, no. 8
p. 2491

Abstract

Read online

Egocentric activity recognition is a prominent computer vision task that is based on the use of wearable cameras. Since egocentric videos are captured through the perspective of the person wearing the camera, her/his body motions severely complicate the video content, imposing several challenges. In this work we propose a novel approach for domain-generalized egocentric human activity recognition. Typical approaches use a large amount of training data, aiming to cover all possible variants of each action. Moreover, several recent approaches have attempted to handle discrepancies between domains with a variety of costly and mostly unsupervised domain adaptation methods. In our approach we show that through simple manipulation of available source domain data and with minor involvement from the target domain, we are able to produce robust models, able to adequately predict human activity in egocentric video sequences. To this end, we introduce a novel three-stream deep neural network architecture combining elements of vision transformers and residual neural networks which are trained using multi-modal data. We evaluate the proposed approach using a challenging, egocentric video dataset and demonstrate its superiority over recent, state-of-the-art research works.

Published in Sensors

ISSN: 1424-8220 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Chemical technology
Website: http://www.mdpi.com/journal/sensors

About the journal

Abstract

Keywords