Enhancing the Transformer Model with a Convolutional Feature Extractor Block and Vector-Based Relative Position Embedding for Human Activity Recognition

Xin Guo; Young Kim; Xueli Ning; Se Dong Min

doi:10.3390/s25020301

Sensors (Jan 2025)

Affiliations

Xin Guo: Department of Software Convergence, Soonchunhyang University, Asan 31538, Republic of Korea
Young Kim: Department of Software Convergence, Soonchunhyang University, Asan 31538, Republic of Korea
Xueli Ning: Department of Software Convergence, Soonchunhyang University, Asan 31538, Republic of Korea
Se Dong Min: Department of Software Convergence, Soonchunhyang University, Asan 31538, Republic of Korea

DOI: https://doi.org/10.3390/s25020301
Journal volume & issue: Vol. 25, no. 2, p. 301

Abstract

The Transformer model has received significant attention in Human Activity Recognition (HAR) because its self-attention mechanism captures long-range dependencies in time series. However, for Inertial Measurement Unit (IMU) sensor time-series signals, the Transformer model does not effectively exploit the a priori information that these signals have strong, complex temporal correlations. We therefore propose stacking multiple convolutional layers into a Convolutional Feature Extractor Block (CFEB). The CFEB enables the Transformer model to leverage both local and global time-series features for activity classification. In addition, the absolute position embedding (APE) used in existing Transformer models cannot accurately represent the distance relationship between individual samples at different time points. To further explore positional correlations in temporal signals, this paper introduces the Vector-based Relative Position Embedding (vRPE), which aims to give the Transformer model more relative temporal position information from the sensor signals. Combining these two enhancements, we conduct extensive experiments on three HAR benchmark datasets: KU-HAR, UniMiB SHAR, and USC-HAD. Experimental results demonstrate that the proposed enhancement scheme substantially improves the performance of the Transformer model in HAR.
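The two ideas named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the CFEB layers or how vRPE is computed, so the code below only shows the generic building blocks under stated assumptions. The function names `conv1d_same` and `relative_position_index`, the kernel values, and the clipping parameter `max_distance` are all hypothetical and chosen for illustration. The first function computes a local feature over a sliding window of a 1-D signal, and the second builds the table of clipped pairwise offsets that a relative position embedding would typically index into.

```python
from typing import List


def conv1d_same(signal: List[float], kernel: List[float]) -> List[float]:
    """Slide `kernel` over `signal` (cross-correlation, as in deep learning
    frameworks), zero-padding the ends so the output has the input length.
    Illustrative stand-in for one convolutional layer; not the paper's CFEB."""
    half = len(kernel) // 2
    padded = [0.0] * half + list(signal) + [0.0] * half
    return [
        sum(k * padded[t + i] for i, k in enumerate(kernel))
        for t in range(len(signal))
    ]


def relative_position_index(seq_len: int, max_distance: int) -> List[List[int]]:
    """Entry [i][j] is the offset j - i, clipped to [-max_distance, max_distance]
    and shifted into the range 0 .. 2 * max_distance so it can index a table of
    learned embeddings. Generic relative-position scheme; an assumption here,
    not the paper's vRPE definition."""
    return [
        [
            max(-max_distance, min(max_distance, j - i)) + max_distance
            for j in range(seq_len)
        ]
        for i in range(seq_len)
    ]


if __name__ == "__main__":
    # A difference-style kernel responds to local changes in the signal.
    print(conv1d_same([1.0, 2.0, 3.0, 4.0], [1.0, 0.0, -1.0]))
    # Each row depends only on the distance between time steps, not on
    # their absolute positions.
    for row in relative_position_index(4, 2):
        print(row)
```

Unlike an absolute position embedding, every pair of time steps separated by the same offset maps to the same index here, which is the property a relative scheme relies on.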

Published in Sensors

ISSN: 1424-8220 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Chemical technology
Website: http://www.mdpi.com/journal/sensors
