IEEE Transactions on Neural Systems and Rehabilitation Engineering (Jan 2024)
BlazePose-Seq2Seq: Leveraging Regular RGB Cameras for Robust Gait Assessment
Abstract
Evaluating human gait with smartphone-based pose estimation offers an attractive alternative to costly, lab-bound instrumented assessment and enables real-time gait capture for clinical use. Smartphone-based systems such as OpenPose and BlazePose have demonstrated potential for virtual motion assessment, but they do not yet meet the accuracy and repeatability standards required for clinical viability. Seq2seq architectures offer an alternative to conventional deep learning techniques for predicting joint kinematics during gait. This study introduces a novel enhancement to the lightweight BlazePose algorithm by incorporating a Seq2seq autoencoder deep learning model. To ensure data accuracy and reliability, synchronized motion capture involving an RGB camera and ten Vicon cameras was employed across three distinct self-selected walking speeds. This investigation presents a promising avenue for remote gait assessment, harnessing Seq2seq architectures inspired by natural language processing (NLP) to enhance pose estimation accuracy. Compared with BlazePose alone, combining BlazePose with a 1D convolutional Long Short-Term Memory network (1D-LSTM), a Gated Recurrent Unit (GRU), and a Long Short-Term Memory (LSTM) network reduced the average mean absolute error of the left ankle joint angle from 13.4° to 5.3° for fast gait, from 16.3° to 7.5° for normal gait, and from 15.5° to 7.5° for slow gait. The use of synchronized data and rigorous testing methodology further supports the robustness and credibility of these findings.
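To illustrate the general idea described above, the following is a minimal sketch of an LSTM-based Seq2seq autoencoder that maps a window of BlazePose-derived joint angles to corrected angles approximating a motion-capture reference. It is not the authors' implementation; the number of angles, window length, hidden size, and L1 (MAE-style) loss are illustrative assumptions.

```python
# Minimal illustrative sketch (assumed hyperparameters, not the paper's model):
# an LSTM encoder-decoder that refines BlazePose joint-angle sequences.
import torch
import torch.nn as nn

class Seq2SeqCorrector(nn.Module):
    def __init__(self, n_angles=12, hidden=64):
        super().__init__()
        self.encoder = nn.LSTM(n_angles, hidden, batch_first=True)
        self.decoder = nn.LSTM(n_angles, hidden, batch_first=True)
        self.head = nn.Linear(hidden, n_angles)

    def forward(self, x):
        # x: (batch, time, n_angles) joint angles estimated by BlazePose
        _, (h, c) = self.encoder(x)        # compress the sequence into a state
        out, _ = self.decoder(x, (h, c))   # decode conditioned on that summary
        return self.head(out)              # per-frame corrected joint angles

model = Seq2SeqCorrector()
blazepose_angles = torch.randn(8, 100, 12)   # 8 gait windows, 100 frames each
reference_angles = torch.randn(8, 100, 12)   # synchronized mocap reference (dummy)
corrected = model(blazepose_angles)          # (8, 100, 12)
loss = nn.functional.l1_loss(corrected, reference_angles)  # mean absolute error
loss.backward()
```

In this formulation the GRU and 1D-convolutional variants reported in the abstract would simply swap the recurrent encoder/decoder layers while keeping the same sequence-to-sequence training setup.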
Keywords