Two-stage video-based convolutional neural networks for adult spinal deformity classification

Kaixu Chen; Tomoyuki Asada; Naoto Ienaga; Kousei Miura; Kotaro Sakashita; Takahiro Sunami; Hideki Kadone; Masashi Yamazaki; Yoshihiro Kuroda

doi:10.3389/fnins.2023.1278584

Frontiers in Neuroscience (Dec 2023)

Affiliations

Kaixu Chen: Degree Programs in Systems and Information Engineering, University of Tsukuba, Tsukuba, Japan
Tomoyuki Asada: Department of Orthopaedic Surgery, Institute of Medicine, University of Tsukuba, Tsukuba, Japan
Naoto Ienaga: Center for Cybernics Research, University of Tsukuba, Tsukuba, Japan
Kousei Miura: Department of Orthopaedic Surgery, Institute of Medicine, University of Tsukuba, Tsukuba, Japan
Kotaro Sakashita: Department of Orthopaedic Surgery, Institute of Medicine, University of Tsukuba, Tsukuba, Japan
Takahiro Sunami: Department of Orthopaedic Surgery, Institute of Medicine, University of Tsukuba, Tsukuba, Japan
Hideki Kadone: Department of Orthopaedic Surgery, Institute of Medicine, University of Tsukuba, Tsukuba, Japan; Center for Cybernics Research, University of Tsukuba, Tsukuba, Japan
Masashi Yamazaki: Department of Orthopaedic Surgery, Institute of Medicine, University of Tsukuba, Tsukuba, Japan
Yoshihiro Kuroda: Division of Intelligent Interaction Technologies, Institute of Systems and Information Engineering, University of Tsukuba, Tsukuba, Japan

Journal volume & issue: Vol. 17

Abstract


Introduction: Assessment of human gait posture can be clinically effective in diagnosing human gait deformities early in life. Currently, two methods, static and dynamic, are used to diagnose adult spinal deformity (ASD) and other spinal disorders. The standard static method uses full-spine lateral standing radiographs. However, this assesses the joints only in the standing position and provides no information on how the joints change when the patient walks. Careful observation of long-distance walking can provide a dynamic assessment that reveals an uncompensated posture; however, this increases the workload of medical practitioners. A three-dimensional (3D) motion system has been proposed for the dynamic method. Although the motion system successfully detected dynamic posture changes, access to such facilities is limited. Therefore, a diagnostic approach that is facility-independent, adds little to the clinical workflow, and does not involve patient contact is required.

Methods: We focused on a video-based method to classify patients with spinal disorders as having either ASD or another spinal disorder (non-ASD). To achieve this goal, we present a video-based two-stage machine-learning method. In the first stage, deep learning methods are used to locate the patient and extract the area where the patient is located. In the second stage, a 3D convolutional neural network (CNN) is used to capture spatial and temporal information (dynamic motion) from the extracted frames. Disease classification is performed by discerning posture and gait from the extracted frames. Model performance was assessed using the mean accuracy, F1 score, and area under the receiver operating characteristic curve (AUROC), with five-fold cross-validation. We also compared the final results with professional observations.

Results: Our experiments were conducted using a gait video dataset comprising 81 patients. The experimental results indicated that our method is effective for classifying ASD and other spinal disorders. The proposed method achieved a mean accuracy of 0.7553, an F1 score of 0.7063, and an AUROC score of 0.7864. Additionally, ablation experiments indicated the importance of the first stage (detection stage) and of transfer learning in our proposed method.

Discussion: The observations of two doctors were compared with the results of the proposed method. The two doctors achieved mean accuracies of 0.4815 and 0.5247, with AUROC scores of 0.5185 and 0.5463, respectively. These results indicate that, using videos of 1 s duration, the proposed method can achieve more accurate and reliable results than the doctors' observations. All our code, models, and results are available at https://github.com/ChenKaiXuSan/Walk_Video_PyTorch. The proposed framework provides a potential video-based method for improving the clinical diagnosis of ASD and non-ASD. This framework might, in turn, help both patients and clinicians to treat the disease quickly and directly, and further reduce facility dependency through data-driven systems.
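
The evaluation protocol named in the Methods (mean accuracy, F1 score, and AUROC under five-fold cross-validation) can be illustrated with a minimal, dependency-free Python sketch. It is not the authors' implementation, which is in the linked repository; the function names, the choice of ASD as the positive class (label 1), the unshuffled contiguous folds, and the toy labels are all assumptions made for illustration.

```python
from typing import List, Sequence


def accuracy(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Fraction of samples whose predicted label equals the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def f1_score(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """F1 for the positive class (assumed here to be label 1 = ASD)."""
    tp = sum(t == 1 and p == 1 for t, p in zip(y_true, y_pred))
    fp = sum(t == 0 and p == 1 for t, p in zip(y_true, y_pred))
    fn = sum(t == 1 and p == 0 for t, p in zip(y_true, y_pred))
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def auroc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """Probability that a random positive outscores a random negative.

    Ties count as half a win; this equals the area under the ROC curve.
    """
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("AUROC needs at least one positive and one negative")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def k_fold_indices(n_samples: int, k: int = 5) -> List[List[int]]:
    """Split indices 0..n_samples-1 into k contiguous, near-equal folds.

    Assumption: no shuffling or stratification; a real study would
    typically randomize and keep each patient's clips in a single fold.
    """
    base, extra = divmod(n_samples, k)
    folds, start = [], 0
    for i in range(k):
        size = base + (1 if i < extra else 0)
        folds.append(list(range(start, start + size)))
        start += size
    return folds


def mean(values: Sequence[float]) -> float:
    """Arithmetic mean, used to average a metric over the folds."""
    return sum(values) / len(values)


if __name__ == "__main__":
    # Toy labels and scores, invented for illustration (not study data).
    y_true = [1, 0, 1, 0]
    y_pred = [1, 0, 0, 0]
    scores = [0.9, 0.2, 0.4, 0.6]
    print(accuracy(y_true, y_pred), f1_score(y_true, y_pred),
          auroc(y_true, scores))
    # 81 patients split into five folds of near-equal size.
    print([len(fold) for fold in k_fold_indices(81)])
```

In such a protocol, each fold serves once as the held-out test set while the model is trained on the other four, and `mean` averages the per-fold metric values into the reported figures.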

Published in Frontiers in Neuroscience

ISSN: 1662-4548 (Print); 1662-453X (Online)
Publisher: Frontiers Media S.A.
Country of publisher: Switzerland
LCC subjects: Medicine: Internal medicine: Neurosciences. Biological psychiatry. Neuropsychiatry
Website: http://www.frontiersin.org/neuroscience
