Complex & Intelligent Systems (Oct 2022)

Gaussian guided frame sequence encoder network for action quality assessment

  • Ming-Zhe Li,
  • Hong-Bo Zhang,
  • Li-Jia Dong,
  • Qing Lei,
  • Ji-Xiang Du

DOI
https://doi.org/10.1007/s40747-022-00892-6
Journal volume & issue
Vol. 9, no. 2
pp. 1963 – 1974

Abstract

Read online

Abstract Can a computer evaluate an athlete’s performance automatically? Many action quality assessment (AQA) methods have been proposed in recent years. Limited by the randomness of video sampling and the simple strategy of model training, the performance of the existing AQA methods can still be further improved. To achieve this goal, a Gaussian guided frame sequence encoder network is proposed in this paper. In the proposed method, the image feature of each video frame is extracted by Resnet model. And then, a frame sequence encoder network is applied to model temporal information and generate action quality feature. Finally, a fully connected network is designed to predict action quality score. To train the proposed method effectively, inspired by the final score calculation rule in Olympic game, Gaussian loss function is employed to compute the error between the predicted score and the label score. The proposed method is implemented on the AQA-7 and MTL–AQA datasets. The experimental results confirm that compared with the state-of-the-art methods, our proposed method achieves the better performance. And detailed ablation experiments are conducted to verify the effectiveness of each component in the module.

Keywords