Who danced better? ranked tiktok dance video dataset and pairwise action quality assessment method

Irwandi Hipiny; Hamimah Ujir; Aidil Azli Alias; Musdi Shanat; Mohamad Khairi Ishak

doi:10.26555/ijain.v9i1.919

IJAIN (International Journal of Advances in Intelligent Informatics) (Mar 2023)

Who danced better? ranked tiktok dance video dataset and pairwise action quality assessment method

Irwandi Hipiny,
Hamimah Ujir,
Aidil Azli Alias,
Musdi Shanat,
Mohamad Khairi Ishak

Affiliations

Irwandi Hipiny: Universiti Malaysia Sarawak
Hamimah Ujir: Universiti Malaysia Sarawak
Aidil Azli Alias: Universiti Malaysia Sarawak
Musdi Shanat: Universiti Malaysia Sarawak
Mohamad Khairi Ishak: Universiti Sains Malaysia

DOI: https://doi.org/10.26555/ijain.v9i1.919
Journal volume & issue: Vol. 9, no. 1
pp. 96 – 107

Abstract

Read online

Video-based action quality assessment (AQA) is a non-trivial task due to the subtle visual differences between data produced by experts and non-experts. Current methods are extended from the action recognition domain where most are based on temporal pattern matching. AQA has additional requirements where order and tempo matter for rating the quality of an action. We present a novel dataset of ranked TikTok dance videos, and a pairwise AQA method for predicting which video of a same-label pair was sourced from the better dancer. Exhaustive pairings of same-label videos were randomly assigned to 100 human annotators, ultimately producing a ranked list per label category. Our method relies on a successful detection of the subject’s 2D pose inside successive query frames where the order and tempo of actions are encoded inside a produced String sequence. The detected 2D pose returns a top-matching Visual word from a Codebook to represent the current frame. Given a same-label pair, we generate a String value of concatenated Visual words for each video. By computing the edit distance score between each String value and the Gold Standard’s (i.e., the top-ranked video(s) for that label category), we declare the video with the lower score as the winner. The pairwise AQA method is implemented using two schemes, i.e., with and without text compression. Although the average precision for both schemes over 12 label categories is low, at 0.45 with text compression and 0.48 without, precision values for several label categories are comparable to past methods’ (median: 0.47, max: 0.66).

Published in IJAIN (International Journal of Advances in Intelligent Informatics)

ISSN: 2442-6571 (Print); 2548-3161 (Online)
Publisher: Universitas Ahmad Dahlan
Country of publisher: Indonesia
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: http://ijain.org/index.php/IJAIN/index

About the journal

Abstract

Keywords