FineTea: A Novel Fine-Grained Action Recognition Video Dataset for Tea Ceremony Actions

Changwei Ouyang; Yun Yi; Hanli Wang; Jin Zhou; Tao Tian

doi:10.3390/jimaging10090216

Journal of Imaging (Aug 2024)

Affiliations

Changwei Ouyang: School of Mathematics and Computer Science, Gannan Normal University, Ganzhou 341000, China
Yun Yi: School of Mathematics and Computer Science, Gannan Normal University, Ganzhou 341000, China
Hanli Wang: Department of Computer Science and Technology, Tongji University, Shanghai 201804, China
Jin Zhou: School of Mathematics and Computer Science, Gannan Normal University, Ganzhou 341000, China
Tao Tian: School of Computer Science and Artificial Intelligence, Chaohu University, Hefei 238024, China

DOI: https://doi.org/10.3390/jimaging10090216
Journal volume & issue: Vol. 10, no. 9, p. 216

Abstract

Deep learning methods have achieved great success in video action recognition. However, limitations can arise when these methods are applied to real-world scenarios that require fine-grained analysis of actions, such as a tea ceremony. To promote the development of fine-grained action recognition, a fine-grained video action dataset, FineTea, is constructed by collecting videos of tea ceremony actions. The dataset contains 2745 video clips, which a hierarchical fine-grained action classification approach divides into 9 basic action classes and 31 fine-grained action subclasses. To better model the fine-grained temporal structure of tea ceremony actions, a method named TSM-ConvNeXt is proposed that integrates a temporal shift module (TSM) into the high-performance convolutional neural network ConvNeXt. Compared to a baseline method using ResNet50, TSM-ConvNeXt improves experimental performance by 7.31%. Furthermore, on the FineTea and Diving48 datasets, the proposed approach achieves the best experimental results among the compared state-of-the-art action recognition methods. The FineTea dataset is publicly available.
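
The abstract does not describe how the TSM (temporal shift module) is wired into ConvNeXt, so the following is only a minimal NumPy sketch of the generic temporal shift operation, not the authors' implementation. The function name `temporal_shift`, the `(batch, time, channels, height, width)` layout, and the `fold_div=8` default are assumptions made for illustration.

```python
import numpy as np


def temporal_shift(x, fold_div=8):
    """Shift a fraction of the channels along the time axis, zero-padded.

    x: array of shape (batch, time, channels, height, width) (assumed layout).
    fold_div: assumed setting; 1/fold_div of the channels move in each direction.
    """
    n, t, c, h, w = x.shape
    fold = c // fold_div
    out = np.zeros_like(x)
    # First channel group: each frame receives features from the next frame.
    out[:, :-1, :fold] = x[:, 1:, :fold]
    # Second channel group: each frame receives features from the previous frame.
    out[:, 1:, fold:2 * fold] = x[:, :-1, fold:2 * fold]
    # Remaining channels are copied through unchanged.
    out[:, :, 2 * fold:] = x[:, :, 2 * fold:]
    return out


# Toy clip: 1 video, 3 frames, 8 channels, 1x1 spatial size.
clip = np.arange(24, dtype=float).reshape(1, 3, 8, 1, 1)
shifted = temporal_shift(clip)
```

Because the shift only moves existing features between neighbouring frames, it adds no parameters; a 2D convolution applied afterwards sees information from adjacent frames in the shifted channels.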

Published in Journal of Imaging

ISSN: 2313-433X (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Photography; Medicine: Medicine (General): Computer applications to medicine. Medical informatics; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: http://www.mdpi.com/journal/jimaging
