CyTA - Journal of Food (Dec 2024)
Computer vision classification detection of chicken parts based on optimized Swin-Transformer
Abstract
To achieve real-time classification and detection of various chicken parts, this study introduces an optimized Swin-Transformer method. The approach first leverages the Transformer's self-attention structure to capture more comprehensive high-level visual semantic information from chicken part images. Image enhancement was applied during preprocessing to strengthen the feature information of the images, and transfer learning was used to train and optimize the Swin-Transformer model on the enhanced chicken parts dataset for classification and detection of chicken parts. The model was then compared with four models commonly used in object detection tasks: YOLOv3-Darknet53, YOLOv3-MobileNetv3, SSD-MobileNetv3, and SSD-VGG16. The results indicate that the Swin-Transformer model outperforms these models, with mAP higher by 1.62%, 2.13%, 5.26%, and 4.48%, and detection time lower by 16.18 ms, 5.08 ms, 9.38 ms, and 23.48 ms, respectively. The proposed method meets production-line requirements while exhibiting superior performance and greater robustness compared with existing conventional methods.
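As a rough illustration of the workflow the abstract describes (augmented preprocessing plus transfer learning on a pretrained Swin-Transformer), the sketch below fine-tunes torchvision's Swin-T on an image-folder dataset. The class count, dataset path, augmentations, and hyperparameters are illustrative assumptions, not the authors' actual configuration.

```python
# Minimal sketch (not the authors' code): transfer learning with a
# pretrained Swin-Transformer for chicken-part classification.
import torch
import torch.nn as nn
from torchvision import models, transforms, datasets

NUM_PARTS = 5  # assumed number of chicken-part classes

# Load an ImageNet-pretrained Swin-T and replace its classification head
model = models.swin_t(weights=models.Swin_T_Weights.IMAGENET1K_V1)
model.head = nn.Linear(model.head.in_features, NUM_PARTS)

# Image-enhancement step in preprocessing, stood in for here by
# standard augmentations (crop, flip, color jitter)
train_tf = transforms.Compose([
    transforms.RandomResizedCrop(224),
    transforms.RandomHorizontalFlip(),
    transforms.ColorJitter(brightness=0.2, contrast=0.2),
    transforms.ToTensor(),
    transforms.Normalize(mean=[0.485, 0.456, 0.406],
                         std=[0.229, 0.224, 0.225]),
])
train_set = datasets.ImageFolder("chicken_parts/train", transform=train_tf)
loader = torch.utils.data.DataLoader(train_set, batch_size=32, shuffle=True)

optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)
criterion = nn.CrossEntropyLoss()

model.train()
for images, labels in loader:  # one fine-tuning pass, for illustration only
    optimizer.zero_grad()
    loss = criterion(model(images), labels)
    loss.backward()
    optimizer.step()
```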
Keywords