Nutritional composition analysis in food images: an innovative Swin Transformer approach

Hui Wang; Haixia Tian; Ronghui Ju; Liyan Ma; Ling Yang; Jingyao Chen; Feng Liu

doi:10.3389/fnut.2024.1454466

Frontiers in Nutrition (Oct 2024)

Nutritional composition analysis in food images: an innovative Swin Transformer approach

Hui Wang,
Haixia Tian,
Ronghui Ju,
Liyan Ma,
Ling Yang,
Jingyao Chen,
Feng Liu

Affiliations

Hui Wang: College of Food and Biological Engineering, Beijing Vocational College of Agriculture, Beijing, China
Haixia Tian: China Tea Technology Co., Ltd., Beijing, China
Ronghui Ju: College of Food and Biological Engineering, Beijing Vocational College of Agriculture, Beijing, China
Liyan Ma: College of Food Science and Nutrition Engineering, China Agricultural University, Beijing, China
Ling Yang: College of Food and Biological Engineering, Beijing Vocational College of Agriculture, Beijing, China
Jingyao Chen: College of Food and Biological Engineering, Beijing Vocational College of Agriculture, Beijing, China
Feng Liu: Beijing Sanyuan Foods Co., Ltd., Beijing, China

DOI: https://doi.org/10.3389/fnut.2024.1454466
Journal volume & issue: Vol. 11

Abstract

Read online

Accurate recognition of nutritional components in food is crucial for dietary management and health monitoring. Current methods often rely on traditional chemical analysis techniques, which are time-consuming, require destructive sampling, and are not suitable for large-scale or real-time applications. Therefore, there is a pressing need for efficient, non-destructive, and accurate methods to identify and quantify nutrients in food. In this study, we propose a novel deep learning model that integrates EfficientNet, Swin Transformer, and Feature Pyramid Network (FPN) to enhance the accuracy and efficiency of food nutrient recognition. Our model combines the strengths of EfficientNet for feature extraction, Swin Transformer for capturing long-range dependencies, and FPN for multi-scale feature fusion. Experimental results demonstrate that our model significantly outperforms existing methods. On the Nutrition5k dataset, it achieves a Top-1 accuracy of 79.50% and a Mean Absolute Percentage Error (MAPE) for calorie prediction of 14.72%. On the ChinaMartFood109 dataset, the model achieves a Top-1 accuracy of 80.25% and a calorie MAPE of 15.21%. These results highlight the model's robustness and adaptability across diverse food images, providing a reliable and efficient tool for rapid, non-destructive nutrient detection. This advancement supports better dietary management and enhances the understanding of food nutrition, potentially leading to more effective health monitoring applications.

Published in Frontiers in Nutrition

ISSN: 2296-861X (Online)
Publisher: Frontiers Media S.A.
Country of publisher: Switzerland
LCC subjects: Technology: Home economics: Nutrition. Foods and food supply
Website: https://www.frontiersin.org/journals/nutrition/

About the journal

Abstract

Keywords