Automated food intake tracking requires depth-refined semantic segmentation to rectify visual-volume discordance in long-term care homes

Kaylen J. Pfisterer; Robert Amelard; Audrey G. Chung; Braeden Syrnyk; Alexander MacLean; Heather H. Keller; Alexander Wong

doi:10.1038/s41598-021-03972-8

Scientific Reports (Jan 2022)

Automated food intake tracking requires depth-refined semantic segmentation to rectify visual-volume discordance in long-term care homes

Kaylen J. Pfisterer,
Robert Amelard,
Audrey G. Chung,
Braeden Syrnyk,
Alexander MacLean,
Heather H. Keller,
Alexander Wong

Affiliations

Kaylen J. Pfisterer: University of Waterloo, Waterloo, Systems Design Engineering
Robert Amelard: KITE-Toronto Rehabilitation Institute, University Health Network
Audrey G. Chung: University of Waterloo, Waterloo, Systems Design Engineering
Braeden Syrnyk: University of Waterloo, Waterloo, Mechanical and Mechatronics Engineering
Alexander MacLean: University of Waterloo, Waterloo, Systems Design Engineering
Heather H. Keller: Schlegel-UW Research Institute for Aging
Alexander Wong: University of Waterloo, Waterloo, Systems Design Engineering

DOI: https://doi.org/10.1038/s41598-021-03972-8
Journal volume & issue: Vol. 12, no. 1
pp. 1 – 16

Abstract

Read online

Abstract Malnutrition is a multidomain problem affecting 54% of older adults in long-term care (LTC). Monitoring nutritional intake in LTC is laborious and subjective, limiting clinical inference capabilities. Recent advances in automatic image-based food estimation have not yet been evaluated in LTC settings. Here, we describe a fully automatic imaging system for quantifying food intake. We propose a novel deep convolutional encoder-decoder food network with depth-refinement (EDFN-D) using an RGB-D camera for quantifying a plate’s remaining food volume relative to reference portions in whole and modified texture foods. We trained and validated the network on the pre-labelled UNIMIB2016 food dataset and tested on our two novel LTC-inspired plate datasets (689 plate images, 36 unique foods). EDFN-D performed comparably to depth-refined graph cut on IOU (0.879 vs. 0.887), with intake errors well below typical 50% (mean percent intake error: $$-4.2$$ - 4.2 %). We identify how standard segmentation metrics are insufficient due to visual-volume discordance, and include volume disparity analysis to facilitate system trust. This system provides improved transparency, approximates human assessors with enhanced objectivity, accuracy, and precision while avoiding hefty semi-automatic method time requirements. This may help address short-comings currently limiting utility of automated early malnutrition detection in resource-constrained LTC and hospital settings.

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/

About the journal