Semantic segmentation of plant roots from RGB (mini-) rhizotron images—generalisation potential and false positives of established methods and advanced deep-learning models

Pavel Baykalov; Bart Bussmann; Richard Nair; Abraham George Smith; Gernot Bodner; Ofer Hadar; Naftali Lazarovitch; Boris Rewald

doi:10.1186/s13007-023-01101-2

Plant Methods (Nov 2023)

Semantic segmentation of plant roots from RGB (mini-) rhizotron images—generalisation potential and false positives of established methods and advanced deep-learning models

Pavel Baykalov,
Bart Bussmann,
Richard Nair,
Abraham George Smith,
Gernot Bodner,
Ofer Hadar,
Naftali Lazarovitch,
Boris Rewald

Affiliations

Pavel Baykalov: Institute of Forest Ecology, Department of Forest and Soil Sciences, University of Natural Resources and Life Sciences, Vienna (BOKU)
Bart Bussmann: IDLab, Department of Computer Science, University of Antwerp - Imec
Richard Nair: Dept. Biogeochemical Integration, Max Planck Institute for Biogeochemistry
Abraham George Smith: Department of Computer Science, University of Copenhagen
Gernot Bodner: Institute of Agronomy, University of Natural Resources and Life Sciences, Vienna (BOKU)
Ofer Hadar: School of Electrical and Computer Engineering, Ben-Gurion University of the Negev
Naftali Lazarovitch: Wyler Department for Dryland Agriculture, French Associates Institute for Agriculture and Biotechnology of Drylands, Jacob Blaustein Institutes for Desert Research, Ben-Gurion University of the Negev
Boris Rewald: Institute of Forest Ecology, Department of Forest and Soil Sciences, University of Natural Resources and Life Sciences, Vienna (BOKU)

DOI: https://doi.org/10.1186/s13007-023-01101-2
Journal volume & issue: Vol. 19, no. 1
pp. 1 – 15

Abstract

Read online

Abstract Background Manual analysis of (mini-)rhizotron (MR) images is tedious. Several methods have been proposed for semantic root segmentation based on homogeneous, single-source MR datasets. Recent advances in deep learning (DL) have enabled automated feature extraction, but comparisons of segmentation accuracy, false positives and transferability are virtually lacking. Here we compare six state-of-the-art methods and propose two improved DL models for semantic root segmentation using a large MR dataset with and without augmented data. We determine the performance of the methods on a homogeneous maize dataset, and a mixed dataset of > 8 species (mixtures), 6 soil types and 4 imaging systems. The generalisation potential of the derived DL models is determined on a distinct, unseen dataset. Results The best performance was achieved by the U-Net models; the more complex the encoder the better the accuracy and generalisation of the model. The heterogeneous mixed MR dataset was a particularly challenging for the non-U-Net techniques. Data augmentation enhanced model performance. We demonstrated the improved performance of deep meta-architectures and feature extractors, and a reduction in the number of false positives. Conclusions Although correction factors are still required to match human labelled root lengths, neural network architectures greatly reduce the time required to compute the root length. The more complex architectures illustrate how future improvements in root segmentation within MR images can be achieved, particularly reaching higher segmentation accuracies and model generalisation when analysing real-world datasets with artefacts—limiting the need for model retraining.

Published in Plant Methods

ISSN: 1746-4811 (Online)
Publisher: BMC
Country of publisher: United Kingdom
LCC subjects: Agriculture: Plant culture; Science: Biology (General)
Website: https://plantmethods.biomedcentral.com

About the journal

Abstract

Keywords