Automatic Bunch Detection in White Grape Varieties Using YOLOv3, YOLOv4, and YOLOv5 Deep Learning Algorithms

Marco Sozzi; Silvia Cantalamessa; Alessia Cogato; Ahmed Kayad; Francesco Marinello

doi:10.3390/agronomy12020319

Agronomy (Jan 2022)

Automatic Bunch Detection in White Grape Varieties Using YOLOv3, YOLOv4, and YOLOv5 Deep Learning Algorithms

Marco Sozzi,
Silvia Cantalamessa,
Alessia Cogato,
Ahmed Kayad,
Francesco Marinello

Affiliations

Marco Sozzi: Department of Land Environment Agriculture and Forestry, University of Padova, 35020 Legnaro, Italy
Silvia Cantalamessa: Department of Agronomy, Food, Natural Resources, Animals, and Environment, University of Padova, 35020 Legnaro, Italy
Alessia Cogato: Department of Agricultural, Food, Environmental and Animal Sciences, University of Udine, 33100 Udine, Italy
Ahmed Kayad: Department of Land Environment Agriculture and Forestry, University of Padova, 35020 Legnaro, Italy
Francesco Marinello: Department of Land Environment Agriculture and Forestry, University of Padova, 35020 Legnaro, Italy

DOI: https://doi.org/10.3390/agronomy12020319
Journal volume & issue: Vol. 12, no. 2
p. 319

Abstract

Read online

Over the last few years, several Convolutional Neural Networks for object detection have been proposed, characterised by different accuracy and speed. In viticulture, yield estimation and prediction is used for efficient crop management, taking advantage of precision viticulture techniques. Convolutional Neural Networks for object detection represent an alternative methodology for grape yield estimation, which usually relies on manual harvesting of sample plants. In this paper, six versions of the You Only Look Once (YOLO) object detection algorithm (YOLOv3, YOLOv3-tiny, YOLOv4, YOLOv4-tiny, YOLOv5x, and YOLOv5s) were evaluated for real-time bunch detection and counting in grapes. White grape varieties were chosen for this study, as the identification of white berries on a leaf background is trickier than red berries. YOLO models were trained using a heterogeneous dataset populated by images retrieved from open datasets and acquired on the field in several illumination conditions, background, and growth stages. Results have shown that YOLOv5x and YOLOv4 achieved an F1-score of 0.76 and 0.77, respectively, with a detection speed of 31 and 32 FPS. Differently, YOLO5s and YOLOv4-tiny achieved an F1-score of 0.76 and 0.69, respectively, with a detection speed of 61 and 196 FPS. The final YOLOv5x model for bunch number, obtained considering bunch occlusion, was able to estimate the number of bunches per plant with an average error of 13.3% per vine. The best combination of accuracy and speed was achieved by YOLOv4-tiny, which should be considered for real-time grape yield estimation, while YOLOv3 was affected by a False Positive–False Negative compensation, which decreased the RMSE.

Published in Agronomy

ISSN: 2073-4395 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Agriculture
Website: http://www.mdpi.com/journal/agronomy

About the journal

Abstract

Keywords