Frontiers in Plant Science (Jan 2024)

Image-based classification of wheat spikes by glume pubescence using convolutional neural networks

  • Nikita V. Artemenko,
  • Nikita V. Artemenko,
  • Mikhail A. Genaev,
  • Mikhail A. Genaev,
  • Rostislav UI. Epifanov,
  • Evgeny G. Komyshev,
  • Yulia V. Kruchinina,
  • Yulia V. Kruchinina,
  • Vasiliy S. Koval,
  • Vasiliy S. Koval,
  • Nikolay P. Goncharov,
  • Dmitry A. Afonnikov,
  • Dmitry A. Afonnikov,
  • Dmitry A. Afonnikov

DOI
https://doi.org/10.3389/fpls.2023.1336192
Journal volume & issue
Vol. 14

Abstract

Read online

IntroductionPubescence is an important phenotypic trait observed in both vegetative and generative plant organs. Pubescent plants demonstrate increased resistance to various environmental stresses such as drought, low temperatures, and pests. It serves as a significant morphological marker and aids in selecting stress-resistant cultivars, particularly in wheat. In wheat, pubescence is visible on leaves, leaf sheath, glumes and nodes. Regarding glumes, the presence of pubescence plays a pivotal role in its classification. It supplements other spike characteristics, aiding in distinguishing between different varieties within the wheat species. The determination of pubescence typically involves visual analysis by an expert. However, methods without the use of binocular loupe tend to be subjective, while employing additional equipment is labor-intensive. This paper proposes an integrated approach to determine glume pubescence presence in spike images captured under laboratory conditions using a digital camera and convolutional neural networks.MethodsInitially, image segmentation is conducted to extract the contour of the spike body, followed by cropping of the spike images to an equal size. These images are then classified based on glume pubescence (pubescent/glabrous) using various convolutional neural network architectures (Resnet-18, EfficientNet-B0, and EfficientNet-B1). The networks were trained and tested on a dataset comprising 9,719 spike images.ResultsFor segmentation, the U-Net model with EfficientNet-B1 encoder was chosen, achieving the segmentation accuracy IoU = 0.947 for the spike body and 0.777 for awns. The classification model for glume pubescence with the highest performance utilized the EfficientNet-B1 architecture. On the test sample, the model exhibited prediction accuracy parameters of F1 = 0.85 and AUC = 0.96, while on the holdout sample it showed F1 = 0.84 and AUC = 0.89. Additionally, the study investigated the relationship between image scale, artificial distortions, and model prediction performance, revealing that higher magnification and smaller distortions yielded a more accurate prediction of glume pubescence.

Keywords