Frontiers in Plant Science (Jul 2024)

YOLOC-tiny: a generalized lightweight real-time detection model for multiripeness fruits of large non-green-ripe citrus in unstructured environments

  • Zuoliang Tang,
  • Zuoliang Tang,
  • Lijia Xu,
  • Haoyang Li,
  • Mingyou Chen,
  • Xiaoshi Shi,
  • Xiaoshi Shi,
  • Long Zhou,
  • Yuchao Wang,
  • Zhijun Wu,
  • Yongpeng Zhao,
  • Kun Ruan,
  • Yong He,
  • Wei Ma,
  • Ning Yang,
  • Lufeng Luo,
  • Yunqiao Qiu

DOI
https://doi.org/10.3389/fpls.2024.1415006
Journal volume & issue
Vol. 15

Abstract

Read online

This study addresses the challenges of low detection precision and limited generalization across various ripeness levels and varieties for large non-green-ripe citrus fruits in complex scenarios. We present a high-precision and lightweight model, YOLOC-tiny, built upon YOLOv7, which utilizes EfficientNet-B0 as the feature extraction backbone network. To augment sensing capabilities and improve detection accuracy, we embed a spatial and channel composite attention mechanism, the convolutional block attention module (CBAM), into the head’s efficient aggregation network. Additionally, we introduce an adaptive and complete intersection over union regression loss function, designed by integrating the phenotypic features of large non-green-ripe citrus, to mitigate the impact of data noise and efficiently calculate detection loss. Finally, a layer-based adaptive magnitude pruning strategy is employed to further eliminate redundant connections and parameters in the model. Targeting three types of citrus widely planted in Sichuan Province—navel orange, Ehime Jelly orange, and Harumi tangerine—YOLOC-tiny achieves an impressive mean average precision (mAP) of 83.0%, surpassing most other state-of-the-art (SOTA) detectors in the same class. Compared with YOLOv7 and YOLOv8x, its mAP improved by 1.7% and 1.9%, respectively, with a parameter count of only 4.2M. In picking robot deployment applications, YOLOC-tiny attains an accuracy of 92.8% at a rate of 59 frames per second. This study provides a theoretical foundation and technical reference for upgrading and optimizing low-computing-power ground-based robots, such as those used for fruit picking and orchard inspection.

Keywords