Frontiers in Plant Science (Feb 2024)

Pest recognition in microstates state: an improvement of YOLOv7 based on Spatial and Channel Reconstruction Convolution for feature redundancy and vision transformer with Bi-Level Routing Attention

  • Junjie He,
  • Junjie He,
  • Shihao Zhang,
  • Chunhua Yang,
  • Chunhua Yang,
  • Houqiao Wang,
  • Houqiao Wang,
  • Jun Gao,
  • Wei Huang,
  • Qiaomei Wang,
  • Xinghua Wang,
  • Wenxia Yuan,
  • Yamin Wu,
  • Yamin Wu,
  • Lei Li,
  • Lei Li,
  • Jiayi Xu,
  • Jiayi Xu,
  • Zejun Wang,
  • Zejun Wang,
  • Rukui Zhang,
  • Baijuan Wang,
  • Baijuan Wang

DOI
https://doi.org/10.3389/fpls.2024.1327237
Journal volume & issue
Vol. 15

Abstract

Read online

IntroductionIn order to solve the problem of precise identification and counting of tea pests, this study has proposed a novel tea pest identification method based on improved YOLOv7 network.MethodsThis method used MPDIoU to optimize the original loss function, which improved the convergence speed of the model and simplifies the calculation process. Replace part of the network structure of the original model using Spatial and Channel reconstruction Convolution to reduce redundant features, lower the complexity of the model, and reduce computational costs. The Vision Transformer with Bi-Level Routing Attention has been incorporated to enhance the flexibility of model calculation allocation and content perception.ResultsThe experimental results revealed that the enhanced YOLOv7 model significantly boosted Precision, Recall, F1, and mAP by 5.68%, 5.14%, 5.41%, and 2.58% respectively, compared to the original YOLOv7. Furthermore, when compared to deep learning networks such as SSD, Faster Region-based Convolutional Neural Network (RCNN), and the original YOLOv7, this method proves to be superior while being externally validated. It exhibited a noticeable improvement in the FPS rates, with increments of 5.75 HZ, 34.42 HZ, and 25.44 HZ respectively. Moreover, the mAP for actual detection experiences significant enhancements, with respective increases of 2.49%, 12.26%, and 7.26%. Additionally, the parameter size is reduced by 1.39 G relative to the original model.DiscussionThe improved model can not only identify and count tea pests efficiently and accurately, but also has the characteristics of high recognition rate, low parameters and high detection speed. It is of great significance to achieve realize the intelligent and precise prevention and control of tea pests.

Keywords