Smart Agricultural Technology (Dec 2024)
Multimodal rapid identification of growth stages and discrimination of growth status for Morchella
Abstract
We introduce a multimodal rapid identification and growth status discrimination method for morchella. Based on the unique biological characteristics and growth environmental requirements of morchella, the efficient and accurate identification of key growth stages of morchella is achieved through the integration of multimodal information acquisition technology. During the rapid identification process of the growth stage of Morchella, the Multi Stage Vision Enhanced Position Encoding Vision Transformer (MS-EP ViT) model is adopted. By introducing multi-stage input embedding, enhanced position encoding, and optimized Transformer Encoder layers, the performance of the model in identifying different growth stages of Morchella mushrooms is significantly improved. In the multimodal Morchella growth state discrimination method, text and image modalities are integrated, a Non downsampled Contourlet Transform Mask Region based Convolutional Neural Network (NSCT Mask R-CNN) model is designed, and a multimodal feature extraction strategy combining Non downsampled Contourlet Transform (NSCT) features with environmental features is explored. This strategy effectively achieves the goals of object detection and instance segmentation, and thus we have accurately evaluated the growth status of Morchella in the later stages of mulberry, young mushroom, and mature. The experimental results show that both models have achieved significant improvements in recognition accuracy and stability, and the rationality of hyperparameter settings has been verified through convergence and parameter sensitivity experiments. Overall, we provide a more accurate and efficient identification method for monitoring the growth of Morchella, which helps to better understand the growth of Morchella and provides scientific basis for optimizing its growth environment.