A Hierarchical Feature-Aware Model for Accurate Tomato Blight Disease Spot Detection: Unet with Vision Mamba and ConvNeXt Perspective
Dongyuan Shi,
Changhong Li,
Hui Shi,
Longwei Liang,
Huiying Liu,
Ming Diao
Affiliations
Dongyuan Shi
Department of Horticulture, Agricultural College of Shihezi University/Key Laboratory of Special Fruits and Vegetables Cultivation Physiology and Germplasm Resources Utilization of Xinjiang Production and Construction Corps, Shihezi 832003, China
Changhong Li
Department of Horticulture, Agricultural College of Shihezi University/Key Laboratory of Special Fruits and Vegetables Cultivation Physiology and Germplasm Resources Utilization of Xinjiang Production and Construction Corps, Shihezi 832003, China
Hui Shi
Research Center of Information Technology, Beijing Academy of Agriculture and Forestry Sciences/National Engineering Research Center for Information Technology in Agriculture/National Engineering Laboratory for Agri-product Quality Traceability/Meteorological Service Center for Urban Agriculture, China Meteorological Administration-Ministry of Agriculture and Rural Affairs, Beijing 100097, China
Longwei Liang
Department of Horticulture, Agricultural College of Shihezi University/Key Laboratory of Special Fruits and Vegetables Cultivation Physiology and Germplasm Resources Utilization of Xinjiang Production and Construction Corps, Shihezi 832003, China
Huiying Liu
Department of Horticulture, Agricultural College of Shihezi University/Key Laboratory of Special Fruits and Vegetables Cultivation Physiology and Germplasm Resources Utilization of Xinjiang Production and Construction Corps, Shihezi 832003, China
Ming Diao
Department of Horticulture, Agricultural College of Shihezi University/Key Laboratory of Special Fruits and Vegetables Cultivation Physiology and Germplasm Resources Utilization of Xinjiang Production and Construction Corps, Shihezi 832003, China
Tomato blight significantly threatens tomato yield and quality, making precise disease detection essential for modern agricultural practice. Traditional segmentation models often struggle with over-segmentation and under-segmentation (missed lesion regions), particularly against complex backgrounds and with diverse lesion morphologies. To address these challenges, we proposed Unet with Vision Mamba and ConvNeXt (VMC-Unet), an asymmetric segmentation model for quantitative analysis of tomato blight. Built on the Unet framework, VMC-Unet integrated a parallel feature-aware backbone combining ConvNeXt, Vision Mamba, and Atrous Spatial Pyramid Pooling (ASPP) modules to sharpen spatial feature focus and improve multi-scale information processing. During decoding, Vision Mamba was hierarchically embedded to accurately recover complex lesion morphologies through refined feature processing and efficient up-sampling. A joint loss function was designed to optimize the model’s performance. Extensive experiments on both tomato epidemic and public datasets demonstrated the superior performance of VMC-Unet, which achieved 97.82% pixel accuracy, an 87.94% F1 score, and 86.75% mIoU. These results surpassed those of classical segmentation models, underscoring the effectiveness of VMC-Unet in mitigating over-segmentation and under-segmentation while maintaining high segmentation accuracy against complex backgrounds. The consistent performance of the model across datasets further supported its robustness and generalization potential, highlighting its applicability in broader agricultural settings.
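For readers less familiar with the three reported metrics, the following minimal sketch shows how pixel accuracy, F1 score, and mIoU are commonly computed from predicted and ground-truth label masks. It is an illustration under stated assumptions, not the authors' evaluation code: the function names, the binary background/lesion labeling, and the choice to report F1 for the lesion class only are hypothetical, since the abstract does not state the averaging conventions used.

```python
def confusion_counts(pred, truth, num_classes):
    """Count pixels into a matrix indexed as [ground-truth class][predicted class]."""
    matrix = [[0] * num_classes for _ in range(num_classes)]
    for pred_row, truth_row in zip(pred, truth):
        for p, t in zip(pred_row, truth_row):
            matrix[t][p] += 1
    return matrix


def segmentation_metrics(pred, truth, num_classes=2, lesion_class=1):
    """Return pixel accuracy, lesion-class F1, and mean IoU for two label masks.

    Assumption: masks are equally sized 2D lists of integer class labels,
    with 0 = background and 1 = lesion (a hypothetical labeling).
    """
    m = confusion_counts(pred, truth, num_classes)
    total = sum(sum(row) for row in m)

    # Pixel accuracy: correctly labeled pixels over all pixels.
    correct = sum(m[c][c] for c in range(num_classes))
    pixel_accuracy = correct / total if total else 0.0

    # Per-class IoU = TP / (TP + FP + FN); mIoU averages over classes present.
    ious = []
    for c in range(num_classes):
        tp = m[c][c]
        fp = sum(m[r][c] for r in range(num_classes)) - tp
        fn = sum(m[c]) - tp
        union = tp + fp + fn
        if union > 0:
            ious.append(tp / union)
    miou = sum(ious) / len(ious) if ious else 0.0

    # F1 for the lesion class = 2TP / (2TP + FP + FN).
    tp = m[lesion_class][lesion_class]
    fp = sum(m[r][lesion_class] for r in range(num_classes)) - tp
    fn = sum(m[lesion_class]) - tp
    denom = 2 * tp + fp + fn
    f1 = 2 * tp / denom if denom else 0.0

    return {"pixel_accuracy": pixel_accuracy, "f1": f1, "miou": miou}


# Toy 2x3 masks: one false-positive and one false-negative lesion pixel.
example_pred = [[0, 1, 1], [0, 0, 1]]
example_truth = [[0, 1, 0], [0, 1, 1]]
example_scores = segmentation_metrics(example_pred, example_truth)
```

In this toy case a false positive corresponds to over-segmentation and a false negative to under-segmentation; both lower IoU and F1, whereas pixel accuracy can remain high when background pixels dominate an image, which is one reason the three metrics are usually reported together.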