Agronomy (Feb 2024)

Estimation of Relative Chlorophyll Content in Lettuce (<i>Lactuca sativa</i> L.) Leaves under Cadmium Stress Using Visible—Near-Infrared Reflectance and Machine-Learning Models

  • Leijinyu Zhou,
  • Hongbo Wu,
  • Tingting Jing,
  • Tianhao Li,
  • Jinsheng Li,
  • Lijuan Kong,
  • Lina Zhou

DOI
https://doi.org/10.3390/agronomy14030427
Journal volume & issue
Vol. 14, no. 3
p. 427

Abstract

Read online

Chlorophyll content is a crucial assessment parameter in the growth monitoring of lettuce, particularly in cases when it is affected by disease. Accurate estimation of chlorophyll content is beneficial for early detection and prevention of diseases and holds significant importance in practical production. To construct a model for estimating the chlorophyll content in lettuce leaves under cadmium stress, this study utilized lettuce as the experimental material. The visible–near-infrared reflectance spectra of lettuce leaves, as well as the relative chlorophyll content of the leaves, were detected and analyzed under different concentrations of cadmium stress. Subsequently, an inversion model for estimating the relative chlorophyll content in lettuce leaves was established. First, to determine the optimal spectral preprocessing method, eight techniques are utilized: Savitzky–Golay smoothing (SG), multiplicative scatter correction (MSC), standard normal variable transformation (SNV), mean normalization (MN), baseline offset (B), detrending (D), gap derivatives—first derivative (FD), and gap derivatives—second derivative (SD). These methods are used to preprocess the spectra and establish a partial least squares regression (PLSR) monitoring model. The optimal spectral preprocessing method is then selected. Next, the feature bands are extracted from the preprocessed spectral data using the correlation coefficient method. Finally, the selected feature bands will be combined with support vector regression (SVR) to establish a chlorophyll content estimation model using a training-to-testing set ratio of 4:1. The results showed that the PLSR model established after preprocessing with detrending (D) had the highest accuracy, with the coefficient of determination (Rv2) and root mean squared error (RMSEv) values of 0.87 and 1.16, respectively. The feature bands selected by the correlation coefficient method were used to establish SVR models for estimating the chlorophyll content of lettuce leaves under cadmium stress, with the highest accuracy being achieved by the genetic algorithm (GA)–SVR model. It can be seen that near-infrared spectroscopy technology provides a scientific basis for rapid, nondestructive, and accurate detection of lettuce diseases and stress.

Keywords