Applied Computational Intelligence and Soft Computing (Jan 2022)

Rapid Detection of Hybrid Maize Parental Lines Using Stacking Ensemble Machine Learning

  • M. Aqil,
  • M. Azrai,
  • M. J. Mejaya,
  • N. A. Subekti,
  • F. Tabri,
  • N. N. Andayani,
  • Rahma Wati,
  • S. Panikkai,
  • S. Suwardi,
  • Z. Bunyamin,
  • E. Roy,
  • M. Muslimin,
  • M. Yasin,
  • E. Prakasa

DOI
https://doi.org/10.1155/2022/6588949
Journal volume & issue
Vol. 2022

Abstract

Read online

Hybrid maize seed production is a relatively complex task due to the coexistence of three distinct types of maize plants in the field: female, male, and contaminant/off-type plants. Female and contaminant/off-type plants’ tassels should be removed immediately following flowering initiation, while male tassels should be retained to allow cross-pollination between male and female plants. Therefore, development of an intelligent tassel classification system is deemed critical for hybrid purity decision-making. The research’s primary contribution is the integration of two widely used transfer learning architectures, Inception V3 and SqueezeNet, with stacking ensemble machine learning using four algorithms (logistic regression, support vector machine, random forest, and k-nearest neighbors) for rapid classification of tassel images. Tenfold cross-validation was used to evaluate the model performance. Cloud computing was also investigated using EfficientNet to compare the predictive performance of the models. The models’ performance was assessed using four metrics: accuracy, AUC, precision, and recall. The results depicted an appropriate developed model that properly distinguished male, female, and contaminant plants. The integration of the model with machine learnings (logistic regression, SVM, random forest, and KNNs) enables rapid recognition of off-type plants even though it is operated by personnel with limited skills of seed technology on ideotype recognition. Among all the evaluated CNN architecture and stacking models, Inception V3-embedded images with logistic regression metaclassifier outperformed other models with accuracy of about 98%. SqueezeNet and EfficientNet provided comparable results for consistent tassel classification with slightly lower performance measures. The model was also subjected to a multidimensional scaling (MDS) analysis to investigate and comprehend misclassification. Male and female plants are clearly distinguished by MDS, but female and off-type/contamination plants are ambiguous. This indicates that the prediction errors were caused by highly similar data features among female and off-type images. The developed modern plant phenotyping model can be used to assist breeders/technicians in maintaining the quality of large-scale hybrid maize seed production activities in Indonesia.