Scientia Agricola (Jan 2021)

Determining the geographical origin of lettuce with data mining applied to micronutrients and soil properties

  • Camila Maione,
  • Eloá Moura Araujo,
  • Sabrina Novaes dos Santos-Araujo,
  • Alexys Giorgia Friol Boim,
  • Rommel Melgaço Barbosa,
  • Luís Reynaldo Ferracciú Alleoni

DOI
https://doi.org/10.1590/1678-992x-2020-0011
Journal volume & issue
Vol. 79, no. 1

Abstract

Read online

ABSTRACT Lettuce (Lactuca sativa) is the main leafy vegetable produced in Brazil. Since its production is widespread all over the country, lettuce traceability and quality assurance is hampered. In this study, we propose a new method to identify the geographical origin of Brazilian lettuce. The method uses a powerful data mining technique called support vector machines (SVM) applied to elemental composition and soil properties of samples analyzed. We investigated lettuce produced in São Paulo and Pernambuco, two states in the southeastern and northeastern regions in Brazil, respectively. We investigated efficiency of the SVM model by comparing its results with those achieved by traditional linear discriminant analysis (LDA). The SVM models outperformed the LDA models in the two scenarios investigated, achieving an average of 98 % prediction accuracy to discriminate lettuce from both states. A feature evaluation formula, called F–score, was used to measure the discriminative power of the variables analyzed. The soil exchangeable cation capacity, soil contents of low crystalized Al and Zn content in lettuce samples were the most relevant components for differentiation. Our results reinforce the potential of data mining and machine learning techniques to support traceability strategies and authentication of leafy vegetables.

Keywords