Scientific Reports (Nov 2024)

Regression prediction of tobacco chemical components during curing based on color quantification and machine learning

  • Yang Meng,
  • Qiang Xu,
  • Guangqing Chen,
  • Jianjun Liu,
  • Shuoye Zhou,
  • Yanling Zhang,
  • Aiguo Wang,
  • Jianwei Wang,
  • Ding Yan,
  • Xianjie Cai,
  • Junying Li,
  • Xuchu Chen,
  • Qiuying Li,
  • Qiang Zeng,
  • Weimin Guo,
  • Yuanhui Wang

DOI
https://doi.org/10.1038/s41598-024-78426-y
Journal volume & issue
Vol. 14, no. 1
pp. 1 – 16

Abstract

Color is one of the most important indicators for characterizing tobacco quality and is strongly related to variations in chemical components. To clarify the relationship between changes in tobacco color and chemical components, we established several models that predict chemical components from tobacco color values using machine learning algorithms. Correlation analysis showed that tobacco moisture content was highly significantly correlated with parameters such as a*, H* and H°; that reducing sugar and total sugar content were significantly correlated with the color values; and that starch content was highly significantly correlated with the color values except b* and C*. The random forest models performed best in predicting tobacco moisture, reducing sugar, total sugar and starch: on the validation set, R² was higher than 0.90 and the RPD value was greater than 2.0. The consistency between predictions and measurements verified that color values can be used to predict some chemical components of tobacco leaves with high accuracy, an approach with distinct advantages and potential for real-time monitoring of some chemical components during tobacco curing.
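The two validation metrics quoted in the abstract can be illustrated with a minimal sketch. It assumes the common chemometrics definition of RPD as the sample standard deviation of the measured values divided by the RMSE of prediction; the abstract itself does not state the formula, and the function names and sample numbers below are hypothetical, not taken from the paper.

```python
import math
import statistics


def r_squared(measured, predicted):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_y = statistics.fmean(measured)
    ss_res = sum((y - p) ** 2 for y, p in zip(measured, predicted))
    ss_tot = sum((y - mean_y) ** 2 for y in measured)
    return 1.0 - ss_res / ss_tot


def rmse(measured, predicted):
    """Root mean squared error of prediction."""
    squared_errors = [(y - p) ** 2 for y, p in zip(measured, predicted)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


def rpd(measured, predicted):
    """Assumed definition: sample SD of measured values divided by RMSE."""
    return statistics.stdev(measured) / rmse(measured, predicted)


# Invented illustrative values (NOT data from the paper).
measured = [1.0, 2.0, 3.0, 4.0, 5.0]
predicted = [1.1, 1.9, 3.2, 3.8, 5.0]

r2_value = r_squared(measured, predicted)
rpd_value = rpd(measured, predicted)

# Thresholds quoted in the abstract for the random forest models.
meets_thresholds = r2_value > 0.90 and rpd_value > 2.0
print(f"R2={r2_value:.3f}, RPD={rpd_value:.2f}, meets thresholds: {meets_thresholds}")
```

Under this definition, a larger RPD means the prediction error is small relative to the natural spread of the measured values, which is why it is reported alongside R².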

Keywords