Information Processing in Agriculture (Mar 2023)
Prognosis of forest production using machine learning techniques
Abstract
Forest production and growth are obtained from statistical models that allow the generation of information at the tree or forest stand level. Although the use of regression models is common in forest measurement, there is a constant search for estimation procedures that provide greater accuracy. Recently, machine learning techniques have been used with satisfactory performance in measuring forests. However, methods such as Adaptive Neuro-Fuzzy Inference System (ANFIS) and Random Forest are relatively poorly studied for predicting the volume of wood in eucalyptus plantations in Brazil. Therefore, it is essential to check whether these techniques can provide gains in terms of accuracy. Thus, this study aimed to evaluate the use of Random Forest and ANFIS techniques in the prognosis of forest production. The data used come from continuous forest inventories carried out in stands of eucalyptus clones. The data were divided into 70% for training and 30% for validation. The algorithms used to generate rules in ANFIS were Subtractive Clustering and Fuzzy-C-Means. Besides, training was done with the hybrid algorithm (descending gradient and least squares) with the number of seasons ranging from 1 to 20. Several RFs were trained, varying the number of trees from 50 to 850 and the number of observations by five leaves to 35. Artificial neural networks and decision trees were also trained to compare the feasibility of the techniques. The evaluation of the estimates generated by the techniques for training and validation was calculated based on the following statistics: correlation coefficient (r), relative Bias (RB), and the relative root mean square error (RRMSE) in percentage. In general, the techniques studied in this work showed excellent performance for the training and validation data set with RRMSE values 0.98. The RF presented inferior statistics about the ANFIS for the prognosis of forest production. The Subtractive Clustering (SC) and Fuzzy-C-Means (FCM) algorithms provide accurate baseline and volume projection estimates; both techniques are good alternatives for selecting variables used in modeling forest production.