Remote Sensing (Sep 2023)

Synthetic Forest Stands and Point Clouds for Model Selection and Feature Space Comparison

  • Michelle S. Bester,
  • Aaron E. Maxwell,
  • Isaac Nealey,
  • Michael R. Gallagher,
  • Nicholas S. Skowronski,
  • Brenden E. McNeil

DOI
https://doi.org/10.3390/rs15184407
Journal volume & issue
Vol. 15, no. 18
p. 4407

Abstract

Read online

The challenges inherent in field validation data, and real-world light detection and ranging (lidar) collections make it difficult to assess the best algorithms for using lidar to characterize forest stand volume. Here, we demonstrate the use of synthetic forest stands and simulated terrestrial laser scanning (TLS) for the purpose of evaluating which machine learning algorithms, scanning configurations, and feature spaces can best characterize forest stand volume. The random forest (RF) and support vector machine (SVM) algorithms generally outperformed k-nearest neighbor (kNN) for estimating plot-level vegetation volume regardless of the input feature space or number of scans. Also, the measures designed to characterize occlusion using spherical voxels generally provided higher predictive performance than measures that characterized the vertical distribution of returns using summary statistics by height bins. Given the difficulty of collecting a large number of scans to train models, and of collecting accurate and consistent field validation data, we argue that synthetic data offer an important means to parameterize models and determine appropriate sampling strategies.

Keywords