Applied Sciences (Jul 2023)
Improved Prediction of Aquatic Beetle Diversity in a Stagnant Pool by a One-Dimensional Convolutional Neural Network Using Variational Autoencoder Generative Adversarial Network-Generated Data
Abstract
Building a reasonable model for predicting biodiversity using limited data is challenging. Expanding limited experimental data using a variational autoencoder generative adversarial network (VAEGAN) to improve biodiversity predictions for a region is a new strategy. Aquatic beetle diversity in a large >30-year-old artificial pool that had not had human interference in Nanshe Village (Dapeng Peninsula, Shenzhen City, Guangdong Province, China) was investigated. Eight ecological factors were considered. These were water temperature, salinity, pH, water depth, proportional area of aquatic plants, proportional area of submerged plants, water area, and water level. Field sampling was performed for 1 or 2 days in the middle or late part of each month for a year. A type D net was swept 10 times in the same direction in each ~1 m × ~1 m sample square, generating 132 datasets (experimental data). In total, 39 aquatic beetle species were collected, 19 of which were assigned to Hydrophilidae, 16 to Dytiscidae, 3 to Noteridae, and 1 to Gyrinidae. A one-dimensional convolutional neural network (1-D CNN) was used to assess and predict the grade of the number of individuals and the number of aquatic beetle species. The Bayesian-optimized 1-D CNN established using 112 experimental datasets as the training set and the other 20 datasets as validation and testing sets gave a 74.0% prediction accuracy for the grade of the number of individuals and a 70.0% prediction accuracy for the number of species. The impact of insufficient sample data on the model was assessed using a VAEGAN to expand the training set from 112 to 512 samples, and then the Bayesian-optimized 1-D CNN-based VAEGAN prediction model was re-established. This improved prediction accuracy for the grade of the number of individuals to 86.0% and for the number of species to 85.0%. The grade of the number of individuals’ prediction accuracy was 88.0% and the number of species’ prediction accuracy was 85.0% when the random effects of only obtaining a single individual of a species were excluded. The results indicated that the accuracy of the 1-D CNN in predicting the aquatic beetle species number and abundance from relevant environmental factors can be improved using a VAEGAN to expand the experimental data.
Keywords