Forecasting (Jan 2025)
Methodology Based on BERT (Bidirectional Encoder Representations from Transformers) to Improve Solar Irradiance Prediction of Deep Learning Models Trained with Time Series of Spatiotemporal Meteorological Information
Abstract
Accurate solar resource forecasting is important because of the inherent variability associated with solar energy and its significant impact on the cost for energy producers. The traditional method applied in solar irradiance forecasting involves two main phases, related to (1) data selection and (2) model selection, training, and evaluation. In this study, we propose a novel end-to-end methodology for solar irradiance forecasting that starts with the search for the data and all of the preprocessing operations involved in obtaining a quality dataset, continuing by imputing missing data with the BERT (Bidirectional Encoder Representations from Transformers) model, and ending with obtaining and evaluating the predicted values. This novel methodology is based on three phases; namely, Phase_1, related to the acquisition and preparation of the data, Phase_2, related to the proposed imputation with a BERT model, and Phase_3, related to the training and prediction with new models based on deep learning. These phases of the proposed methodology can be applied in a disjointed manner, and were used on two public datasets accessible to the scientific community. Each of the proposed phases proved to be valuable for the workflow, and the application of the novel method delivered increases in performance of up to 3 percentage points (3%) when compared to the traditional approach.
Keywords