Effective Voting-Based Ensemble Learning for Segregated Load Forecasting With Low Sampling Data

Shahzeb Ahmad Khan; Attique Ur Rehman; Ammar Arshad; Mohammed H. Alqahtani; Karar Mahmoud; Matti Lehtonen

doi:10.1109/ACCESS.2024.3413679

IEEE Access (Jan 2024)

Effective Voting-Based Ensemble Learning for Segregated Load Forecasting With Low Sampling Data

Shahzeb Ahmad Khan,
Attique Ur Rehman,
Ammar Arshad,
Mohammed H. Alqahtani,
Karar Mahmoud,
Matti Lehtonen

Affiliations

Shahzeb Ahmad Khan: ORCiD; Faculty of Electrical Engineering, Ghulam Ishaq Khan Institute of Engineering Sciences and Technology, Topi, Pakistan
Attique Ur Rehman: Faculty of Electrical Engineering, Ghulam Ishaq Khan Institute of Engineering Sciences and Technology, Topi, Pakistan
Ammar Arshad: Faculty of Electrical Engineering, Ghulam Ishaq Khan Institute of Engineering Sciences and Technology, Topi, Pakistan
Mohammed H. Alqahtani: ORCiD; Department of Electrical Engineering, College of Engineering, Prince Sattam Bin Abdulaziz University, Al-Kharj, Saudi Arabia
Karar Mahmoud: ORCiD; Department of Electrical Engineering, Faculty of Engineering, Aswan University, Aswan, Egypt
Matti Lehtonen: ORCiD; Department of Electrical Engineering and Automation, School of Electrical Engineering, Aalto University, Espoo, Finland

DOI: https://doi.org/10.1109/ACCESS.2024.3413679
Journal volume & issue: Vol. 12
pp. 84074 – 84087

Abstract

Read online

In power system planning and operation, load forecasting is an important task as it helps ensure a reliable and efficient electricity supply. For effective operation of the smart grid, load forecasting is also an important thing to keep balancing dispatch of power, load management, and load shifting. In this regard, this paper aims to propose an accurate load forecasting based on implementing and integrating different load forecasting models using standalone machine learning and ensemble machine learning models, particularly for segregated real-world load data. In the given context, machine learning models namely, k-nearest neighbor, random forest, decision tree, and voting ensemble regression, are used in this study. The time series load data for this research work was acquired from a real-world load database namely, Pecan Street Dataport. For performance evaluation, two statistical error matrices are used, i.e., mean absolute error (MAE) and mean squared error (MSE). For simulation purposes, Python along with different machine-learning libraries was employed. Moreover, for numerical data analysis and visualization, this research work utilizes different packages like NumPy, pandas, and matplotlib. The empirical study presents the comparative performance analysis of machine learning models for load forecasting utilizing low sampling load data, both at aggregated as well as at segregated levels. Standalone and ensemble machine learning algorithms yield very good forecasting results, and this research has revealed that machine learning models trained on segregated data exhibit superior performance compared to those trained on aggregated data. On segregated data, the proposed voting- based ensemble machine learning algorithm outperforms all the other models with MAE 0.05708, followed by k-nearest neighbors (with MAE 0.05879), random forest (with MAE 0.07069), and decision tree (with MAE 0.07361).

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords