Power System Fault Classification and Prediction Based on a Three-Layer Data Mining Structure

Yunliang Wang; Xiaodong Wang; Yanjuan Wu; Yannan Guo

doi:10.1109/access.2020.3034365

IEEE Access (Jan 2020)

Power System Fault Classification and Prediction Based on a Three-Layer Data Mining Structure

Yunliang Wang,
Xiaodong Wang,
Yanjuan Wu,
Yannan Guo

Affiliations

Yunliang Wang: ORCiD; Tianjin Key Laboratory for Control Theory and Applications in Complicated Systems, Tianjin University of Technology, Tianjin, China
Xiaodong Wang: ORCiD; Tianjin Key Laboratory for Control Theory and Applications in Complicated Systems, Tianjin University of Technology, Tianjin, China
Yanjuan Wu: ORCiD; Tianjin Key Laboratory for Control Theory and Applications in Complicated Systems, Tianjin University of Technology, Tianjin, China
Yannan Guo: ORCiD; Tianjin Tianda Qiushi Electric Power High Technology Company Ltd., Tianjin, China

DOI: https://doi.org/10.1109/access.2020.3034365
Journal volume & issue: Vol. 8
pp. 200897 – 200914

Abstract

Read online

In traditional fault diagnosis methods in power systems, it is difficult to accurately classify and predict the types of faults. With the emergence of big data technology, the fault classification and prediction methods based on big data analysis and processing have been applied in power systems. To make the classification and prediction of the fault types more accurate, this paper proposes a hybrid data mining method for power system fault classification and prediction based on clustering, association rules and stochastic gradient descent. This method uses a three-layer data mining model: The first layer uses the K-means clustering algorithm to preprocess the original fault data source, and it proposes to use self-encoding to simplify the data form. The second layer effectively eliminates the data that have little impact on the prediction results by using association rules, and the highly correlated data are mined to become the regression training data. The third layer first uses the cross-validation method to obtain the optimal parameters of each fault model, and then, it uses stochastic gradient descent for data regression training to obtain a classification and prediction model for each fault type. Finally, a verification example shows that compared with a single data mining algorithm model, the proposed method is more comparative in terms of the data mining, and the established power system fault classification and prediction model has global optimality and higher prediction accuracy, which has a certain feasibility for real-time online power system fault classification and prediction. This method reduces the disturbances from low-impact or irrelevant data by mining the fault data three times, and it uses cross-validation to optimize the multiple regression parameters of the regression model to solve the problems of low accuracy, large errors and easily falling into a local optimum, given the conduct of fault classification and prediction.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords