High Voltage (Jun 2021)

Power transformer fault diagnosis considering data imbalance and data set fusion

  • Yang Zhang,
  • Hong Cai Chen,
  • Yaping Du,
  • Min Chen,
  • Jie Liang,
  • Jianhong Li,
  • Xiqing Fan,
  • Xin Yao

DOI
https://doi.org/10.1049/hve2.12059
Journal volume & issue
Vol. 6, no. 3
pp. 543 – 554

Abstract

Read online

Abstract Improving the accuracy of transformer dissolved gas analysis is always an important demand for power companies. However, the requirement for large numbers of fault samples becomes an obstacle to this demand. This article creatively uses a large number of health data, which is much easier to obtain by power companies, to improve diagnosis accuracy. Comprehensive investigations from the view of both data set and methodology to deal with this problem are presented. A data set consists of 9595 health samples and 993 fault samples is used for analysis. The characteristics of the data set and the influence of the health data on diagnostic accuracy are discussed. The performance of many state‐of‐art algorithms that handle the imbalanced problem is evaluated. Meanwhile, an efficient fault diagnosis algorithm named self‐paced ensemble (SPE) is presented. In SPE, classification hardness is proposed to include the data characteristic in the classification. This method can guarantee the diversity of the data set and keep high performance. According to the experiment results, the superior of SPE is confirmed and also proves that involving more health samples can improve transformer diagnosis when fault data are limited.

Keywords