Power transformer fault diagnosis considering data imbalance and data set fusion

Yang Zhang, Hong Cai Chen, Yaping Du, Min Chen, Jie Liang, Jianhong Li, Xiqing Fan, Xin Yao

Research output: Journal article publicationJournal articleAcademic researchpeer-review

25 Citations (Scopus)


Improving the accuracy of transformer dissolved gas analysis is always an important demand for power companies. However, the requirement for large numbers of fault samples becomes an obstacle to this demand. This article creatively uses a large number of health data, which is much easier to obtain by power companies, to improve diagnosis accuracy. Comprehensive investigations from the view of both data set and methodology to deal with this problem are presented. A data set consists of 9595 health samples and 993 fault samples is used for analysis. The characteristics of the data set and the influence of the health data on diagnostic accuracy are discussed. The performance of many state-of-art algorithms that handle the imbalanced problem is evaluated. Meanwhile, an efficient fault diagnosis algorithm named self-paced ensemble (SPE) is presented. In SPE, classification hardness is proposed to include the data characteristic in the classification. This method can guarantee the diversity of the data set and keep high performance. According to the experiment results, the superior of SPE is confirmed and also proves that involving more health samples can improve transformer diagnosis when fault data are limited.

Original languageEnglish
Pages (from-to)543-554
Number of pages12
JournalHigh Voltage
Issue number3
Publication statusPublished - Jun 2021

ASJC Scopus subject areas

  • Energy Engineering and Power Technology
  • Electrical and Electronic Engineering


Dive into the research topics of 'Power transformer fault diagnosis considering data imbalance and data set fusion'. Together they form a unique fingerprint.

Cite this