Simulating run-to-failure SCADA time series to enhance wind turbine fault detection and prognosis

Journal Article (2025)
Author(s)

A. Eftekhari Milani (TU Delft - Wind Energy)

D. Zappalá (TU Delft - Wind Energy)

Francesco Castellani (Università degli Studi di Perugia)

S.J. Watson (TU Delft - Wind Energy)

Research Group
Wind Energy
DOI related publication
https://doi.org/10.5194/wes-10-2563-2025
More Info
expand_more
Publication Year
2025
Language
English
Research Group
Wind Energy
Issue number
11
Volume number
10
Pages (from-to)
2563–2576
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Wind turbine supervisory control and data acquisition (SCADA) datasets available for research usually contain a limited number of failure events. This limitation hinders the successful application of deep learning (DL) methods for fault detection and prognosis, as they require large datasets for robust training and generalisation. This work proposes a method using conditional generative adversarial networks (cGANs) to generate synthetic SCADA time series that replicate wind turbine behaviour under controllable operational, environmental, and degradation conditions. Given a set of SCADA time series representing these conditions, the cGAN generates temperature and pressure time series simulating gearbox operation. Results show that augmenting the training set of an artificial neural network (ANN) fault detection model with synthetic time series reduces false positives in the detected gearbox faults by 84 % on average, enabling the model to blindly detect a fault in a test wind turbine without prior knowledge of the event. Furthermore, training a convolutional autoencoder-based unsupervised health indicator (HI) model with both real and synthetic SCADA time series leads to an HI that more accurately captures the expected degradation trend. Using this HI, the gearbox's remaining useful life (RUL) can be predicted within the defined error bounds from around 4.5 months before the detection of the fault, while the HI obtained without the synthetic data fails to produce reliable RUL estimations.