Predicting streamflow with LSTM networks using global datasets

Journal Article (2023)
Author(s)

K. Wilbrand (TU Delft - Water Resources)

Riccardo Taormina (TU Delft - Sanitary Engineering)

Marie-Claire Ten Veldhuis (TU Delft - Water Resources)

Martijn Visser (Deltares)

Markus Hrachowitz (TU Delft - Water Resources)

Jonathan Nuttall (Deltares)

Ruben J. Dahm (Deltares)

Research Group
Water Resources
Copyright
© 2023 K. Wilbrand, R. Taormina, Marie-claire ten Veldhuis, Martijn Visser, M. Hrachowitz, Jonathan Nuttall, Ruben Dahm
DOI related publication
https://doi.org/10.3389/frwa.2023.1166124
More Info
expand_more
Publication Year
2023
Language
English
Copyright
© 2023 K. Wilbrand, R. Taormina, Marie-claire ten Veldhuis, Martijn Visser, M. Hrachowitz, Jonathan Nuttall, Ruben Dahm
Research Group
Water Resources
Volume number
5
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Streamflow predictions remain a challenge for poorly gauged and ungauged catchments. Recent research has shown that deep learning methods based on Long Short-Term Memory (LSTM) cells outperform process-based hydrological models for rainfall-runoff modeling, opening new possibilities for prediction in ungauged basins (PUB). These studies usually feature local datasets for model development, while predictions in ungauged basins at a global scale require training on global datasets. In this study, we develop LSTM models for over 500 catchments from the CAMELS-US data base using global ERA5 meteorological forcing and global catchment characteristics retrieved with the HydroMT tool. Comparison against an LSTM trained with local datasets shows that, while the latter generally yields superior performances due to the higher spatial resolution meteorological forcing (overall median daily NSE 0.54 vs. 0.71), training with ERA5 results in higher NSE in most catchments of Western and North-Western US (median daily NSE of 0.83 vs. 0.78). No significant changes in performance occur when substituting local with global data sources for deriving the catchment characteristics. These results encourage further research to develop LSTM models for worldwide predictions of streamflow in ungauged basins using available global datasets. Promising directions include training the models with streamflow data from different regions of the world and with higher quality meteorological forcing.