MURAL - Maynooth University Research Archive Library



    Monthly streamflow forecasting by machine learning methods using dynamic weather prediction model outputs over Iran


    Akbarian, Mohammad, Saghafian, Bahram and Golian, Saeed (2023) Monthly streamflow forecasting by machine learning methods using dynamic weather prediction model outputs over Iran. Journal of Hydrology, 620 B (129480). pp. 1-23. ISSN 0022-1694

    Abstract

    Seasonal hydrological forecasts play a critical role in water resources management. The Copernicus Climate Change Service (C3S) data store provides open access to monthly hydrological forecasts for up to six-months. This study aims to evaluate, for the first time, 1- to 3-month runoff forecasts using the European Centre for Medium-Range Weather Forecasts (ECMWF) ensembles of precipitation, runoff, and temperature in 1981–2015 period over a total of 30 s-level basins in Iran. We adopted the 5th, 50th and 95th ECMWF ensemble quantiles for each variable that represent low, medium and high probability of occurrence, respectively. Pearson correlation analysis (Pca), Recursive Feature Elimination (RFE) via random forest (RF) model, and Bayesian Networks (BN) feature selection algorithms were used in order to reduce input variable dimension and select potential predictors to be fed to the machine learning models. Multiple Linear Regression (MLR), Artificial Neural Networks (ANN), Support Vector Regression (SVR), Random Forest (RF), and eXtreme Gradient Boosting (XGBoost) machine learning models were used with Repeated K-Fold cross validation (rK-Fold CV) while model efficiency was evaluated using modified Kling-Gupta efficiency coefficient (KGE’), Nash-Sutcliffe Efficiency coefficient (NSE), and Normalized Root Mean Square Error (NRMSE). Results of this study revealed that C3S runoff ensembles have the highest impact on forecast accuracy of streamflow, followed by precipitation and temperature. Overall, model performance yield a best-to-worst ranking of ANN, XGBoost, RF, MLR, and SVR with KGE’ values of 0.70, 0.68, 0.66, 0.57, and 0.41, respectively. The predictive performance of all models decreased with lead times beyond 1-month, where ANN and XGBoost outperformed other models with KGE’ of 0.65 for 2-month lead time and 0.60 for 3-month lead time. The three superior models of XGBoost, ANN, and RF, were employed with RFE and BN FSAs most frequently across Iran’s 30 s level basins in all lead times. Almost all models in the arid central region of Iran showed the lowest performance while highest skills were achieved in the western regions of Iran. Finally, for all models and over all regions, the model performance reduced by increase in lead-time.
    Item Type: Article
    Keywords: Streamflow forecast; C3S data store; ECMWF; Ensemble; Recursive; Feature Elimination; RFE; Bayesian Networks; BN; Machine learning; ML;
    Academic Unit: Faculty of Social Sciences > Geography
    Faculty of Social Sciences > Research Institutes > Irish Climate Analysis and Research Units, ICARUS
    Item ID: 20832
    Identification Number: 10.1016/j.jhydrol.2023.129480
    Depositing User: IR Editor
    Date Deposited: 11 Nov 2025 16:25
    Journal or Publication Title: Journal of Hydrology
    Publisher: Elsevier
    Refereed: Yes
    Related URLs:
    Use Licence: This item is available under a Creative Commons Attribution Non Commercial Share Alike Licence (CC BY-NC-SA). Details of this licence are available here

    Downloads

    Downloads per month over past year

    Origin of downloads

    Altmetric Badge

    Repository Staff Only (login required)

    Item control page
    Item control page