A geostatistical spatially varying coefficient model for mean annual runoff that incorporates process-based simulations and short records
Peer reviewed, Journal article
Published version
Date
2022Metadata
Show full item recordCollections
- Institutt for matematiske fag [2558]
- Publikasjoner fra CRIStin - NTNU [38689]
Original version
Hydrology and Earth System Sciences. 2022, 26, 5391-5410. 10.5194/hess-26-5391-2022Abstract
We present a Bayesian geostatistical model for mean annual runoff that incorporates simulations from a process-based hydrological model. The simulations are treated as a covariate and the regression coefficient is modeled as a spatial field. This way the relationship between the covariate (simulations from a hydrological model) and the response variable (observed mean annual runoff) can vary in the study area. A preprocessing step for including short records in the modeling is also suggested. We thus obtain a model that can exploit several data sources. By using state-of-the-art statistical methods, fast inference is achieved.
The geostatistical model is evaluated by estimating mean annual runoff for the period 1981–2010 for 127 catchments in Norway based on observations from 411 catchments. Simulations from the process-based HBV model on a 1×1 km grid are used as input. We found that on average the proposed approach outperformed a purely process-based approach (HBV) when predicting runoff for ungauged and partially gauged catchments. The reduction in RMSE compared to the HBV model was 20 % for ungauged catchments and 58 % for partially gauged catchments, where the latter is due to the preprocessing step. For ungauged catchments the proposed framework also outperformed a purely geostatistical method with a 10 % reduction in RMSE compared to the geostatistical method. For partially gauged catchments, however, purely geostatistical methods performed equally well or slightly better than the proposed combination approach. In general, we expect the proposed approach to outperform geostatistics in areas where the data availability is low to moderate.