-
Delaigue Olivier authored7dff15da
title: "airGR -- Overview"
output: rmarkdown::html_vignette
vignette: >
%\VignetteEngine{knitr::rmarkdown}
%\VignetteIndexEntry{airGR}
%\VignetteEncoding{UTF-8}
Introduction
airGR is a package which brings into the R software the hydrological modelling tools used and developed at the Catchment Hydrology Team at Irstea (France), including the GR rainfall-runoff models and a snowmelt and accumulation model, CemaNeige. Each model core is coded in Fortran to ensure low computational time. The other package functions (i.e. mainly the calibration algorithm and the efficiency criteria calculation) are coded in R.
The airGR package has been designed to fulfil two major requirements: to facilitate the use by non-expert users and to allow flexibility regarding the addition of external criteria, models or calibration algorithms. The names of the functions and their arguments were chosen to this end. airGR also contains basics plotting facilities.
Six hydrological models and one snowmelt and accumulation model are implemented in airGR. The snow model can be used alone or together with the daily hydrological models.
The models can be called within airGR using the following functions:
-
RunModel_GR4H()
: four-parameter hourly lumped hydrological model [@mathevet_quels_2005] -
RunModel_GR4J()
: four-parameter daily lumped hydrological model [@perrin_improvement_2003] -
RunModel_GR5J()
: five-parameter daily lumped hydrological model [@le_moine_bassin_2008] -
RunModel_GR6J()
: six-parameter daily lumped hydrological model [@pushpalatha_downward_2011] -
RunModel_GR2M()
: two-parameter monthly lumped hydrological model [@mouelhi_vers_2003; @mouelhi_stepwise_2006] -
RunModel_GR1A()
: one-parameter yearly lumped hydrological model [@mouelhi_vers_2003; @mouelhi_linking_2006] -
RunModel_CemaNeige()
: two-parameter degree-day snowmelt and accumulation model [@valery_as_2014] -
RunModel_CemaNeigeGR4J()
: combined use of GR4J and CemaNeige -
RunModel_CemaNeigeGR5J()
: combined use of GR5J and CemaNeige -
RunModel_CemaNeigeGR6J()
: combined use of GR6J and CemaNeige
The GRP forecasting model and the Otamin predictive uncertainty tool are not availabe in airGR.
Loading data
In the following example, we use a data sample contained in the package. For real applications, the user has to import its data into R and to prepare it with an adequate data.frame format as described below.
First, it is necessary to load the airGR package:
library(airGR)
This is an example of a data.frame
of hydrometeorological observations time series for a fictional catchment included in the airGR package that contains:
- DatesR: dates in the POSIXt format
- P: average precipitation [mm/day]
- T: catchment average air temperature [℃]
- E: catchment average potential evapotranspiration [mm/day]
- Qls: outlet discharge [l/s]
- Qmm: outlet discharge [mm/day]
data(L0123001)
summary(BasinObs)
The usual functions (e.g. read.table()
) can be used to load real-case datasets.
Preparation of functions inputs
To run a model, the functions of the airGR package (e.g. the models, calibration and criteria calculation functions) require data and options with specific formats.
To facilitate the use of the package, there are several functions dedicated to the creation of these objects:
-
CreateInputsModel()
: prepares the inputs for the different hydrological models (times series of dates, rainfall, observed streamflow, etc.) -
CreateRunOptions()
: prepares the options for the hydrological model run (warm-up period, calibration period, etc.) -
CreateInputsCrit()
: prepares the options in order to compute efficiency criterions (choice of the criterion, choice of the transformation on streamflows: "log", "root", etc.) -
CreateCalibOptions()
: prepares the options for the hydrological model calibration algorithm (choice of parameters to optimize, predefined values for uncalibrated parameters, etc.)
InputsModel object
To run a GR hydrologic model, the user has to prepare the input data with the CreateInputsModel()
function.
As arguments, this function needs the function name corresponding to the model the user wants to run, a vector of dates, a vector of precipitation and a vector of potential evapotranspiration.
In the example below, we already have the potential evapotranspiration. If the user doesn't have these data, it is possible to compute it with the Oudin's formula with the PEdaily_Oudin()
function (this function only needs julian days, daily average air temperature and latitude).
Missing values (NA
) of precipitation (or potential evapotranspiration) are not allowed.
InputsModel <- CreateInputsModel(FUN_MOD = RunModel_GR4J, DatesR = BasinObs$DatesR,
Precip = BasinObs$P, PotEvap = BasinObs$E)
str(InputsModel)
RunOptions object
The CreateRunOptions()
function allows to prepare the options required to the RunModel*()
functions, which are the actual models functions.
The user must at least define the following arguments:
-
FUN_MOD
: the name of the model function to run -
InputsModel
: the associated inputs data -
IndPeriod_Run
: the period on which the model is run
To select a period for which the user wants to run the model, select the corresponding indexes for different time periods (not the POSIXt dates), as follows:
Ind_Run <- seq(which(format(BasinObs$DatesR, format = "%d/%m/%Y %H:%M")=="01/01/1990 00:00"),
which(format(BasinObs$DatesR, format = "%d/%m/%Y %H:%M")=="31/12/1999 00:00"))
str(Ind_Run)
The intialization of hydrological models is of the utmost importance. Indeed, an inaccurate intialisation causes poor quality streamflow simulations during the earliest stages of the running period. For example, in the GR models, the reservoirs levels are by default set to 50 % of their capacity, which may be far from their ideal value. Two solutions are offered to accurately initialize the GR models in airGR: manually predefining the initial states or running the models during a warm up period before the actual running period. It is generally advised to set up this warm up period to be equal or superior to one year.
As a consequence, it is possible to define in CreateRunOptions()
the following arguments:
-
IniStates
: the initial states of the 2 unit hydrographs (20 + 40 = 60 units) -
IniResLevels
: the initial levels of the production and routing stores -
IndPeriod_WarmUp
: the warm-up period used to run the model, to be defined in the same format asIndPeriod_Run
RunOptions <- CreateRunOptions(FUN_MOD = RunModel_GR4J,
InputsModel = InputsModel, IndPeriod_Run = Ind_Run,
IniStates = NULL, IniResLevels = NULL, IndPeriod_WarmUp = NULL)
str(RunOptions)
The CreateRunOptions()
function returns warnings if the default initialization options are used:
-
IniStates
andIniResLevels
are automatically set to initialize all the model states at 0, except for the production and routing stores which are initialised at 50 % of their capacity -
IndPeriod_WarmUp
default setting ensures a one-year warm-up using the time steps preceding theIndPeriod_Run
, if available
InputsCrit object
The CreateInputsCrit()
function allows to prepare the input in order to calculate a criterion. It is possible to define the following arguments:
-
FUN_CRIT
: the name of the error criterion function (they are introduced later on) -
InputsModel
: the inputs of the hydrological model previously prepared by theCeateInputsModel()
function -
RunOptions
: the options of the hydrological model previously prepared by theCreateRunOptions()
function -
Qobs
: the observed streamflows expressed in mm/time step
Missing values (NA
) are allowed for observed streamflows.
InputsCrit <- CreateInputsCrit(FUN_CRIT = ErrorCrit_NSE, InputsModel = InputsModel,
RunOptions = RunOptions, Qobs = BasinObs$Qmm[Ind_Run])
str(InputsCrit)
CalibOptions object
The user needs to prepare the calibration options with the CreateCalibOptions()
function. For that, it is necessary to define the following arguments:
-
FUN_MOD
: the name of the model function -
FUN_CALIB
: the name of the calibration algorithm
CalibOptions <- CreateCalibOptions(FUN_MOD = RunModel_GR4J, FUN_CALIB = Calibration_Michel)
str(CalibOptions)
Criteria
The evaluation of the quality of a simulation is estimated through the calculation of criteria. These criteria can be used both as objective-functions during the calibration of the model, or as a measure for evaluating its control performance.
The package offers the possibility to use different criteria:
-
ErrorCrit_RMSE()
: Root mean square error (RMSE) -
ErrorCrit_NSE()
: Nash-Sutcliffe model efficiency coefficient (NSE) -
ErrorCrit_KGE()
: Kling-Gupta efficiency criterion (KGE) -
ErrorCrit_KGE2()
: modified Kling-Gupta efficiency criterion (KGE')
It is also possible to create user-defined criteria. For doing that, it is only necessary to define the function in R following the same syntax that the criteria functions included in airGR.
Calibration
The objective of the calibration algorithm is to identify the model parameters: by comparing the model outputs with observed data, this algorithm determines the combination of parameters that represents the best the behavior of the watershed.
In the airGR package, a function called Calibration_Michel()
is implemented. This functions allows running a calibration with the package models.
The calibration algorithm optimizes the error criterion selected as objective-function. This algorithm works in two steps:
- a screening of the parameters space is performed using either a rough predefined grid or a user-defined list of parameter sets
- a simple steepest descent local search algorithm is performed from the best set of parameters found at the first step
OutputsCalib <- Calibration_Michel(InputsModel = InputsModel, RunOptions = RunOptions,
InputsCrit = InputsCrit, CalibOptions = CalibOptions,
FUN_MOD = RunModel_GR4J, FUN_CRIT = ErrorCrit_NSE)
Param <- OutputsCalib$ParamFinalR
Param
The Calibration_Michel()
function is the only one implemented in the airGR package to calibrate the model, but the user can implement its own calibration function.
This function returns a vector with the parameters of the chosen model, which means that the number of values can differ depending on the model that is used. It is possible to use the Calibration_Michel()
function with user-implemented hydrological models.
Validation
This step assess the predictive capacity of the model. Validation is defined as the estimation of the accuracy of the model on datasets that are not used in its construction, and in particular its calibration. The classical way to perform a validation is to keep data from a period separated from the calibration period. If possible, this control period should correspond to climatic situations rather that differ from those of the calibration period in order to better point out the qualities and weakness of the model. This exercise is necessary for assessing the robustness of the model, that is to say its ability to keep stable performances outside the calibration conditions.
Performing a model validation with airGR is similar to running a simulation (see below).
Simulation
Simulation run
To run a model, the user has to use the RunModel*()
functions (InputsModel
, RunOptions
and parameters).
All the data needed have already been prepared.
OutputsModel <- RunModel_GR4J(InputsModel = InputsModel, RunOptions = RunOptions, Param = Param)
str(OutputsModel)
Results preview
Although it is possible for the user to design its own graphics from the outputs of the RunModel*()
functions, the airGR package offers the possibility to make use of the plot.OutputsModel()
function (or plotl()
with a OutputsModel
object). This function returns a dashboard of results including various graphs (depending on the model used):
- time series of total precipitation and simulated streamflows (and observed streamflows if provided)
- interannual median monthly simulated streamflows (and monthly observed streamflows if provided)
- cumulative frequency plot for simulated streamflows (and for observed streamflows if provided)
- correlation plot between simulated and observed streamflows (if observed streamflows provided)
plot(OutputsModel, Qobs = BasinObs$Qmm[Ind_Run])
Moreover, if the CemaNeige model is used, the temperature and the simulated snowpack time series are plotted.
Efficiency criterion
To evaluate the efficiency of the model, it is possible to use the same criterion as defined at the calibration step or to use an other criterion.
OutputsCrit <- ErrorCrit_NSE(InputsCrit = InputsCrit, OutputsModel = OutputsModel)
OutputsCrit <- ErrorCrit_KGE(InputsCrit = InputsCrit, OutputsModel = OutputsModel)