Commits · 72982b425305bb719cfd5b0ae4dd80c9827f5127 · HYCAR-Hydro / evalhyd / evalhyd-cpp

13 Jan, 2023 2 commits

remove unfinished unit tests on contingency table · 72982b42
Thibault Hallouin authored 2 years ago

72982b42

add parameter to define the kind of streamflow exceedance events · f3b60cd2

while Brier-based scores are symmetric, Contigency Table-based metrics
are not, so that a definition how what an "event" means is required.
A new optional parameter *events* is added to `evalp` (taking either
"high" or "low" as value).

f3b60cd2

11 Jan, 2023 1 commit
- add licensing information · a81fd6b6
  Thibault Hallouin authored 2 years ago
  
  a81fd6b6
04 Jan, 2023 1 commit
- add default type in template for temporal mask · 193c7882
  Thibault Hallouin authored 2 years ago
```
so that specifying the template arguments become optional
```
  193c7882
27 Dec, 2022 2 commits
- add missing opening/closing curly brackets in for/if-else structures · 9fc694bd
  Thibault Hallouin authored 2 years ago
  
  9fc694bd
- fix type mismatch for loop on integer type · 8cec54a3
  Thibault Hallouin authored 2 years ago
  
  8cec54a3
21 Dec, 2022 1 commit
- use xexpression in evalp signature · 153df545
  Thibault Hallouin authored 2 years ago
  
  153df545
01 Dec, 2022 3 commits
- fix paths to datetime files for bootstrap tests · 67ada20f
  Thibault Hallouin authored 2 years ago
  
  67ada20f
- fix error with gcc on CI: missing explicit type · 56401a74
  Thibault Hallouin authored 2 years ago
  
  56401a74
- fix headers and data in tests · 969c8306
  Thibault Hallouin authored 2 years ago
  
  969c8306
20 Oct, 2022 1 commit

pave the way for summary statistics on bootstrap samples · d37e69d2

Thibault Hallouin authored 2 years ago

Ultimately, the objective is for the user to be able to get the raw
sampled metric values, or the mean and standard deviation of the sampled
metric values, or a series of quantiles of the sampled metric values.
There are still problems with the standard deviation on rtensor, and
the computation of the quantiles does not work on n-dim expressions yet.
So the second and third options are not possible yet, so only the raw
values can be returned. Nonetheless, the machinery and the choice of
where to introduce the summary functionality could be implemented,
which is the purpose of this commit. A new parameter of the bootstrap
experiment called "summary" is added: it can be given a value of 0 (to
get the raw values). In the future, it would also take a value of 1 for
mean+std, and 2 for quantiles.

d37e69d2

06 Oct, 2022 1 commit

implement bootstrapping method for metric uncertainty estimation · 16ce8f4e

Thibault Hallouin authored 2 years ago

The bootstrapping method is based on a non-overlapping block sampling
with replacement, where the blocks are years of data. The number of
samples and the sample length (i.e the number of year blocks) are both
customisable.

The method is accessible both for deterministic and probabilistic
evaluation where a new axis is added. For now, the metrics for all the
samples are returned, but in the future, some summary statistics would
be implemented (e.g. quantiles or mean/standard deviation).

/!\ For determinist evaluation, the n-dimensional functionality became
    untenable such that the number of dimensions was fixed and
    restricted to 2D tensors.

New unit tests are included to test both the bootstrapping generator
and the numerical results obtained with the bootstrapping turned on.

16ce8f4e

30 Sep, 2022 1 commit
- refactor data reading into separate function in unit tests · d994478a
  Thibault Hallouin authored 2 years ago
  
  d994478a
15 Sep, 2022 1 commit
- add unittest for new mean/median/quantile# masking conditions · fd1d0485
  Thibault Hallouin authored 2 years ago
  
  fd1d0485
13 Sep, 2022 1 commit

allow masking conditions to be specified on predictions · f31664dd

Thibault Hallouin authored 2 years ago

An earlier implementation of the masking conditions assumed that the
conditions on streamflow would only be on the observations, but this is
not always the case. For example, reliability scores cannot be done on
the observed streamflow and need to be performed on the predicted
streamflow. So this is now possible as the condition syntax is changed
and now *q_obs*/*q_prd_median*/*q_prd_mean* in place of *q*.

f31664dd

12 Sep, 2022 1 commit
- fix mistake in calculation of CRPS · fff30851
  Thibault Hallouin authored 2 years ago
  
  fff30851
02 Sep, 2022 1 commit

change masking conditions parameter type for strings · bb555aee

Thibault Hallouin authored 2 years ago

In order to be an accessible parameter type for Python (and hopefully
R in the future) bindings, `std::string` needed to be replaced with
`std::array<char, 32>`, a fixed-length string type.

bb555aee

31 Aug, 2022 1 commit

implement functionality to generate temporal masks from conditions · b13d2f21

Thibault Hallouin authored 2 years ago

This functionality is inherited from `evalhyd-cli`. It allows the user
to provide conditions as strings to specify how to generate temporal
subsets. Conditions can be based on observed streamflow values (e.g.
q>800, q<=120) or on time indices (e.g. to select particular events).

This functionality is made available both for determinist and
probabilist evaluation, unlike in `evalhyd-cli` where it was only
available for probabilist evaluation.

This is documented in the docstrings, and new unit tests are written.

b13d2f21

19 Aug, 2022 2 commits

deal with missing data flagged as NaN in observations/predictions · 397501ad

Thibault Hallouin authored 2 years ago

The general approach is to "eliminate" the time steps where observations
or predictions are missing as early as possible in the algorithm. The
best approach seemed to update the user-provided temporal masks to
also mask those time steps with missing data.

An alternative approach would have been to create a view on the
observations and predictions, e.g. using something like
`xt::view(obs, ..., xt::drop(...))`, but this produces a non-contiguous
view which cannot be sorted with `xt::sort` later to determine the
quantiles.

This is documented in `evalp` docstring and new unit tests are added.

397501ad

fix typo in unit test on masking · 41a30d84
Thibault Hallouin authored 2 years ago
```
resulting in only checking the first metric (i.e. BS) repeatedly
```
41a30d84

10 Aug, 2022 1 commit

add unittest to check NaN assignment works · a2957cb3

Thibault Hallouin authored 2 years ago

since `xt::allclose` does not have a *equal_nan* like `xt::isclose`
(see https://github.com/xtensor-stack/xtensor/issues/1995), the
check is a bit more convoluted than before...

a2957cb3

08 Aug, 2022 2 commits

add "sites" axis to thresholds dimensions · 1fb4a03f

Thibault Hallouin authored 2 years ago

different thresholds may be required of different sites, e.g.
if based on streamflow statistics, which are intrinsically site-specific

1fb4a03f

add "sites" axis to temporal mask dimensions · a28b9b56

Thibault Hallouin authored 2 years ago

different temporal subsets may be required of different sites, e.g.
if based on streamflow statistics, which are intrinsically site-specific

a28b9b56

11 Jul, 2022 1 commit
- add tests with 1D tensors for all deterministic metrics · ced2a5da
  Thibault Hallouin authored 2 years ago
  
  ced2a5da
30 Jun, 2022 2 commits
- add unittest on QS/CRPS · 92de7a26
  Thibault Hallouin authored 2 years ago
  
  92de7a26
- add unittest on masking functionality · 116fcf67
  Thibault Hallouin authored 2 years ago
  
  116fcf67
29 Jun, 2022 1 commit

add dimensions for sites/lead times to probabilistic evaluator · 295b3208

Thibault Hallouin authored 2 years ago

Internally, rather than using the multi-dimensional character of
tensors to compute all sites and all lead times at once, loops are
performed for each site and each lead time, in turn, in order to
minimise memory imprint. Although at the moment, the input tensors are
expected to feature the sites and lead times dimensions. If memory is
an issue, the user can still send smaller tensors with size 1 for those
dimensions and recompose multi-sites/multi-lead times output arrays
externally.

295b3208

15 Jun, 2022 1 commit

restructure files to split publicly distributed and implementation · aedde95c

Thibault Hallouin authored 3 years ago

It seems that good practice for C++ applications is to only include
*public* headers in "./include" folder and keep source files and
*private* headers and implementation source files in "./src".

aedde95c

13 Jun, 2022 1 commit

return squeezed xarray · 91aec5e1

Thibault Hallouin authored 3 years ago

Since the metrics are typically summary statistics, their size is not
very big, so that using xtensor instead of xarray as a data structure is
not as critical as for input data. In turn, using xarray allows for
metrics of different sizes to be returned without unnecessary size 1
dimensions (e.g. when only one threshold is given, or when no temporal
masking is performed). So all output metrics are now returned in their
"natural" shape (e.g. 1D for mono-component metrics, 2D for
multi-component metrics), plus any additional dimension linked to
multi-thresholds, multi-masking, etc.

91aec5e1

10 Jun, 2022 1 commit
- add support for masking to work on temporal subsets · 2fcf04e9
  Thibault Hallouin authored 3 years ago
  
  2fcf04e9
02 Jun, 2022 3 commits

transpose data in input files · e4b61c01

Thibault Hallouin authored 3 years ago

So that the files for the tests look like typical files `evalhyd` would
expect, rather than its transposed version.

e4b61c01

stop distinction simulation/forecast and rename prediction · 92da0a6b
Thibault Hallouin authored 3 years ago

92da0a6b

reorder positional parameters so that metrics are last · 709745a6

Thibault Hallouin authored 3 years ago

This is because in the CLI, metrics is a sequence of unknown length,
so it was only possible to make it last positional parameters. In order
to keep the interfaces harmonised across Python/R/C++ APIs and the CLI,
metrics needed to be put last positional parameter.

709745a6

25 May, 2022 1 commit

harmonise API with R/Python-bindings for functions · 85a4c818

Thibault Hallouin authored 3 years ago

i.e. `evalhyd::determinist::evaluate` becomes `evalhyd::evald`, and
`evalhyd::probabilist::evaluate` becomes `evalhyd::evalp`

85a4c818

24 May, 2022 1 commit
- make other probabilist metrics uppercase · fe0716dd
  Thibault Hallouin authored 3 years ago
  
  fe0716dd
17 May, 2022 1 commit
- add cmake file for tests build · d306ece0
  Thibault Hallouin authored 3 years ago
  
  d306ece0
16 May, 2022 1 commit
- add first unit tests to check score values · d4371768
  Thibault Hallouin authored 3 years ago
  
  d4371768