You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
> **A list of over 60 metrics, statistical techniques and data processing tools contained in `scores` is [available here](https://scores.readthedocs.io/en/stable/included.html).**
7
7
8
8
`scores` is a Python package containing mathematical functions for the verification, evaluation and optimisation of forecasts, predictions or models. It supports labelled n-dimensional (multidimensional) data, which is used in many scientific fields and in machine learning. At present, `scores` primarily supports the geoscience communities; in particular, the meteorological, climatological and oceanographic communities.
Below is a **curated selection** of the metrics, tools and statistical tests included in `scores`. [(Click here for the full list.)](https://scores.readthedocs.io/en/stable/included.html)
18
21
19
22
||**Description**|**Selection of Included Functions**|
|**[Probability](https://scores.readthedocs.io/en/stable/included.html#probability)**|Scores for evaluating forecasts that are expressed as predictive distributions, ensembles, and probabilities of binary events. |Brier Score, Continuous Ranked Probability Score (CRPS) for Cumulative Density Functions (CDF) and ensembles (including threshold weighted versions), and Isotonic Regression (reliability diagrams). |
23
-
|**[Categorical](https://scores.readthedocs.io/en/stable/included.html#categorical)**|Scores for evaluating forecasts of categories. |18 binary contingency table (confusion matrix) metrics, the FIxed Risk Multicategorical (FIRM) Score, and the SEEPS score.|
24
-
|**[Spatial](https://scores.readthedocs.io/en/stable/included.html#spatial)**|Scores that take into account spatial structure. |Fractions Skill Score. |
25
-
|**[Statistical Tests](https://scores.readthedocs.io/en/stable/included.html#statistical-tests)**|Tools to conduct statistical tests and generate confidence intervals. |Diebold Mariano. |
26
-
|**[Processing Tools](https://scores.readthedocs.io/en/stable/included.html#processing-tools-for-preparing-data)**|Tools to pre-process data. |Data matching, Discretisation, Block Bootstrapping, and Cumulative Density Function Manipulation. |
27
-
|**[Plotting Data](https://scores.readthedocs.io/en/stable/included.html#plotting-data)**|Tools to generate data for plotting. |ROC curves, Murphy diagrams, and Q-Q plots. |
28
-
|**[Emerging](https://scores.readthedocs.io/en/stable/included.html#emerging)**|Emerging scores that are still undergoing mathematical peer review. They may change in line with the peer review process. |Risk Matrix Score. |
24
+
|**[Continuous](https://scores.readthedocs.io/en/stable/included.html#continuous)**|Scores for evaluating single-valued continuous forecasts. |E.g. MAE, MSE, RMSE, Bias, Pearson's Correlation Coefficient, Kling-Gupta Efficiency, NSE, Flip-Flop Index, Quantile Loss, Quantile Interval Score, Interval Score, and threshold weighted scores for expectiles, quantiles and Huber Loss.[See all.](https://scores.readthedocs.io/en/stable/included.html#continuous)|
25
+
|**[Probability](https://scores.readthedocs.io/en/stable/included.html#probability)**|Scores for evaluating forecasts that are expressed as predictive distributions, ensembles, and probabilities of binary events. |E.g. Brier Score, CRPS for CDFs and ensembles (including threshold weighted versions), and Isotonic Regression (reliability diagrams).[See all.](https://scores.readthedocs.io/en/stable/included.html#probability)|
26
+
|**[Categorical](https://scores.readthedocs.io/en/stable/included.html#categorical)**|Scores for evaluating forecasts of categories. |E.g. 18 binary contingency table (confusion matrix) metrics, the FIxed Risk Multicategorical (FIRM) Score, the SEEPS score and the Risk Matrix Score. [See all.](https://scores.readthedocs.io/en/stable/included.html#categorical)|
27
+
|**[Spatial](https://scores.readthedocs.io/en/stable/included.html#spatial)**|Scores that take into account spatial structure. |Fractions Skill Score. [See all.](https://scores.readthedocs.io/en/stable/included.html#spatial)|
28
+
|**[Statistical Tests](https://scores.readthedocs.io/en/stable/included.html#statistical-tests)**|Tools to conduct statistical tests and generate confidence intervals. |Diebold Mariano. [See all.](https://scores.readthedocs.io/en/stable/included.html#statistical-tests)|
29
+
|**[Processing Tools](https://scores.readthedocs.io/en/stable/included.html#processing-tools-for-preparing-data)**|Tools to pre-process data. |E.g. Data matching, Discretisation, Block Bootstrapping, and Cumulative Density Function Manipulation.[See all.](https://scores.readthedocs.io/en/stable/included.html#processing-tools-for-preparing-data)|
30
+
|**[Plotting Data](https://scores.readthedocs.io/en/stable/included.html#plotting-data)**|Tools to generate data for plotting. |ROC curves, Murphy diagrams, and Q-Q plots. [See all.](https://scores.readthedocs.io/en/stable/included.html#plotting-data)|
31
+
|**[Emerging](https://scores.readthedocs.io/en/stable/included.html#emerging)**|Emerging scores that are still undergoing mathematical peer review. They may change in line with the peer review process. |*Note - the Risk Matrix Score has recently been moved to 'categorical' following peer-reviewed publication*.|
29
32
30
33
`scores` not only includes common scores (e.g., MAE, RMSE), it also includes novel scores not commonly found elsewhere (e.g., FIRM, Flip-Flop Index), complex scores (e.g., threshold weighted CRPS), and statistical tests (e.g., the Diebold Mariano test). Additionally, it provides pre-processing tools for preparing data for scores in a variety of formats including cumulative distribution functions (CDF). `scores` provides its own implementations where relevant to avoid extensive dependencies.
31
34
32
-
`scores` primarily supports xarray datatypes for Earth system data allowing it to work with NetCDF4, HDF5, Zarr and GRIB data formats among others. `scores` uses Dask for scaling and performance. Some metrics work with pandas and we aim to expand this capability.
35
+
`scores` primarily supports xarray datatypes for Earth system data allowing it to work with NetCDF4, HDF5, Zarr and GRIB data formats among others. `scores` uses Dask for scaling and performance. Some metrics work with pandas and we aim to expand this capability.
33
36
34
37
All of the scores and metrics in this package have undergone a thorough scientific and software review. Every score has a companion Jupyter Notebook tutorial that demonstrates its use in practice.
35
38
36
39
## Contributing
37
-
To find out more about contributing, see our [contributing guide](https://scores.readthedocs.io/en/stable/contributing.html).
40
+
41
+
Contributions from the community are warmly welcomed. To find out more, see our [contributing guide](https://scores.readthedocs.io/en/stable/contributing.html).
38
42
39
43
All interactions in discussions, issues, emails and code (e.g., pull requests, code comments) will be managed according to the expectations outlined in the [ code of conduct ](https://github.com/nci/scores/blob/main/CODE_OF_CONDUCT.md) and in accordance with all relevant laws and obligations. This project is an inclusive, respectful and open project with high standards for respectful behaviour and language. The code of conduct is the Contributor Covenant, adopted by over 40,000 open source projects. Any concerns will be dealt with fairly and respectfully, with the processes described in the code of conduct.
40
44
@@ -46,7 +50,7 @@ The [installation guide](https://scores.readthedocs.io/en/stable/installation.ht
46
50
47
51
```bash
48
52
# From a local checkout of the Git repository
49
-
pip install -e .[all]
53
+
pip install -e ".[all]"
50
54
```
51
55
**To install the mathematical functions ONLY** (no tutorial dependencies, no developer libraries), use the default *minimal* installation option. *minimal* is a stable version with limited dependencies. This can be installed from the [Python Package Index (PyPI)](https://pypi.org/project/scores/) or with [conda](https://anaconda.org/conda-forge/scores).
52
56
@@ -73,7 +77,7 @@ Here is a short example of the use of `scores`:
73
77
<xarray.DataArray ()>
74
78
array(2.)
75
79
```
76
-
[Jupyter Notebook tutorials](https://scores.readthedocs.io/en/stable/tutorials/Tutorial_Gallery.html) are provided for each metric and statistical test in `scores`, as well as for some of the key features of `scores` (e.g., [dimension handling](https://scores.readthedocs.io/en/stable/tutorials/Dimension_Handling.html) and [weighting results](https://scores.readthedocs.io/en/stable/tutorials/Weighting_Results.html)).
80
+
[Jupyter Notebook tutorials](https://scores.readthedocs.io/en/stable/tutorials/Tutorial_Gallery.html) are provided for each metric and statistical test in `scores`, as well as for some of the key features of `scores` (e.g., [dimension handling](https://scores.readthedocs.io/en/stable/tutorials/Dimension_Handling.html) and [weighting results](https://scores.readthedocs.io/en/stable/tutorials/Weighting_Results.html)).
77
81
78
82
To watch a PyCon AU 2024 conference presentation about `scores`[click here](https://youtu.be/jyq2jOqtXe0?si=HYoW1cNiplbb3R0c).
0 commit comments